Author | Commit | Message | Date
Georgi Gerganov | 7d9ed7b25f | Bump memory buffer | 2023-03-11 12:45:01 +02:00
Georgi Gerganov | 007a8f6f45 | Support all LLaMA models + change Q4_0 quantization storage | 2023-03-11 11:28:30 +02:00
Georgi Gerganov | 70bc0b8b15 | Fix a bug in the rope calculation | 2023-03-10 23:46:57 +02:00
Georgi Gerganov | 319cdb3e1f | Final touches | 2023-03-10 21:50:46 +02:00
Georgi Gerganov | 26c0846629 | Initial release | 2023-03-10 20:56:40 +02:00