Jani Monoses
|
2c2b284988
|
Get vocab_size from token embeddings size
|
2023-08-26 22:35:55 +03:00 |
|
atamyrat
|
de005474d3
|
Added load_meta_model() to export.py
|
2023-08-21 14:13:47 +03:00 |
|
Andrej Karpathy
|
dd61b13e57
|
delete the save_torchscript export file, but copy its content to the new export.py for the future maybe
|
2023-08-21 05:09:06 +00:00 |
|
atamyrat
|
0dd82158f6
|
removed transformers from requirements.txt, added error message
|
2023-08-21 06:07:29 +03:00 |
|
atamyrat
|
155475a523
|
Fix WQ and WK permutation in huggingface models
|
2023-08-21 05:16:11 +03:00 |
|
atamyrat
|
09db52c69e
|
Added huggingface model loader to export.py
|
2023-08-21 02:59:12 +03:00 |
|
Andrej Karpathy
|
f3db92a2dc
|
use out_file.tell() instead of nbytes += arithmetic
|
2023-08-20 16:51:35 +00:00 |
|
Andrej Karpathy
|
4df5e2e939
|
make version 1 be the legacy export but with new header. version 2 will be Q8_0 export
|
2023-08-19 18:51:32 +00:00 |
|
Andrej Karpathy
|
4212bd6d43
|
oops fix double indent on quantize def
|
2023-08-19 18:34:49 +00:00 |
|
Andrej Karpathy
|
7f551dbfd7
|
new model export: versions 0 (legacy) and 1
|
2023-08-19 18:25:20 +00:00 |
|