Commit Graph

10 Commits

Author | SHA1 | Message | Date
Jani Monoses | 2c2b284988 | Get vocab_size from token embeddings size | 2023-08-26 22:35:55 +03:00
atamyrat | de005474d3 | Added load_meta_model() to export.py | 2023-08-21 14:13:47 +03:00
Andrej Karpathy | dd61b13e57 | delete the save_torchscript export file, but copy its content to the new export.py for the future maybe | 2023-08-21 05:09:06 +00:00
atamyrat | 0dd82158f6 | removed transformers from requirements.txt, added error message | 2023-08-21 06:07:29 +03:00
atamyrat | 155475a523 | Fix WQ and WK permutation in huggingface models | 2023-08-21 05:16:11 +03:00
atamyrat | 09db52c69e | Added huggingface model loader to export.py | 2023-08-21 02:59:12 +03:00
Andrej Karpathy | f3db92a2dc | use out_file.tell() instead of nbytes += arithmetic | 2023-08-20 16:51:35 +00:00
Andrej Karpathy | 4df5e2e939 | make version 1 be the legacy export but with new header. version 2 will be Q8_0 export | 2023-08-19 18:51:32 +00:00
Andrej Karpathy | 4212bd6d43 | oops fix double indent on quantize def | 2023-08-19 18:34:49 +00:00
Andrej Karpathy | 7f551dbfd7 | new model export: versions 0 (legacy) and 1 | 2023-08-19 18:25:20 +00:00
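
The idea named in commit f3db92a2dc (use out_file.tell() instead of nbytes += arithmetic) can be illustrated with a minimal sketch. This is not the code from export.py: the function name write_header, the 256-byte header size, the magic number, and the field values are all assumptions chosen for illustration. It only shows why asking the file for its position is less error-prone than keeping a running byte count by hand.

```python
import io
import struct

HEADER_SIZE = 256  # hypothetical fixed header size, for illustration only


def write_header(out_file, magic, version, fields):
    """Write a small binary header, then zero-pad it to HEADER_SIZE bytes.

    Assumes out_file is positioned at offset 0 when called.
    """
    out_file.write(struct.pack("<I", magic))    # 4-byte unsigned magic
    out_file.write(struct.pack("<i", version))  # 4-byte signed version
    out_file.write(struct.pack(f"<{len(fields)}i", *fields))

    # Ask the file where we are instead of summing byte counts manually;
    # the padding stays correct even if fields are added or removed later.
    pad = HEADER_SIZE - out_file.tell()
    assert pad >= 0, "header is larger than HEADER_SIZE"
    out_file.write(b"\0" * pad)
    return out_file.tell()


buf = io.BytesIO()
# Magic and field values below are placeholders, not real model parameters.
end = write_header(buf, 0x616B3432, 1, [288, 768, 6, 6, 6, 32000, 256])
print(end)  # 256
```

With seven 4-byte fields the header body is 36 bytes, so 220 zero bytes of padding are written. A hand-maintained nbytes counter would have to be updated in step with every write, which is the bookkeeping the commit message says was removed.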