llama2.c

Author	SHA1	Message	Date
Jani Monoses	2c2b284988	Get vocab_size from token embeddings size	2023-08-26 22:35:55 +03:00
atamyrat	de005474d3	Added load_meta_model() to export.py	2023-08-21 14:13:47 +03:00
Andrej Karpathy	dd61b13e57	delete the save_torchscript export file, but copy its content to the new export.py for the future maybe	2023-08-21 05:09:06 +00:00
atamyrat	0dd82158f6	removed transformers from requirements.txt, added error message	2023-08-21 06:07:29 +03:00
atamyrat	155475a523	Fix WQ and WK permutation in huggingface models	2023-08-21 05:16:11 +03:00
atamyrat	09db52c69e	Added huggingface model loader to export.py	2023-08-21 02:59:12 +03:00
Andrej Karpathy	f3db92a2dc	use out_file.tell() instead of nbytes += arithmetic	2023-08-20 16:51:35 +00:00
Andrej Karpathy	4df5e2e939	make version 1 be the legacy export but with new header. version 2 will be Q8_0 export	2023-08-19 18:51:32 +00:00
Andrej Karpathy	4212bd6d43	oops fix double indent on quantize def	2023-08-19 18:34:49 +00:00
Andrej Karpathy	7f551dbfd7	new model export: versions 0 (legacy) and 1	2023-08-19 18:25:20 +00:00