9 Commits

Author SHA1 Message Date
David A Roberts 52fe3653e5 Fix vocab_source in sample.py 2023-08-18 18:40:25 +10:00
Andrej Karpathy fe2de68688 fix sample.py from tokenizer changes before 2023-08-15 02:33:01 +00:00
Andrej Karpathy 82ad2ba34e remove tiktoken as dependency 2023-08-14 05:53:57 +00:00
Andrej Karpathy 58075b5ac5 update API of sample.py to be better, small changes here 2023-08-13 20:31:32 +00:00
Andrej Karpathy 38bfac90a8 bigchange: add multiquery support in run.c. we can now train and inference multiquery models (where n_kv_heads < n_heads). this also means that we, in principle, support Llama 2 34B and 70B models, which are multiquery 2023-08-13 19:34:05 +00:00
Andrej Karpathy b0cfa2458d ok i can train and sample a model with a custom tokenizer 2023-08-11 16:47:29 +00:00
Daniel Gross 518524f458 default to whatever system has 2023-07-23 10:41:03 -07:00
Andrej Karpathy 9414e7a45e tweaks and add a simple test 2023-07-23 14:52:08 +00:00
Andrej Karpathy 5b161abb9a somewhere ~20 hours later 2023-07-23 05:23:45 +00:00