Commit Graph

302 Commits

Author SHA1 Message Date
Andrej 1bcb2d18d6 Merge pull request #284 from karpathy/feature/customtokenizer
multiquery support add
2023-08-13 12:38:06 -07:00
Andrej Karpathy 38bfac90a8 bigchange: add multiquery support in run.c. we can now train and inference multiquery models (where n_kv_heads < n_heads). this also means that we, in principle, support Llama 2 34B and 70B models, which are multiquery 2023-08-13 19:34:05 +00:00
Andrej b28c1e26c5 Merge pull request #275 from icppWorld/webassembly-internet-computer
Notable fork section for WebAssembly
2023-08-13 10:14:39 -07:00
Andrej 5295cbb821 Merge pull request #281 from lintian06/original_llama2
Update README.md for a new rust port.
2023-08-13 10:14:00 -07:00
Andrej 12dec61fbf Merge pull request #282 from mihainadas/master-1
Fixes https://github.com/karpathy/llama2.c/issues/280
2023-08-13 10:13:08 -07:00
Mihai Nadăș 570789aa04 Fixes https://github.com/karpathy/llama2.c/issues/280
There was a small bug in tinystories.py, described here: https://github.com/karpathy/llama2.c/issues/280

This commit simply passes vocab_size to get_tokenizer_model_path to avoid silent crash when processing shards (in process_shard)
2023-08-13 17:49:10 +03:00
Tian Lin 27adb082f1 Update README.md 2023-08-13 21:58:14 +08:00
Andrej 8b472ded1f Merge pull request #272 from karpathy/feature/customtokenizer
Big Change: Custom Tokenizer training: add the ability to train custom tokenizers instead of using the pretrained Llama 2 tokenizer. This is useful in custom, narrow-domain LLMs because smaller vocab sizes make much smaller, faster, and potentially more capable models. For example, in tinystories a vocab size 4096 custom tokenizer compresses the input text sequences about as well as the Llama 2 tokenizer with vocab size 32000. The result is also "safer" because a badly trained model can't accidentally e.g. output some random chinese character and rapidly go "off the rails" in subsequent tokens.
2023-08-12 20:31:21 -07:00
Andrej Karpathy 9ff459b925 todo changes 2023-08-13 03:24:31 +00:00
Andrej Karpathy 1d14cb8dd8 add note about 4096 vs 32000 token size on tinystories 2023-08-13 03:19:35 +00:00
Andrej Karpathy fe49eb222c readme for custom tokenizers 2023-08-13 03:16:18 +00:00
Andrej Karpathy 9c3cfb46a3 make default be the llama2 tokenizer 2023-08-13 03:08:07 +00:00
Andrej Karpathy 00a61dc7f9 remove the tinyshakespeare dataset until i can bring it back later in a nicer form, otherwise right now we just have a ton of copy paste code here 2023-08-13 02:18:30 +00:00
Andrej Karpathy f5fc0c245f final piece: run.c support for new tokenizer, super ez 2023-08-13 02:12:13 +00:00
Andrej Karpathy ea4cedc588 add ability to export custom tokenizer to .bin format for run.c file 2023-08-13 02:00:19 +00:00
Andrej Karpathy b0cfa2458d ok i can train and sample a model with a custom tokenizer 2023-08-11 16:47:29 +00:00
icpp f96c7afb2d Notable fork section for WebAssembly
Added my repo `icpp-lmm` for running it on the Internet Computer
2023-08-11 10:11:32 -04:00
Andrej Karpathy 4c6f0af9ff add the ability to train a custom sentencepiece tokenizer with a given vocab_size, and pretok with it. some more changes still needed to merge this branch, in train.py and ofc run.c. did this in a sadly bit ugly, but fully backwards compatible way. basically when we use custom tokenizer we create a whole new directory structure for that 2023-08-11 03:58:22 +00:00
Andrej Karpathy c42641205f turn off topp sampling by default because it is a bit too slow to be the default. it is likely that turning it on, e.g. -p 0.9 is midlly higher quality and safer samples, but this comes at a cost of too much performance in double digit percent sometimes, for it to be on by default i think... 2023-08-10 15:23:05 +00:00
Andrej Karpathy 3f69c6cdc4 change the default to use runfast, which imo works just fine 2023-08-10 05:06:49 +00:00
Andrej 5f8068fd43 Merge pull request #260 from madroidmaq/master
Add Jupyter notebook for easier feel the magic
2023-08-09 22:03:36 -07:00
Andrej f60285ee78 Merge pull request #264 from trrahul/master
Added C# port information in readme
2023-08-09 22:00:23 -07:00
Andrej 04121d1b85 Merge pull request #256 from rdentato/patch-rng-seed
Patch rng seed
2023-08-09 21:56:07 -07:00
Rahul TR 256e7f885b Added C# port information in readme 2023-08-09 17:59:47 +05:30
Andrej Karpathy e36e3fb50d Merge branch 'master' of github.com:karpathy/llama2.c 2023-08-09 02:08:37 +00:00
Andrej Karpathy 96873b0274 refine todos section make more concrete and sort 2023-08-09 02:08:33 +00:00
madroid 9713609023 Add Colab GUI: select model/temperature/prompt/etc 2023-08-08 20:29:53 +08:00
madroid 27c5fc76b1 Add Google Colab button 2023-08-08 01:50:19 +08:00
madroid 57ca3c0401 Add run.ipynb for easier feel the magic 2023-08-08 01:32:51 +08:00
rdentato ff6a2f0a7a Reset the #include <omp.h> 2023-08-07 07:28:03 +00:00
rdentato e49c16caa5 Changed how rng_seed is handled. Now 0 is treated as time(NULL). 2023-08-07 06:51:57 +00:00
Remo Dentato 2e5fad83da Merge branch 'karpathy:master' into master 2023-08-07 07:57:42 +02:00
Andrej 3c3b19b14c Merge pull request #242 from tairov/llama2-py
Add a link to simple one file pure Python port
2023-08-06 19:51:30 -07:00
Andrej f4f4cae4cb Merge pull request #241 from danielgrittner/master
add a Rust port
2023-08-06 19:51:13 -07:00
Andrej 09de2cc4ca Merge pull request #250 from npinto/master-1
FIX: model.generate(); forward() only returns logits now.
2023-08-06 18:43:01 -07:00
Nicolas Pinto 98b515e44d FIX: model.generate()
This patch fixes a simple bug in `generate()` due to model's `forward()` only returning logits and not losses since `f2e34e6b0ac55accd6ba930a04c6f683f5158b29`.
2023-08-06 14:48:47 -07:00
rdentato 999b1bf776 Added conditinal include of the OpenMP header. 2023-08-06 21:07:09 +00:00
Aydyn Tairov 2297d158e3 Fix link to a github profile 2023-08-06 21:47:05 +01:00
Daniel Grittner 512f039d5d Merge branch 'master' into master 2023-08-06 19:55:43 +02:00
Aydyn Tairov 6734eaeff5 Rebase chanes to master 2023-08-06 18:47:05 +01:00
Aydyn Tairov 7178facb75 Rebase changes to master 2023-08-06 18:45:47 +01:00
Andrej Karpathy a7a3aa09b8 Merge branch 'master' of github.com:karpathy/llama2.c 2023-08-06 16:33:36 +00:00
Andrej Karpathy 79791f39b4 let's start respecting the BOS token. Don't print it explicitly, and terminate sequence if it appears. This makes sense especially after the recent addition of prompting. Also be careful with timings and making sure they come out right if we exit early in this data-dependent manner 2023-08-06 16:33:23 +00:00
Andrej Karpathy 4e8a3e8d5d fix style issue space with stderr printing 2023-08-06 15:51:58 +00:00
Andrej 7af81ded7e Merge pull request #244 from madroidmaq/master
Update README.md: format notable forks
2023-08-06 08:43:24 -07:00
Andrej a25958fd45 Merge pull request #245 from rdentato/patch-stderr
Errors and info on stderr
2023-08-06 08:42:09 -07:00
Madroid Ma 1f53735d12 Merge branch 'karpathy:master' into master 2023-08-06 18:18:36 +08:00
rdentato 9cfb7efb85 Changed all the printf() for error/info messages so that they print on stderr. 2023-08-06 09:53:02 +00:00
madroid baefaaaf76 Update README.md: add notable forks author's link 2023-08-06 17:42:31 +08:00
Daniel Grittner fcb4cdef8b add a Rust port 2023-08-06 10:44:48 +02:00