Commit Graph

139 Commits

Author SHA1 Message Date
YiMing Han 8607b11ea1 working one 2023-08-18 15:07:41 -04:00
Andrej df6557a10d Merge pull request #267 from krrishnarraj/master
Update readme for openmp on mac
2023-08-15 19:26:34 -07:00
chenyang 2a9a4c4e14 update readme wiht a simple line to introduce llama2.c-zh 2023-08-14 15:12:30 +08:00
chenyang 79900ff68e update readme wiht a simple line to introduce llama2.c-zh 2023-08-14 15:00:33 +08:00
Krishnaraj Bhat eec9ad5a5b Merge remote-tracking branch 'upstream/master' 2023-08-14 12:02:40 +05:30
Andrej bae0bcf484 Small tweaks to Readme intro 2023-08-13 20:03:00 -07:00
Andrej Karpathy 854c97b660 turn topp 0.9 back on by default thanks to recent PR contributions truncating before quicksort 2023-08-14 00:12:45 +00:00
Andrej b51c63b9f2 Merge pull request #283 from wizzard0/wizzard0-mention-1
Add TypeScript port
2023-08-13 14:36:10 -07:00
Andrej Karpathy 8506036185 remove 'revive tests' as a todo from the readme 2023-08-13 21:23:27 +00:00
Andrej Karpathy f0024cfc88 revive tests. now that we have a tiny stories260K model this only requires a 2MB download. phew 2023-08-13 21:22:44 +00:00
Andrej 0805cb2c31 tiny whitespace fix to try to eliminate scrollbar 2023-08-13 13:40:09 -07:00
Andrej b2cce341e0 oops typo fix in readme 2023-08-13 13:39:12 -07:00
Andrej Karpathy 3e989e21f2 link to stories260K model 2023-08-13 20:38:05 +00:00
Andrej Karpathy 58075b5ac5 update API of sample.py to be better, small changes here 2023-08-13 20:31:32 +00:00
Andrej 1bcb2d18d6 Merge pull request #284 from karpathy/feature/customtokenizer
multiquery support add
2023-08-13 12:38:06 -07:00
Andrej Karpathy 38bfac90a8 bigchange: add multiquery support in run.c. we can now train and inference multiquery models (where n_kv_heads < n_heads). this also means that we, in principle, support Llama 2 34B and 70B models, which are multiquery 2023-08-13 19:34:05 +00:00
Andrej b28c1e26c5 Merge pull request #275 from icppWorld/webassembly-internet-computer
Notable fork section for WebAssembly
2023-08-13 10:14:39 -07:00
Oleksandr Nikitin 0e6213c6e0 Mention I can run the full 7B model 2023-08-13 20:02:34 +03:00
Oleksandr Nikitin 1d68a36d14 Add TypeScript port
I've never been so happy to have missed that the JS port already exists :D also it was nice to discover that the JS can reach 80% of the single-threaded C speed (10 tokens/s for TinyStories-110M)
2023-08-13 19:10:07 +03:00
Tian Lin 27adb082f1 Update README.md 2023-08-13 21:58:14 +08:00
Andrej Karpathy 9ff459b925 todo changes 2023-08-13 03:24:31 +00:00
Andrej Karpathy 1d14cb8dd8 add note about 4096 vs 32000 token size on tinystories 2023-08-13 03:19:35 +00:00
Andrej Karpathy fe49eb222c readme for custom tokenizers 2023-08-13 03:16:18 +00:00
icpp f96c7afb2d Notable fork section for WebAssembly
Added my repo `icpp-lmm` for running it on the Internet Computer
2023-08-11 10:11:32 -04:00
Andrej Karpathy c42641205f turn off topp sampling by default because it is a bit too slow to be the default. it is likely that turning it on, e.g. -p 0.9 is midlly higher quality and safer samples, but this comes at a cost of too much performance in double digit percent sometimes, for it to be on by default i think... 2023-08-10 15:23:05 +00:00
Krishnaraj Bhat 46d7a6b6c6 Merge branch 'karpathy:master' into master 2023-08-10 11:06:19 +05:30
Krishnaraj Bhat d45a36cdd2 Update readme for openmp on mac 2023-08-10 10:59:39 +05:30
Andrej 5f8068fd43 Merge pull request #260 from madroidmaq/master
Add Jupyter notebook for easier feel the magic
2023-08-09 22:03:36 -07:00
Rahul TR 256e7f885b Added C# port information in readme 2023-08-09 17:59:47 +05:30
Andrej Karpathy e36e3fb50d Merge branch 'master' of github.com:karpathy/llama2.c 2023-08-09 02:08:37 +00:00
Andrej Karpathy 96873b0274 refine todos section make more concrete and sort 2023-08-09 02:08:33 +00:00
madroid 27c5fc76b1 Add Google Colab button 2023-08-08 01:50:19 +08:00
Andrej 3c3b19b14c Merge pull request #242 from tairov/llama2-py
Add a link to simple one file pure Python port
2023-08-06 19:51:30 -07:00
Aydyn Tairov 2297d158e3 Fix link to a github profile 2023-08-06 21:47:05 +01:00
Daniel Grittner 512f039d5d Merge branch 'master' into master 2023-08-06 19:55:43 +02:00
Aydyn Tairov 6734eaeff5 Rebase chanes to master 2023-08-06 18:47:05 +01:00
Aydyn Tairov 7178facb75 Rebase changes to master 2023-08-06 18:45:47 +01:00
Madroid Ma 1f53735d12 Merge branch 'karpathy:master' into master 2023-08-06 18:18:36 +08:00
madroid baefaaaf76 Update README.md: add notable forks author's link 2023-08-06 17:42:31 +08:00
Daniel Grittner fcb4cdef8b add a Rust port 2023-08-06 10:44:48 +02:00
Andrej Karpathy 8931d5092e add nucleus sampling. it costs lines of code, but i think thit is the default best way to sample, so it is important to have 2023-08-06 07:22:39 +00:00
madroid 8c1f1b280f Update README.md: format notable forks 2023-08-06 14:23:57 +08:00
Andrej Karpathy 0609eb6601 slightly tune todos 2023-08-05 17:13:35 +00:00
Andrej Karpathy dcef5ff7c7 add a bit less embarassing argparse that uses keyword arguments instead of positional arguments 2023-08-05 17:08:11 +00:00
Andrej 9d001c6249 Merge pull request #223 from LexiestLeszek/master-1
Updated README.md with added steps for junior devs
2023-08-05 09:13:12 -07:00
Andrej f93e7b5626 Merge pull request #228 from aiwizzard/master
Fixed typo in README.md
2023-08-05 09:09:19 -07:00
Andrej ba036696b7 Merge branch 'master' into master 2023-08-05 09:08:51 -07:00
Clemens Akens a4e961f378 Add Zig port to README 2023-08-04 18:00:04 +02:00
madroid ec65aac182 Update README.md: add a Kotlin port of this project 2023-08-04 18:50:06 +08:00
Ajmal K b9f303f3b8 Fixed typo in README.md
Fixed typo
2023-08-04 10:30:11 +05:30