Commit Graph

156 Commits

Author SHA1 Message Date
Jani Monoses 604d3c59c0 Add Code Llama info 2023-08-26 22:36:09 +03:00
Andrej 4a7a62bd21 Merge branch 'master' into feature/chat 2023-08-25 07:58:33 -07:00
Andrej Karpathy fbe324fc5a adjust things a bit 2023-08-25 14:54:05 +00:00
Diego Marcos Segura 19cfbeca71 Fix typo in README.md 2023-08-24 19:46:43 -07:00
Andrej d7cd98633d add todo item to add a PyTorch Engine 2023-08-24 09:04:52 -07:00
Jani Monoses fe9b9f2f15 Train vocab in Python 2023-08-23 19:10:28 +03:00
Andrej Karpathy d1eb18b8ec add BOS and EOS function to the Tokenizer as we start to converge closer to the Llama 2 code from Meta, and as we're about to add the Chat capability 2023-08-23 00:08:22 +00:00
Andrej Karpathy ac6cf8d6e8 tweak todo list 2023-08-22 02:48:51 +00:00
atamyrat 61c26d5392 Updated README to replace export_meta_llama_bin.py script with export.py 2023-08-21 14:24:01 +03:00
Andrej Karpathy ea44f53568 now that the export.py HF functionality is in master, we can delete this file, and update the readme 2023-08-21 04:58:19 +00:00
Harry Gifford a72b3b0206 Update readme with suggestion on number of threads to use
Update the documentation to make suggestions on the number of threads. The performance difference can be very large. Also linked to the PyTorch docs which are relevant here.
2023-08-20 15:01:33 -07:00
Andrej 8c93c7a30e Merge pull request #322 from karpathy/feature/export
New model export (the code remains "dead" and legacy version is still the default behavior, so no breaking changes are introduced). The major benefit is a new export.py file, which we can use to centralize work on formatting: both imports and exports.
2023-08-20 10:08:32 -07:00
Andrej Karpathy 13dcee493a todos update 2023-08-20 17:02:22 +00:00
Andrej 6c5d78fa41 Merge pull request #317 from yiminghan/yhan/old
Add a link to Dart port in README
2023-08-19 10:01:08 -07:00
rahoua 978c311b30 Add pecca-rs to README.md 2023-08-18 14:58:21 -07:00
YiMing Han 882e480bc0 update read me 2023-08-18 15:18:29 -04:00
YiMing Han d09ebbb32b Revert "working one"
This reverts commit 8607b11ea1.
2023-08-18 15:14:08 -04:00
YiMing Han 8607b11ea1 working one 2023-08-18 15:07:41 -04:00
Andrej df6557a10d Merge pull request #267 from krrishnarraj/master
Update readme for openmp on mac
2023-08-15 19:26:34 -07:00
chenyang 2a9a4c4e14 update readme wiht a simple line to introduce llama2.c-zh 2023-08-14 15:12:30 +08:00
chenyang 79900ff68e update readme wiht a simple line to introduce llama2.c-zh 2023-08-14 15:00:33 +08:00
Krishnaraj Bhat eec9ad5a5b Merge remote-tracking branch 'upstream/master' 2023-08-14 12:02:40 +05:30
Andrej bae0bcf484 Small tweaks to Readme intro 2023-08-13 20:03:00 -07:00
Andrej Karpathy 854c97b660 turn topp 0.9 back on by default thanks to recent PR contributions truncating before quicksort 2023-08-14 00:12:45 +00:00
Andrej b51c63b9f2 Merge pull request #283 from wizzard0/wizzard0-mention-1
Add TypeScript port
2023-08-13 14:36:10 -07:00
Andrej Karpathy 8506036185 remove 'revive tests' as a todo from the readme 2023-08-13 21:23:27 +00:00
Andrej Karpathy f0024cfc88 revive tests. now that we have a tiny stories260K model this only requires a 2MB download. phew 2023-08-13 21:22:44 +00:00
Andrej 0805cb2c31 tiny whitespace fix to try to eliminate scrollbar 2023-08-13 13:40:09 -07:00
Andrej b2cce341e0 oops typo fix in readme 2023-08-13 13:39:12 -07:00
Andrej Karpathy 3e989e21f2 link to stories260K model 2023-08-13 20:38:05 +00:00
Andrej Karpathy 58075b5ac5 update API of sample.py to be better, small changes here 2023-08-13 20:31:32 +00:00
Andrej 1bcb2d18d6 Merge pull request #284 from karpathy/feature/customtokenizer
multiquery support add
2023-08-13 12:38:06 -07:00
Andrej Karpathy 38bfac90a8 bigchange: add multiquery support in run.c. we can now train and inference multiquery models (where n_kv_heads < n_heads). this also means that we, in principle, support Llama 2 34B and 70B models, which are multiquery 2023-08-13 19:34:05 +00:00
Andrej b28c1e26c5 Merge pull request #275 from icppWorld/webassembly-internet-computer
Notable fork section for WebAssembly
2023-08-13 10:14:39 -07:00
Oleksandr Nikitin 0e6213c6e0 Mention I can run the full 7B model 2023-08-13 20:02:34 +03:00
Oleksandr Nikitin 1d68a36d14 Add TypeScript port
I've never been so happy to have missed that the JS port already exists :D also it was nice to discover that the JS can reach 80% of the single-threaded C speed (10 tokens/s for TinyStories-110M)
2023-08-13 19:10:07 +03:00
Tian Lin 27adb082f1 Update README.md 2023-08-13 21:58:14 +08:00
Andrej Karpathy 9ff459b925 todo changes 2023-08-13 03:24:31 +00:00
Andrej Karpathy 1d14cb8dd8 add note about 4096 vs 32000 token size on tinystories 2023-08-13 03:19:35 +00:00
Andrej Karpathy fe49eb222c readme for custom tokenizers 2023-08-13 03:16:18 +00:00
icpp f96c7afb2d Notable fork section for WebAssembly
Added my repo `icpp-lmm` for running it on the Internet Computer
2023-08-11 10:11:32 -04:00
Andrej Karpathy c42641205f turn off topp sampling by default because it is a bit too slow to be the default. it is likely that turning it on, e.g. -p 0.9 is midlly higher quality and safer samples, but this comes at a cost of too much performance in double digit percent sometimes, for it to be on by default i think... 2023-08-10 15:23:05 +00:00
Krishnaraj Bhat 46d7a6b6c6 Merge branch 'karpathy:master' into master 2023-08-10 11:06:19 +05:30
Krishnaraj Bhat d45a36cdd2 Update readme for openmp on mac 2023-08-10 10:59:39 +05:30
Andrej 5f8068fd43 Merge pull request #260 from madroidmaq/master
Add Jupyter notebook for easier feel the magic
2023-08-09 22:03:36 -07:00
Rahul TR 256e7f885b Added C# port information in readme 2023-08-09 17:59:47 +05:30
Andrej Karpathy e36e3fb50d Merge branch 'master' of github.com:karpathy/llama2.c 2023-08-09 02:08:37 +00:00
Andrej Karpathy 96873b0274 refine todos section make more concrete and sort 2023-08-09 02:08:33 +00:00
madroid 27c5fc76b1 Add Google Colab button 2023-08-08 01:50:19 +08:00
Andrej 3c3b19b14c Merge pull request #242 from tairov/llama2-py
Add a link to simple one file pure Python port
2023-08-06 19:51:30 -07:00