Jani Monoses
604d3c59c0
Add Code Llama info
2023-08-26 22:36:09 +03:00
Andrej
4a7a62bd21
Merge branch 'master' into feature/chat
2023-08-25 07:58:33 -07:00
Andrej Karpathy
fbe324fc5a
adjust things a bit
2023-08-25 14:54:05 +00:00
Diego Marcos Segura
19cfbeca71
Fix typo in README.md
2023-08-24 19:46:43 -07:00
Andrej
d7cd98633d
add todo item to add a PyTorch Engine
2023-08-24 09:04:52 -07:00
Jani Monoses
fe9b9f2f15
Train vocab in Python
2023-08-23 19:10:28 +03:00
Andrej Karpathy
d1eb18b8ec
add BOS and EOS function to the Tokenizer as we start to converge closer to the Llama 2 code from Meta, and as we're about to add the Chat capability
2023-08-23 00:08:22 +00:00
Andrej Karpathy
ac6cf8d6e8
tweak todo list
2023-08-22 02:48:51 +00:00
atamyrat
61c26d5392
Updated README to replace export_meta_llama_bin.py script with export.py
2023-08-21 14:24:01 +03:00
Andrej Karpathy
ea44f53568
now that the export.py HF functionality is in master, we can delete this file, and update the readme
2023-08-21 04:58:19 +00:00
Harry Gifford
a72b3b0206
Update readme with suggestion on number of threads to use
...
Update the documentation to make suggestions on the number of threads. The performance difference can be very large. Also linked to the PyTorch docs which are relevant here.
2023-08-20 15:01:33 -07:00
Andrej
8c93c7a30e
Merge pull request #322 from karpathy/feature/export
...
New model export (the code remains "dead" and legacy version is still the default behavior, so no breaking changes are introduced). The major benefit is a new export.py file, which we can use to centralize work on formatting: both imports and exports.
2023-08-20 10:08:32 -07:00
Andrej Karpathy
13dcee493a
todos update
2023-08-20 17:02:22 +00:00
Andrej
6c5d78fa41
Merge pull request #317 from yiminghan/yhan/old
...
Add a link to Dart port in README
2023-08-19 10:01:08 -07:00
rahoua
978c311b30
Add pecca-rs to README.md
2023-08-18 14:58:21 -07:00
YiMing Han
882e480bc0
update read me
2023-08-18 15:18:29 -04:00
YiMing Han
d09ebbb32b
Revert "working one"
...
This reverts commit 8607b11ea1 .
2023-08-18 15:14:08 -04:00
YiMing Han
8607b11ea1
working one
2023-08-18 15:07:41 -04:00
Andrej
df6557a10d
Merge pull request #267 from krrishnarraj/master
...
Update readme for openmp on mac
2023-08-15 19:26:34 -07:00
chenyang
2a9a4c4e14
update readme wiht a simple line to introduce llama2.c-zh
2023-08-14 15:12:30 +08:00
chenyang
79900ff68e
update readme wiht a simple line to introduce llama2.c-zh
2023-08-14 15:00:33 +08:00
Krishnaraj Bhat
eec9ad5a5b
Merge remote-tracking branch 'upstream/master'
2023-08-14 12:02:40 +05:30
Andrej
bae0bcf484
Small tweaks to Readme intro
2023-08-13 20:03:00 -07:00
Andrej Karpathy
854c97b660
turn topp 0.9 back on by default thanks to recent PR contributions truncating before quicksort
2023-08-14 00:12:45 +00:00
Andrej
b51c63b9f2
Merge pull request #283 from wizzard0/wizzard0-mention-1
...
Add TypeScript port
2023-08-13 14:36:10 -07:00
Andrej Karpathy
8506036185
remove 'revive tests' as a todo from the readme
2023-08-13 21:23:27 +00:00
Andrej Karpathy
f0024cfc88
revive tests. now that we have a tiny stories260K model this only requires a 2MB download. phew
2023-08-13 21:22:44 +00:00
Andrej
0805cb2c31
tiny whitespace fix to try to eliminate scrollbar
2023-08-13 13:40:09 -07:00
Andrej
b2cce341e0
oops typo fix in readme
2023-08-13 13:39:12 -07:00
Andrej Karpathy
3e989e21f2
link to stories260K model
2023-08-13 20:38:05 +00:00
Andrej Karpathy
58075b5ac5
update API of sample.py to be better, small changes here
2023-08-13 20:31:32 +00:00
Andrej
1bcb2d18d6
Merge pull request #284 from karpathy/feature/customtokenizer
...
multiquery support add
2023-08-13 12:38:06 -07:00
Andrej Karpathy
38bfac90a8
bigchange: add multiquery support in run.c. we can now train and inference multiquery models (where n_kv_heads < n_heads). this also means that we, in principle, support Llama 2 34B and 70B models, which are multiquery
2023-08-13 19:34:05 +00:00
Andrej
b28c1e26c5
Merge pull request #275 from icppWorld/webassembly-internet-computer
...
Notable fork section for WebAssembly
2023-08-13 10:14:39 -07:00
Oleksandr Nikitin
0e6213c6e0
Mention I can run the full 7B model
2023-08-13 20:02:34 +03:00
Oleksandr Nikitin
1d68a36d14
Add TypeScript port
...
I've never been so happy to have missed that the JS port already exists :D also it was nice to discover that the JS can reach 80% of the single-threaded C speed (10 tokens/s for TinyStories-110M)
2023-08-13 19:10:07 +03:00
Tian Lin
27adb082f1
Update README.md
2023-08-13 21:58:14 +08:00
Andrej Karpathy
9ff459b925
todo changes
2023-08-13 03:24:31 +00:00
Andrej Karpathy
1d14cb8dd8
add note about 4096 vs 32000 token size on tinystories
2023-08-13 03:19:35 +00:00
Andrej Karpathy
fe49eb222c
readme for custom tokenizers
2023-08-13 03:16:18 +00:00
icpp
f96c7afb2d
Notable fork section for WebAssembly
...
Added my repo `icpp-lmm` for running it on the Internet Computer
2023-08-11 10:11:32 -04:00
Andrej Karpathy
c42641205f
turn off topp sampling by default because it is a bit too slow to be the default. it is likely that turning it on, e.g. -p 0.9 is midlly higher quality and safer samples, but this comes at a cost of too much performance in double digit percent sometimes, for it to be on by default i think...
2023-08-10 15:23:05 +00:00
Krishnaraj Bhat
46d7a6b6c6
Merge branch 'karpathy:master' into master
2023-08-10 11:06:19 +05:30
Krishnaraj Bhat
d45a36cdd2
Update readme for openmp on mac
2023-08-10 10:59:39 +05:30
Andrej
5f8068fd43
Merge pull request #260 from madroidmaq/master
...
Add Jupyter notebook for easier feel the magic
2023-08-09 22:03:36 -07:00
Rahul TR
256e7f885b
Added C# port information in readme
2023-08-09 17:59:47 +05:30
Andrej Karpathy
e36e3fb50d
Merge branch 'master' of github.com:karpathy/llama2.c
2023-08-09 02:08:37 +00:00
Andrej Karpathy
96873b0274
refine todos section make more concrete and sort
2023-08-09 02:08:33 +00:00
madroid
27c5fc76b1
Add Google Colab button
2023-08-08 01:50:19 +08:00
Andrej
3c3b19b14c
Merge pull request #242 from tairov/llama2-py
...
Add a link to simple one file pure Python port
2023-08-06 19:51:30 -07:00