YiMing Han
8607b11ea1
working one
2023-08-18 15:07:41 -04:00
Andrej
df6557a10d
Merge pull request #267 from krrishnarraj/master
...
Update readme for openmp on mac
2023-08-15 19:26:34 -07:00
chenyang
2a9a4c4e14
update readme wiht a simple line to introduce llama2.c-zh
2023-08-14 15:12:30 +08:00
chenyang
79900ff68e
update readme wiht a simple line to introduce llama2.c-zh
2023-08-14 15:00:33 +08:00
Krishnaraj Bhat
eec9ad5a5b
Merge remote-tracking branch 'upstream/master'
2023-08-14 12:02:40 +05:30
Andrej
bae0bcf484
Small tweaks to Readme intro
2023-08-13 20:03:00 -07:00
Andrej Karpathy
854c97b660
turn topp 0.9 back on by default thanks to recent PR contributions truncating before quicksort
2023-08-14 00:12:45 +00:00
Andrej
b51c63b9f2
Merge pull request #283 from wizzard0/wizzard0-mention-1
...
Add TypeScript port
2023-08-13 14:36:10 -07:00
Andrej Karpathy
8506036185
remove 'revive tests' as a todo from the readme
2023-08-13 21:23:27 +00:00
Andrej Karpathy
f0024cfc88
revive tests. now that we have a tiny stories260K model this only requires a 2MB download. phew
2023-08-13 21:22:44 +00:00
Andrej
0805cb2c31
tiny whitespace fix to try to eliminate scrollbar
2023-08-13 13:40:09 -07:00
Andrej
b2cce341e0
oops typo fix in readme
2023-08-13 13:39:12 -07:00
Andrej Karpathy
3e989e21f2
link to stories260K model
2023-08-13 20:38:05 +00:00
Andrej Karpathy
58075b5ac5
update API of sample.py to be better, small changes here
2023-08-13 20:31:32 +00:00
Andrej
1bcb2d18d6
Merge pull request #284 from karpathy/feature/customtokenizer
...
multiquery support add
2023-08-13 12:38:06 -07:00
Andrej Karpathy
38bfac90a8
bigchange: add multiquery support in run.c. we can now train and inference multiquery models (where n_kv_heads < n_heads). this also means that we, in principle, support Llama 2 34B and 70B models, which are multiquery
2023-08-13 19:34:05 +00:00
Andrej
b28c1e26c5
Merge pull request #275 from icppWorld/webassembly-internet-computer
...
Notable fork section for WebAssembly
2023-08-13 10:14:39 -07:00
Oleksandr Nikitin
0e6213c6e0
Mention I can run the full 7B model
2023-08-13 20:02:34 +03:00
Oleksandr Nikitin
1d68a36d14
Add TypeScript port
...
I've never been so happy to have missed that the JS port already exists :D also it was nice to discover that the JS can reach 80% of the single-threaded C speed (10 tokens/s for TinyStories-110M)
2023-08-13 19:10:07 +03:00
Tian Lin
27adb082f1
Update README.md
2023-08-13 21:58:14 +08:00
Andrej Karpathy
9ff459b925
todo changes
2023-08-13 03:24:31 +00:00
Andrej Karpathy
1d14cb8dd8
add note about 4096 vs 32000 token size on tinystories
2023-08-13 03:19:35 +00:00
Andrej Karpathy
fe49eb222c
readme for custom tokenizers
2023-08-13 03:16:18 +00:00
icpp
f96c7afb2d
Notable fork section for WebAssembly
...
Added my repo `icpp-lmm` for running it on the Internet Computer
2023-08-11 10:11:32 -04:00
Andrej Karpathy
c42641205f
turn off topp sampling by default because it is a bit too slow to be the default. it is likely that turning it on, e.g. -p 0.9 is midlly higher quality and safer samples, but this comes at a cost of too much performance in double digit percent sometimes, for it to be on by default i think...
2023-08-10 15:23:05 +00:00
Krishnaraj Bhat
46d7a6b6c6
Merge branch 'karpathy:master' into master
2023-08-10 11:06:19 +05:30
Krishnaraj Bhat
d45a36cdd2
Update readme for openmp on mac
2023-08-10 10:59:39 +05:30
Andrej
5f8068fd43
Merge pull request #260 from madroidmaq/master
...
Add Jupyter notebook for easier feel the magic
2023-08-09 22:03:36 -07:00
Rahul TR
256e7f885b
Added C# port information in readme
2023-08-09 17:59:47 +05:30
Andrej Karpathy
e36e3fb50d
Merge branch 'master' of github.com:karpathy/llama2.c
2023-08-09 02:08:37 +00:00
Andrej Karpathy
96873b0274
refine todos section make more concrete and sort
2023-08-09 02:08:33 +00:00
madroid
27c5fc76b1
Add Google Colab button
2023-08-08 01:50:19 +08:00
Andrej
3c3b19b14c
Merge pull request #242 from tairov/llama2-py
...
Add a link to simple one file pure Python port
2023-08-06 19:51:30 -07:00
Aydyn Tairov
2297d158e3
Fix link to a github profile
2023-08-06 21:47:05 +01:00
Daniel Grittner
512f039d5d
Merge branch 'master' into master
2023-08-06 19:55:43 +02:00
Aydyn Tairov
6734eaeff5
Rebase chanes to master
2023-08-06 18:47:05 +01:00
Aydyn Tairov
7178facb75
Rebase changes to master
2023-08-06 18:45:47 +01:00
Madroid Ma
1f53735d12
Merge branch 'karpathy:master' into master
2023-08-06 18:18:36 +08:00
madroid
baefaaaf76
Update README.md: add notable forks author's link
2023-08-06 17:42:31 +08:00
Daniel Grittner
fcb4cdef8b
add a Rust port
2023-08-06 10:44:48 +02:00
Andrej Karpathy
8931d5092e
add nucleus sampling. it costs lines of code, but i think thit is the default best way to sample, so it is important to have
2023-08-06 07:22:39 +00:00
madroid
8c1f1b280f
Update README.md: format notable forks
2023-08-06 14:23:57 +08:00
Andrej Karpathy
0609eb6601
slightly tune todos
2023-08-05 17:13:35 +00:00
Andrej Karpathy
dcef5ff7c7
add a bit less embarassing argparse that uses keyword arguments instead of positional arguments
2023-08-05 17:08:11 +00:00
Andrej
9d001c6249
Merge pull request #223 from LexiestLeszek/master-1
...
Updated README.md with added steps for junior devs
2023-08-05 09:13:12 -07:00
Andrej
f93e7b5626
Merge pull request #228 from aiwizzard/master
...
Fixed typo in README.md
2023-08-05 09:09:19 -07:00
Andrej
ba036696b7
Merge branch 'master' into master
2023-08-05 09:08:51 -07:00
Clemens Akens
a4e961f378
Add Zig port to README
2023-08-04 18:00:04 +02:00
madroid
ec65aac182
Update README.md: add a Kotlin port of this project
2023-08-04 18:50:06 +08:00
Ajmal K
b9f303f3b8
Fixed typo in README.md
...
Fixed typo
2023-08-04 10:30:11 +05:30