Andrej
|
1bcb2d18d6
|
Merge pull request #284 from karpathy/feature/customtokenizer
multiquery support add
|
2023-08-13 12:38:06 -07:00 |
|
Andrej Karpathy
|
38bfac90a8
|
bigchange: add multiquery support in run.c. we can now train and inference multiquery models (where n_kv_heads < n_heads). this also means that we, in principle, support Llama 2 34B and 70B models, which are multiquery
|
2023-08-13 19:34:05 +00:00 |
|
Andrej
|
b28c1e26c5
|
Merge pull request #275 from icppWorld/webassembly-internet-computer
Notable fork section for WebAssembly
|
2023-08-13 10:14:39 -07:00 |
|
Tian Lin
|
27adb082f1
|
Update README.md
|
2023-08-13 21:58:14 +08:00 |
|
Andrej Karpathy
|
9ff459b925
|
todo changes
|
2023-08-13 03:24:31 +00:00 |
|
Andrej Karpathy
|
1d14cb8dd8
|
add note about 4096 vs 32000 token size on tinystories
|
2023-08-13 03:19:35 +00:00 |
|
Andrej Karpathy
|
fe49eb222c
|
readme for custom tokenizers
|
2023-08-13 03:16:18 +00:00 |
|
icpp
|
f96c7afb2d
|
Notable fork section for WebAssembly
Added my repo `icpp-lmm` for running it on the Internet Computer
|
2023-08-11 10:11:32 -04:00 |
|
Andrej Karpathy
|
c42641205f
|
turn off topp sampling by default because it is a bit too slow to be the default. it is likely that turning it on, e.g. -p 0.9 is midlly higher quality and safer samples, but this comes at a cost of too much performance in double digit percent sometimes, for it to be on by default i think...
|
2023-08-10 15:23:05 +00:00 |
|
Andrej
|
5f8068fd43
|
Merge pull request #260 from madroidmaq/master
Add Jupyter notebook for easier feel the magic
|
2023-08-09 22:03:36 -07:00 |
|
Rahul TR
|
256e7f885b
|
Added C# port information in readme
|
2023-08-09 17:59:47 +05:30 |
|
Andrej Karpathy
|
e36e3fb50d
|
Merge branch 'master' of github.com:karpathy/llama2.c
|
2023-08-09 02:08:37 +00:00 |
|
Andrej Karpathy
|
96873b0274
|
refine todos section make more concrete and sort
|
2023-08-09 02:08:33 +00:00 |
|
madroid
|
27c5fc76b1
|
Add Google Colab button
|
2023-08-08 01:50:19 +08:00 |
|
Andrej
|
3c3b19b14c
|
Merge pull request #242 from tairov/llama2-py
Add a link to simple one file pure Python port
|
2023-08-06 19:51:30 -07:00 |
|
Aydyn Tairov
|
2297d158e3
|
Fix link to a github profile
|
2023-08-06 21:47:05 +01:00 |
|
Daniel Grittner
|
512f039d5d
|
Merge branch 'master' into master
|
2023-08-06 19:55:43 +02:00 |
|
Aydyn Tairov
|
6734eaeff5
|
Rebase chanes to master
|
2023-08-06 18:47:05 +01:00 |
|
Aydyn Tairov
|
7178facb75
|
Rebase changes to master
|
2023-08-06 18:45:47 +01:00 |
|
Madroid Ma
|
1f53735d12
|
Merge branch 'karpathy:master' into master
|
2023-08-06 18:18:36 +08:00 |
|
madroid
|
baefaaaf76
|
Update README.md: add notable forks author's link
|
2023-08-06 17:42:31 +08:00 |
|
Daniel Grittner
|
fcb4cdef8b
|
add a Rust port
|
2023-08-06 10:44:48 +02:00 |
|
Andrej Karpathy
|
8931d5092e
|
add nucleus sampling. it costs lines of code, but i think thit is the default best way to sample, so it is important to have
|
2023-08-06 07:22:39 +00:00 |
|
madroid
|
8c1f1b280f
|
Update README.md: format notable forks
|
2023-08-06 14:23:57 +08:00 |
|
Andrej Karpathy
|
0609eb6601
|
slightly tune todos
|
2023-08-05 17:13:35 +00:00 |
|
Andrej Karpathy
|
dcef5ff7c7
|
add a bit less embarassing argparse that uses keyword arguments instead of positional arguments
|
2023-08-05 17:08:11 +00:00 |
|
Andrej
|
9d001c6249
|
Merge pull request #223 from LexiestLeszek/master-1
Updated README.md with added steps for junior devs
|
2023-08-05 09:13:12 -07:00 |
|
Andrej
|
f93e7b5626
|
Merge pull request #228 from aiwizzard/master
Fixed typo in README.md
|
2023-08-05 09:09:19 -07:00 |
|
Andrej
|
ba036696b7
|
Merge branch 'master' into master
|
2023-08-05 09:08:51 -07:00 |
|
Clemens Akens
|
a4e961f378
|
Add Zig port to README
|
2023-08-04 18:00:04 +02:00 |
|
madroid
|
ec65aac182
|
Update README.md: add a Kotlin port of this project
|
2023-08-04 18:50:06 +08:00 |
|
Ajmal K
|
b9f303f3b8
|
Fixed typo in README.md
Fixed typo
|
2023-08-04 10:30:11 +05:30 |
|
Leszek Mielnikow
|
99ea2f7070
|
Updated README.md with added steps for junior devs
|
2023-08-03 09:10:13 +02:00 |
|
Andrej
|
9819ae4b3d
|
Merge branch 'master' into patch-1
|
2023-08-02 10:06:01 -07:00 |
|
Alfonso² Peterssen
|
30974307af
|
Add Java port.
|
2023-08-02 15:01:28 +02:00 |
|
Gottfried Haider
|
8dd9baddaa
|
Update README.md
|
2023-08-02 18:09:06 +08:00 |
|
Gottfried Haider
|
b2b5514a92
|
Add link to Emscripten port in README
|
2023-08-02 17:58:15 +08:00 |
|
Andrej
|
e2d4a386c7
|
Merge pull request #186 from vodkaslime/master
add vodkaslime llama.zig to readme
|
2023-08-01 09:00:37 -07:00 |
|
Andrej
|
e270c6eb3c
|
Update README.md: add mention of -f unroll loops option for gcc
|
2023-08-01 08:59:00 -07:00 |
|
Andrej
|
23f608350a
|
Merge branch 'master' into master
|
2023-08-01 08:50:52 -07:00 |
|
Andrej
|
221f4f929f
|
Merge branch 'master' into patch-1
|
2023-08-01 08:50:08 -07:00 |
|
Andrej
|
217667d085
|
Merge branch 'master' into notable-forks-patch
|
2023-08-01 08:49:08 -07:00 |
|
Andrej
|
9023840fe0
|
Merge branch 'master' into master
|
2023-08-01 08:34:23 -07:00 |
|
Andrej
|
f971b76008
|
Merge pull request #188 from leo-du/llama2.c
Add another Rust port in Readme
|
2023-08-01 08:21:06 -07:00 |
|
Andrej
|
9942a33ae2
|
Merge pull request #194 from celikin/patch-1
added JNI for android with compiler optimization
|
2023-08-01 08:20:50 -07:00 |
|
Andrej
|
13d22efe48
|
Merge branch 'master' into llama2.c
|
2023-08-01 08:19:31 -07:00 |
|
juvi
|
338f606c59
|
Merge branch 'master' into patch-1
|
2023-08-01 12:36:19 +02:00 |
|
Johannes Rudolph
|
4c0a88249d
|
add link to scala port
|
2023-07-31 14:59:11 +02:00 |
|
Vulcan
|
d0702edf1f
|
README.md - Update notable forks section
|
2023-07-31 18:12:56 +05:30 |
|
Leo Du
|
3b446baeb3
|
update readme
|
2023-07-31 03:34:34 -04:00 |
|