YiMing Han
|
8607b11ea1
|
working one
|
2023-08-18 15:07:41 -04:00 |
|
Andrej Karpathy
|
bd182289c5
|
calculate the freq_cis online, no need to write/read them to/from checkpoints
|
2023-08-17 04:13:13 +00:00 |
|
Andrej
|
b68a6d2ab5
|
Merge pull request #307 from madroidmaq/master
Jupter Notebook: Add run Meta's Llama 2 models
|
2023-08-16 20:09:32 -07:00 |
|
Andrej
|
57bf0e9ee4
|
Merge pull request #306 from rdentato/patch-utf8-no-validation
minimal protection against invalid UTF8 encoding.
|
2023-08-16 09:51:11 -07:00 |
|
madroid
|
9fbe96fc2e
|
Jupter Notebook: Add run Meta's Llama 2 models
|
2023-08-16 20:27:28 +08:00 |
|
rdentato
|
55e60740f5
|
Added space to str_buffer in case max_token_length is 1.
|
2023-08-16 07:58:07 +00:00 |
|
rdentato
|
befe4867b3
|
minimal protection against invalid UTF8 encoding.
|
2023-08-16 07:42:53 +00:00 |
|
Andrej
|
df6557a10d
|
Merge pull request #267 from krrishnarraj/master
Update readme for openmp on mac
|
2023-08-15 19:26:34 -07:00 |
|
Andrej Karpathy
|
65c899314c
|
Merge branch 'Majdoddin-ci-tiny-model'
|
2023-08-16 02:22:26 +00:00 |
|
Andrej Karpathy
|
62a6d69d86
|
style changes and remove spurious runc test call at the bottom
|
2023-08-16 02:22:13 +00:00 |
|
Andrej Karpathy
|
d47fc41b6a
|
Merge branch 'ci-tiny-model' of https://github.com/Majdoddin/llama2.c into Majdoddin-ci-tiny-model
|
2023-08-16 02:20:34 +00:00 |
|
Andrej Karpathy
|
ca67253f28
|
smallfix: not sure what the point of this indirection was
|
2023-08-15 16:09:33 +00:00 |
|
Andrej Karpathy
|
4c63c5608d
|
shorten top comment on run.c file
|
2023-08-15 16:07:48 +00:00 |
|
Andrej Karpathy
|
a47f9b3969
|
collapsing copy paste code because it's driving my ocd crazy
|
2023-08-15 16:03:11 +00:00 |
|
Ruhollah Majdoddin
|
87b11edf27
|
modifiying test_all so it can safely run on windows
|
2023-08-15 16:01:53 +00:00 |
|
Ruhollah Majdoddin
|
66c9f5e6c8
|
Adding pytest with the tiny model to macOS and windows (except amd64_arm64) runners
|
2023-08-15 15:58:04 +00:00 |
|
Andrej Karpathy
|
88eb238255
|
add tests into Makefile convenience
|
2023-08-15 15:57:27 +00:00 |
|
Andrej
|
600cedb33d
|
Merge pull request #297 from karpathy/feature/utf8
Add UTF-8 support to prompts
|
2023-08-14 19:54:49 -07:00 |
|
Andrej Karpathy
|
fe2de68688
|
fix sample.py from tokenizer changes before
|
2023-08-15 02:33:01 +00:00 |
|
Andrej Karpathy
|
a9a0628c92
|
thoroughly commented the UTF-8 byte reading code
|
2023-08-15 02:18:49 +00:00 |
|
Andrej Karpathy
|
d459fd4243
|
add back careful processing of the byte tokens
|
2023-08-15 01:42:33 +00:00 |
|
Andrej Karpathy
|
4bf36ecc17
|
get rid of the special byte decoding logic
|
2023-08-15 01:04:10 +00:00 |
|
Andrej Karpathy
|
8417cb438d
|
Merge branch 'utf8' of https://github.com/atamurad/llama2.c into feature/utf8
|
2023-08-15 00:18:53 +00:00 |
|
Andrej Karpathy
|
94a3a5e0a5
|
Merge branch 'master' of github.com:karpathy/llama2.c
|
2023-08-14 14:52:15 +00:00 |
|
Andrej Karpathy
|
32c1ff97fb
|
missed p->dim to kv_dim for k,v vectors. we're not doing anything wrong we're just being wasteful with memory. thanks @xefoci7612 for pointing out
|
2023-08-14 14:52:07 +00:00 |
|
Andrej
|
013e012b87
|
Merge pull request #286 from Nick-infinity/master
[Feat]: Add support for meta llama hf model conversion
|
2023-08-14 07:46:39 -07:00 |
|
Andrej
|
50f970d170
|
Merge pull request #289 from chenyangMl/update_readme
Update readme to introduce llama2.c-zh
|
2023-08-14 07:41:13 -07:00 |
|
chenyang
|
2a9a4c4e14
|
update readme wiht a simple line to introduce llama2.c-zh
|
2023-08-14 15:12:30 +08:00 |
|
chenyang
|
79900ff68e
|
update readme wiht a simple line to introduce llama2.c-zh
|
2023-08-14 15:00:33 +08:00 |
|
Krishnaraj Bhat
|
eec9ad5a5b
|
Merge remote-tracking branch 'upstream/master'
|
2023-08-14 12:02:40 +05:30 |
|
Andrej Karpathy
|
82ad2ba34e
|
remove tiktoken as dependency
|
2023-08-14 05:53:57 +00:00 |
|
Nikhil Gupta
|
c39f19f1a9
|
[Feat]: Add support for meta llama hf model conversion
Description:
Llama 2 hf models have weights stored with diff name
Signed-off-by: Nikhil Gupta <nikhilg.me@gmail.com>
|
2023-08-14 10:18:51 +05:30 |
|
Andrej
|
bae0bcf484
|
Small tweaks to Readme intro
|
2023-08-13 20:03:00 -07:00 |
|
Andrej Karpathy
|
45afa91dca
|
the accum function has been bothering me, there is no real need to add a function here, it does something trivial and is only used twice, scrap
|
2023-08-14 02:54:27 +00:00 |
|
Andrej Karpathy
|
854c97b660
|
turn topp 0.9 back on by default thanks to recent PR contributions truncating before quicksort
|
2023-08-14 00:12:45 +00:00 |
|
Andrej
|
4a2c375df9
|
Merge pull request #276 from jrudolph/improve-top-p
optimize sample_topp by filtering out small value elements up front
|
2023-08-13 17:05:38 -07:00 |
|
Andrej
|
b3d6a9e6b5
|
Merge pull request #285 from karpathy/feature/civ2
Upgrading CI to run our new pytest
|
2023-08-13 16:55:01 -07:00 |
|
Andrej
|
091c799653
|
Merge branch 'master' into feature/civ2
|
2023-08-13 16:54:24 -07:00 |
|
Andrej Karpathy
|
c970f69334
|
oops i should probably call this function lol
|
2023-08-13 23:48:01 +00:00 |
|
Andrej Karpathy
|
223a67048a
|
add optional manual dispatch of actions
|
2023-08-13 23:39:37 +00:00 |
|
Andrej Karpathy
|
86325bf7e8
|
attempt to upgrade the CI to run our pytest
|
2023-08-13 23:35:29 +00:00 |
|
Andrej
|
b51c63b9f2
|
Merge pull request #283 from wizzard0/wizzard0-mention-1
Add TypeScript port
|
2023-08-13 14:36:10 -07:00 |
|
Andrej Karpathy
|
8506036185
|
remove 'revive tests' as a todo from the readme
|
2023-08-13 21:23:27 +00:00 |
|
Andrej Karpathy
|
f0024cfc88
|
revive tests. now that we have a tiny stories260K model this only requires a 2MB download. phew
|
2023-08-13 21:22:44 +00:00 |
|
Andrej
|
0805cb2c31
|
tiny whitespace fix to try to eliminate scrollbar
|
2023-08-13 13:40:09 -07:00 |
|
Andrej
|
b2cce341e0
|
oops typo fix in readme
|
2023-08-13 13:39:12 -07:00 |
|
Andrej Karpathy
|
3e989e21f2
|
link to stories260K model
|
2023-08-13 20:38:05 +00:00 |
|
Andrej Karpathy
|
58075b5ac5
|
update API of sample.py to be better, small changes here
|
2023-08-13 20:31:32 +00:00 |
|
atamyrat
|
36b54321e5
|
bugfix: allocate +1 in tokens buffer for dummy whitespace
|
2023-08-13 23:23:32 +03:00 |
|
Andrej
|
1bcb2d18d6
|
Merge pull request #284 from karpathy/feature/customtokenizer
multiquery support add
|
2023-08-13 12:38:06 -07:00 |
|