Author | Commit | Message | Date
Andrej Karpathy | d1a59a9ca8 | use EXIT_FAILURE instead of 1 | 2023-08-05 19:19:49 +00:00
Andrej | f3e7710763 | Merge pull request #215 from Majdoddin/windows: replaced __int64 with int64_t and DWORD with uint_32 | 2023-08-05 12:14:39 -07:00
Andrej Karpathy | 0447b06e7c | simplify rope | 2023-08-05 18:41:31 +00:00
Andrej Karpathy | 8719d5f5a8 | Merge branch 'mpcusack-mpcusack/jitsave' | 2023-08-05 18:13:07 +00:00
Andrej Karpathy | e03d7ecf12 | Merge branch 'mpcusack/jitsave' of https://github.com/mpcusack/llama2.c into mpcusack-mpcusack/jitsave | 2023-08-05 18:11:21 +00:00
Andrej Karpathy | 0609eb6601 | slightly tune todos | 2023-08-05 17:13:35 +00:00
Andrej Karpathy | dcef5ff7c7 | add a bit less embarassing argparse that uses keyword arguments instead of positional arguments | 2023-08-05 17:08:11 +00:00
Andrej Karpathy | 837796e0b7 | get rid of unneeded comment now | 2023-08-05 16:19:27 +00:00
Andrej | db4ad580f3 | Merge pull request #225 from RahulSChand/rope_changes: Fixing max_seq_len passed to RoPE implementation. Minor comment changes | 2023-08-05 09:18:47 -07:00
Andrej | 9d001c6249 | Merge pull request #223 from LexiestLeszek/master-1: Updated README.md with added steps for junior devs | 2023-08-05 09:13:12 -07:00
Andrej | f93e7b5626 | Merge pull request #228 from aiwizzard/master: Fixed typo in README.md | 2023-08-05 09:09:19 -07:00
Andrej | 2abd77a57f | Merge pull request #231 from madroidmaq/master: Update README.md: add a Kotlin port of this project | 2023-08-05 09:08:59 -07:00
Andrej | ba036696b7 | Merge branch 'master' into master | 2023-08-05 09:08:51 -07:00
Andrej | 4b1e5d57a1 | Merge pull request #232 from clebert/zig-port: Add Zig port to README | 2023-08-05 09:08:31 -07:00
Michael Cusack | 13f342af9e | docs typo | 2023-08-04 23:12:06 +07:00
Michael Cusack | f4c96b7339 | Add options to save_torchscript | 2023-08-04 23:11:33 +07:00
Michael Cusack | 4b3a41b8fc | Add options to save_torchscript | 2023-08-04 23:10:14 +07:00
Clemens Akens | a4e961f378 | Add Zig port to README | 2023-08-04 18:00:04 +02:00
Michael Cusack | 113c675bc9 | Rename save_model.py | 2023-08-04 20:31:44 +07:00
madroid | ec65aac182 | Update README.md: add a Kotlin port of this project | 2023-08-04 18:50:06 +08:00
Michael Cusack | 305d920862 | Zero'ing params docs | 2023-08-04 17:33:23 +07:00
Michael Cusack | dfff7812db | Zero'ing params docs | 2023-08-04 17:31:31 +07:00
Michael Cusack | 34f0402501 | Zero'ing params docs | 2023-08-04 17:31:11 +07:00
Michael Cusack | d4cdd6259e | Zero'ing params docs | 2023-08-04 17:30:05 +07:00
Michael Cusack | 9f8e0857ee | Typo | 2023-08-04 17:22:27 +07:00
Michael Cusack | f8d45f180d | Reinline loss function | 2023-08-04 17:21:29 +07:00
Michael Cusack | f67185958b | Model args in save script | 2023-08-04 17:07:41 +07:00
Michael Cusack | fd5e2cc7bc | Updating training code for loss result | 2023-08-04 17:03:11 +07:00
Michael Cusack | ac2b435151 | docs | 2023-08-04 16:55:26 +07:00
Michael Cusack | 11a8348dfc | extra line | 2023-08-04 16:52:04 +07:00
Michael Cusack | f2e34e6b0a | Resolve jit.save errors | 2023-08-04 16:49:26 +07:00
Ajmal K | b9f303f3b8 | Fixed typo in README.md: Fixed typo | 2023-08-04 10:30:11 +05:30
rahulschand | 02cf3c7311 | Small changes to ROPE & comments | 2023-08-03 20:13:50 +05:30
Leszek Mielnikow | 99ea2f7070 | Updated README.md with added steps for junior devs | 2023-08-03 09:10:13 +02:00
Andrej | af8708d87b | Merge pull request #216 from gohai/patch-1: Add link to Emscripten port in README | 2023-08-02 10:06:09 -07:00
Andrej | 9819ae4b3d | Merge branch 'master' into patch-1 | 2023-08-02 10:06:01 -07:00
Andrej | 5b47cd1cfb | Merge pull request #211 from wlamond/tinyshakespeare: Add tinyshakespeare dataset | 2023-08-02 09:42:11 -07:00
Andrej | 574be29e4f | Merge pull request #217 from mukel/llama2.java: Add Java port. | 2023-08-02 09:40:02 -07:00
Alfonso² Peterssen | 30974307af | Add Java port. | 2023-08-02 15:01:28 +02:00
Gottfried Haider | 8dd9baddaa | Update README.md | 2023-08-02 18:09:06 +08:00
Gottfried Haider | b2b5514a92 | Add link to Emscripten port in README | 2023-08-02 17:58:15 +08:00
Ruhollah Majdoddin | f50804a5ae | replaced __int64 with int64_t and DWORD with uint_32 | 2023-08-02 10:18:30 +02:00
Will Lamond | e592ed5d64 | Add tinyshakespeare dataset | 2023-08-01 15:26:47 -07:00
Andrej | a8f3e1c499 | Merge pull request #175 from tairov/ci-mingw: Add build step for win64 msys2/mingw | 2023-08-01 09:02:35 -07:00
Andrej | b7f026ffc8 | Merge pull request #179 from richinseattle/windows-ftell64-fix: use ssize_t/int64 and 64bit version of ftell on windows | 2023-08-01 09:01:50 -07:00
Andrej | e2d4a386c7 | Merge pull request #186 from vodkaslime/master: add vodkaslime llama.zig to readme | 2023-08-01 09:00:37 -07:00
Andrej | e270c6eb3c | Update README.md: add mention of -f unroll loops option for gcc | 2023-08-01 08:59:00 -07:00
Andrej | 23f608350a | Merge branch 'master' into master | 2023-08-01 08:50:52 -07:00
Andrej | def12a29c6 | Merge pull request #173 from juvi21/patch-1: Added julia port to notable forks section in README.md | 2023-08-01 08:50:14 -07:00
Andrej | 221f4f929f | Merge branch 'master' into patch-1 | 2023-08-01 08:50:08 -07:00