Commit Graph

242 Commits

Author SHA1 Message Date
Andrej Karpathy d1a59a9ca8 use EXIT_FAILURE instead of 1 2023-08-05 19:19:49 +00:00
Andrej f3e7710763 Merge pull request #215 from Majdoddin/windows
replaced __int64 with int64_t and DWORD with uint_32
2023-08-05 12:14:39 -07:00
Andrej Karpathy 0447b06e7c simplify rope 2023-08-05 18:41:31 +00:00
Andrej Karpathy 8719d5f5a8 Merge branch 'mpcusack-mpcusack/jitsave' 2023-08-05 18:13:07 +00:00
Andrej Karpathy e03d7ecf12 Merge branch 'mpcusack/jitsave' of https://github.com/mpcusack/llama2.c into mpcusack-mpcusack/jitsave 2023-08-05 18:11:21 +00:00
Andrej Karpathy 0609eb6601 slightly tune todos 2023-08-05 17:13:35 +00:00
Andrej Karpathy dcef5ff7c7 add a bit less embarassing argparse that uses keyword arguments instead of positional arguments 2023-08-05 17:08:11 +00:00
Andrej Karpathy 837796e0b7 get rid of unneeded comment now 2023-08-05 16:19:27 +00:00
Andrej db4ad580f3 Merge pull request #225 from RahulSChand/rope_changes
Fixing max_seq_len passed to RoPE implementation. Minor comment changes
2023-08-05 09:18:47 -07:00
Andrej 9d001c6249 Merge pull request #223 from LexiestLeszek/master-1
Updated README.md with added steps for junior devs
2023-08-05 09:13:12 -07:00
Andrej f93e7b5626 Merge pull request #228 from aiwizzard/master
Fixed typo in README.md
2023-08-05 09:09:19 -07:00
Andrej 2abd77a57f Merge pull request #231 from madroidmaq/master
Update README.md: add a Kotlin port of this project
2023-08-05 09:08:59 -07:00
Andrej ba036696b7 Merge branch 'master' into master 2023-08-05 09:08:51 -07:00
Andrej 4b1e5d57a1 Merge pull request #232 from clebert/zig-port
Add Zig port to README
2023-08-05 09:08:31 -07:00
Michael Cusack 13f342af9e docs typo 2023-08-04 23:12:06 +07:00
Michael Cusack f4c96b7339 Add options to save_torchscript 2023-08-04 23:11:33 +07:00
Michael Cusack 4b3a41b8fc Add options to save_torchscript 2023-08-04 23:10:14 +07:00
Clemens Akens a4e961f378 Add Zig port to README 2023-08-04 18:00:04 +02:00
Michael Cusack 113c675bc9 Rename save_model.py 2023-08-04 20:31:44 +07:00
madroid ec65aac182 Update README.md: add a Kotlin port of this project 2023-08-04 18:50:06 +08:00
Michael Cusack 305d920862 Zero'ing params docs 2023-08-04 17:33:23 +07:00
Michael Cusack dfff7812db Zero'ing params docs 2023-08-04 17:31:31 +07:00
Michael Cusack 34f0402501 Zero'ing params docs 2023-08-04 17:31:11 +07:00
Michael Cusack d4cdd6259e Zero'ing params docs 2023-08-04 17:30:05 +07:00
Michael Cusack 9f8e0857ee Typo 2023-08-04 17:22:27 +07:00
Michael Cusack f8d45f180d Reinline loss function 2023-08-04 17:21:29 +07:00
Michael Cusack f67185958b Model args in save script 2023-08-04 17:07:41 +07:00
Michael Cusack fd5e2cc7bc Updating training code for loss result 2023-08-04 17:03:11 +07:00
Michael Cusack ac2b435151 docs 2023-08-04 16:55:26 +07:00
Michael Cusack 11a8348dfc extra line 2023-08-04 16:52:04 +07:00
Michael Cusack f2e34e6b0a Resolve jit.save errors 2023-08-04 16:49:26 +07:00
Ajmal K b9f303f3b8 Fixed typo in README.md
Fixed typo
2023-08-04 10:30:11 +05:30
rahulschand 02cf3c7311 Small changes to ROPE & comments 2023-08-03 20:13:50 +05:30
Leszek Mielnikow 99ea2f7070 Updated README.md with added steps for junior devs 2023-08-03 09:10:13 +02:00
Andrej af8708d87b Merge pull request #216 from gohai/patch-1
Add link to Emscripten port in README
2023-08-02 10:06:09 -07:00
Andrej 9819ae4b3d Merge branch 'master' into patch-1 2023-08-02 10:06:01 -07:00
Andrej 5b47cd1cfb Merge pull request #211 from wlamond/tinyshakespeare
Add tinyshakespeare dataset
2023-08-02 09:42:11 -07:00
Andrej 574be29e4f Merge pull request #217 from mukel/llama2.java
Add Java port.
2023-08-02 09:40:02 -07:00
Alfonso² Peterssen 30974307af Add Java port. 2023-08-02 15:01:28 +02:00
Gottfried Haider 8dd9baddaa Update README.md 2023-08-02 18:09:06 +08:00
Gottfried Haider b2b5514a92 Add link to Emscripten port in README 2023-08-02 17:58:15 +08:00
Ruhollah Majdoddin f50804a5ae replaced __int64 with int64_t and DWORD with uint_32 2023-08-02 10:18:30 +02:00
Will Lamond e592ed5d64 Add tinyshakespeare dataset 2023-08-01 15:26:47 -07:00
Andrej a8f3e1c499 Merge pull request #175 from tairov/ci-mingw
Add build step for win64 msys2/mingw
2023-08-01 09:02:35 -07:00
Andrej b7f026ffc8 Merge pull request #179 from richinseattle/windows-ftell64-fix
use ssize_t/int64 and 64bit version of ftell on windows
2023-08-01 09:01:50 -07:00
Andrej e2d4a386c7 Merge pull request #186 from vodkaslime/master
add vodkaslime llama.zig to readme
2023-08-01 09:00:37 -07:00
Andrej e270c6eb3c Update README.md: add mention of -f unroll loops option for gcc 2023-08-01 08:59:00 -07:00
Andrej 23f608350a Merge branch 'master' into master 2023-08-01 08:50:52 -07:00
Andrej def12a29c6 Merge pull request #173 from juvi21/patch-1
Added julia port to notable forks section in README.md
2023-08-01 08:50:14 -07:00
Andrej 221f4f929f Merge branch 'master' into patch-1 2023-08-01 08:50:08 -07:00