Commit Graph

118 Commits

Author SHA1 Message Date
Travis Cline 01c06fa83c readme: Include reference to go port
This adds a link to my go port of this project.
2023-07-26 22:44:15 -07:00
Andrej Karpathy b7efb1b5c9 Merge branch 'richinseattle-patch-2' 2023-07-27 05:23:49 +00:00
Andrej Karpathy 0d18fa7780 Merge branch 'patch-2' of https://github.com/richinseattle/llama2.c into richinseattle-patch-2 2023-07-27 05:23:05 +00:00
Andrej Karpathy eff1c1b425 Merge branch 'master' of github.com:karpathy/llama2.c 2023-07-27 05:20:59 +00:00
Andrej 5c55d59325 Merge pull request #128 from richinseattle/patch-1
update openmp pragmas for MSVC compatibility
2023-07-26 22:20:49 -07:00
richinseattle 37e8c20f4f Windows compat: Use GetTickCount for delta timer
Intentionally not including a windows header here to avoid merge conflict on include with mmap support. cl.exe doesn't complain, mingw warns.
2023-07-26 22:19:49 -07:00
Andrej Karpathy b35e82f63b Merge branch 'richinseattle-patch-1' 2023-07-27 05:18:39 +00:00
Andrej Karpathy 815ce33569 Merge branch 'patch-1' of https://github.com/richinseattle/llama2.c into richinseattle-patch-1 2023-07-27 05:15:52 +00:00
richinseattle 539dc73196 fix whitespace 2023-07-26 22:12:32 -07:00
Andrej 34cce6a6b5 Merge pull request #126 from som-sama/patch-1
Center align cute llama image in README
2023-07-26 22:09:34 -07:00
Andrej Karpathy 530ef8e778 light touchups to export script so one doesn't need to pass in a slash at the end 2023-07-27 05:08:45 +00:00
richinseattle 7f7a3b2d56 update openmp pragmas for MSVC compatibility
This has no negative impact on Linux and is in preparation for windows support. Windows compiles will not work without additional timer and mmap compatibility patches
2023-07-26 22:06:23 -07:00
Som 7887133145 Center align cute llama image in README
Added HTML code to center align the cute llama image in the README file. The image is wrapped in a <p> tag with the "align" attribute set to "center" for proper alignment. Additionally, an "alt" attribute is included for accessibility.
2023-07-27 09:26:20 +05:30
Andrej Karpathy 5f681b64b1 oops missed a section somehow, updating readme 2023-07-27 03:01:48 +00:00
Andrej Karpathy c2bbe9c6fb link to the huggingface hub models instead 2023-07-27 00:14:23 +00:00
Andrej Karpathy 7a4ca4a98b add contributing section to readme, and also notable forks section 2023-07-26 23:58:49 +00:00
Andrej 4085e8971f Merge pull request #119 from kroggen/code-comments
add some code comments
2023-07-26 15:50:01 -07:00
Bernardo Ramos 57034480b6 add some code comments 2023-07-26 19:48:14 -03:00
Andrej Karpathy f0f43b7288 small note on traing times 2023-07-26 22:12:50 +00:00
Andrej Karpathy 2711ae8c32 make compiler tunable in Makefile, i think potentially nice and useful 2023-07-26 16:40:40 +00:00
Andrej 7059d7dba9 Update README.md 2023-07-26 09:06:08 -07:00
Andrej 7496ea8108 Update README.md 2023-07-26 08:59:42 -07:00
Andrej f5d8797af2 Update README.md 2023-07-26 08:59:12 -07:00
Andrej Karpathy 3aedfe59f1 Merge branch 'aegkmq-master' 2023-07-26 15:43:06 +00:00
aegkmq 8986005f23 Minor cleanup 2023-07-26 16:57:08 +09:00
aegkmq 36c522a0d8 Improve locality 2023-07-26 13:24:27 +09:00
Andrej Karpathy f5650891d5 honestly at this point this is a lot more my nanogpt code than llama code 2023-07-25 23:57:03 +00:00
Andrej 7f9f5ca853 Update README.md: new llama model export 2023-07-25 16:30:28 -07:00
Andrej 5bcd19a204 Merge pull request #85 from python273/export-llama-without-llama
Export llama without llama
2023-07-25 16:23:56 -07:00
Andrej 614bf91e5d Merge pull request #60 from emma-eva/patch-1
Fixed time_in_ms() compile time error (termux and neoterm)
2023-07-25 16:06:41 -07:00
Andrej 366711acf8 Merge pull request #77 from madroidmaq/master
Update README.md: formate output samples
2023-07-25 16:01:55 -07:00
python273 4d1fa2f2c6 Export llama without llama 2023-07-26 01:32:00 +04:00
madroid ac22fbce7e Update README.md: formate output samples 2023-07-26 00:46:14 +08:00
Andrej 6cf34d610a Update README.md 2023-07-25 08:14:48 -07:00
Andrej Karpathy 34ccb64ed8 fix typo in readme after adding the 110m model 2023-07-25 15:02:11 +00:00
Andrej Karpathy 94730f1766 add the 110m model, as it finished training 2023-07-25 15:00:57 +00:00
Andrej Karpathy 05ee4cbf38 fix bug in timing - use steps not max seq len doh 2023-07-25 14:21:37 +00:00
Andrej d359fae505 Merge pull request #69 from RichardScottOZ/patch-1
intimately
2023-07-25 07:04:17 -07:00
RichardScottOZ f3a1e227fe intimately 2023-07-25 21:26:30 +09:30
Emma Eva 6ce91b1b3b Fixed time_in_ms() compile time error (termux and neoterm)
clang version 16.0.4
2023-07-25 12:12:40 +06:00
Andrej 98ec4ba23d Update README.md 2023-07-24 22:54:54 -07:00
Andrej 81c90bfcb7 Update README.md: small tweaks 2023-07-24 22:51:39 -07:00
Andrej cf625ecd7e Update README.md 2023-07-24 21:25:31 -07:00
Andrej Karpathy c3e0d73bd2 we can inference Meta's Llama 2 7B, yay 2023-07-25 04:21:07 +00:00
Andrej 133ad3ffff Merge pull request #50 from karpathy/memmap
candidate memmap implementation
2023-07-24 18:59:29 -07:00
Andrej Karpathy a1f6b4653e merge conflict resolve with imports 2023-07-25 01:58:46 +00:00
Andrej d18e9efd77 Merge pull request #48 from richinseattle/richinseattle-patch-1
MSVC Compatibility fix for timer
2023-07-24 16:37:37 -07:00
richinseattle b2857c6af2 Switch to using timespec_get() for cross OS compatibility 2023-07-24 16:31:38 -07:00
richinseattle f121f5f0c5 Merge branch 'karpathy:master' into richinseattle-patch-1 2023-07-24 16:30:07 -07:00
Andrej Karpathy cae88dfbab tune readme around timings etc 2023-07-24 23:27:48 +00:00