llama2.c

Author	SHA1	Message	Date
Jani Monoses	604d3c59c0	Add Code Llama info	2023-08-26 22:36:09 +03:00
Andrej	4a7a62bd21	Merge branch 'master' into feature/chat	2023-08-25 07:58:33 -07:00
Andrej Karpathy	fbe324fc5a	adjust things a bit	2023-08-25 14:54:05 +00:00
Diego Marcos Segura	19cfbeca71	Fix typo in README.md	2023-08-24 19:46:43 -07:00
Andrej	d7cd98633d	add todo item to add a PyTorch Engine	2023-08-24 09:04:52 -07:00
Jani Monoses	fe9b9f2f15	Train vocab in Python	2023-08-23 19:10:28 +03:00
Andrej Karpathy	d1eb18b8ec	add BOS and EOS function to the Tokenizer as we start to converge closer to the Llama 2 code from Meta, and as we're about to add the Chat capability	2023-08-23 00:08:22 +00:00
Andrej Karpathy	ac6cf8d6e8	tweak todo list	2023-08-22 02:48:51 +00:00
atamyrat	61c26d5392	Updated README to replace export_meta_llama_bin.py script with export.py	2023-08-21 14:24:01 +03:00
Andrej Karpathy	ea44f53568	now that the export.py HF functionality is in master, we can delete this file, and update the readme	2023-08-21 04:58:19 +00:00
Harry Gifford	a72b3b0206	Update readme with suggestion on number of threads to use Update the documentation to make suggestions on the number of threads. The performance difference can be very large. Also linked to the PyTorch docs which are relevant here.	2023-08-20 15:01:33 -07:00
Andrej	8c93c7a30e	Merge pull request #322 from karpathy/feature/export New model export (the code remains "dead" and legacy version is still the default behavior, so no breaking changes are introduced). The major benefit is a new export.py file, which we can use to centralize work on formatting: both imports and exports.	2023-08-20 10:08:32 -07:00
Andrej Karpathy	13dcee493a	todos update	2023-08-20 17:02:22 +00:00
Andrej	6c5d78fa41	Merge pull request #317 from yiminghan/yhan/old Add a link to Dart port in README	2023-08-19 10:01:08 -07:00
rahoua	978c311b30	Add pecca-rs to README.md	2023-08-18 14:58:21 -07:00
YiMing Han	882e480bc0	update read me	2023-08-18 15:18:29 -04:00
YiMing Han	d09ebbb32b	Revert "working one" This reverts commit `8607b11ea1`.	2023-08-18 15:14:08 -04:00
YiMing Han	8607b11ea1	working one	2023-08-18 15:07:41 -04:00
Andrej	df6557a10d	Merge pull request #267 from krrishnarraj/master Update readme for openmp on mac	2023-08-15 19:26:34 -07:00
chenyang	2a9a4c4e14	update readme wiht a simple line to introduce llama2.c-zh	2023-08-14 15:12:30 +08:00
chenyang	79900ff68e	update readme wiht a simple line to introduce llama2.c-zh	2023-08-14 15:00:33 +08:00
Krishnaraj Bhat	eec9ad5a5b	Merge remote-tracking branch 'upstream/master'	2023-08-14 12:02:40 +05:30
Andrej	bae0bcf484	Small tweaks to Readme intro	2023-08-13 20:03:00 -07:00
Andrej Karpathy	854c97b660	turn topp 0.9 back on by default thanks to recent PR contributions truncating before quicksort	2023-08-14 00:12:45 +00:00
Andrej	b51c63b9f2	Merge pull request #283 from wizzard0/wizzard0-mention-1 Add TypeScript port	2023-08-13 14:36:10 -07:00
Andrej Karpathy	8506036185	remove 'revive tests' as a todo from the readme	2023-08-13 21:23:27 +00:00
Andrej Karpathy	f0024cfc88	revive tests. now that we have a tiny stories260K model this only requires a 2MB download. phew	2023-08-13 21:22:44 +00:00
Andrej	0805cb2c31	tiny whitespace fix to try to eliminate scrollbar	2023-08-13 13:40:09 -07:00
Andrej	b2cce341e0	oops typo fix in readme	2023-08-13 13:39:12 -07:00
Andrej Karpathy	3e989e21f2	link to stories260K model	2023-08-13 20:38:05 +00:00
Andrej Karpathy	58075b5ac5	update API of sample.py to be better, small changes here	2023-08-13 20:31:32 +00:00
Andrej	1bcb2d18d6	Merge pull request #284 from karpathy/feature/customtokenizer multiquery support add	2023-08-13 12:38:06 -07:00
Andrej Karpathy	38bfac90a8	bigchange: add multiquery support in run.c. we can now train and inference multiquery models (where n_kv_heads < n_heads). this also means that we, in principle, support Llama 2 34B and 70B models, which are multiquery	2023-08-13 19:34:05 +00:00
Andrej	b28c1e26c5	Merge pull request #275 from icppWorld/webassembly-internet-computer Notable fork section for WebAssembly	2023-08-13 10:14:39 -07:00
Oleksandr Nikitin	0e6213c6e0	Mention I can run the full 7B model	2023-08-13 20:02:34 +03:00
Oleksandr Nikitin	1d68a36d14	Add TypeScript port I've never been so happy to have missed that the JS port already exists :D also it was nice to discover that the JS can reach 80% of the single-threaded C speed (10 tokens/s for TinyStories-110M)	2023-08-13 19:10:07 +03:00
Tian Lin	27adb082f1	Update README.md	2023-08-13 21:58:14 +08:00
Andrej Karpathy	9ff459b925	todo changes	2023-08-13 03:24:31 +00:00
Andrej Karpathy	1d14cb8dd8	add note about 4096 vs 32000 token size on tinystories	2023-08-13 03:19:35 +00:00
Andrej Karpathy	fe49eb222c	readme for custom tokenizers	2023-08-13 03:16:18 +00:00
icpp	f96c7afb2d	Notable fork section for WebAssembly Added my repo `icpp-lmm` for running it on the Internet Computer	2023-08-11 10:11:32 -04:00
Andrej Karpathy	c42641205f	turn off topp sampling by default because it is a bit too slow to be the default. it is likely that turning it on, e.g. -p 0.9 is midlly higher quality and safer samples, but this comes at a cost of too much performance in double digit percent sometimes, for it to be on by default i think...	2023-08-10 15:23:05 +00:00
Krishnaraj Bhat	46d7a6b6c6	Merge branch 'karpathy:master' into master	2023-08-10 11:06:19 +05:30
Krishnaraj Bhat	d45a36cdd2	Update readme for openmp on mac	2023-08-10 10:59:39 +05:30
Andrej	5f8068fd43	Merge pull request #260 from madroidmaq/master Add Jupyter notebook for easier feel the magic	2023-08-09 22:03:36 -07:00
Rahul TR	256e7f885b	Added C# port information in readme	2023-08-09 17:59:47 +05:30
Andrej Karpathy	e36e3fb50d	Merge branch 'master' of github.com:karpathy/llama2.c	2023-08-09 02:08:37 +00:00
Andrej Karpathy	96873b0274	refine todos section make more concrete and sort	2023-08-09 02:08:33 +00:00
madroid	27c5fc76b1	Add Google Colab button	2023-08-08 01:50:19 +08:00
Andrej	3c3b19b14c	Merge pull request #242 from tairov/llama2-py Add a link to simple one file pure Python port	2023-08-06 19:51:30 -07:00

1 2 3 4

156 Commits