llama2.c

Author	SHA1	Message	Date
Andrej	1bcb2d18d6	Merge pull request #284 from karpathy/feature/customtokenizer multiquery support add	2023-08-13 12:38:06 -07:00
Andrej Karpathy	38bfac90a8	bigchange: add multiquery support in run.c. we can now train and inference multiquery models (where n_kv_heads < n_heads). this also means that we, in principle, support Llama 2 34B and 70B models, which are multiquery	2023-08-13 19:34:05 +00:00
Andrej	b28c1e26c5	Merge pull request #275 from icppWorld/webassembly-internet-computer Notable fork section for WebAssembly	2023-08-13 10:14:39 -07:00
Andrej	5295cbb821	Merge pull request #281 from lintian06/original_llama2 Update README.md for a new rust port.	2023-08-13 10:14:00 -07:00
Andrej	12dec61fbf	Merge pull request #282 from mihainadas/master-1 Fixes https://github.com/karpathy/llama2.c/issues/280	2023-08-13 10:13:08 -07:00
Mihai Nadăș	570789aa04	Fixes https://github.com/karpathy/llama2.c/issues/280 There was a small bug in tinystories.py, described here: https://github.com/karpathy/llama2.c/issues/280 This commit simply passes vocab_size to get_tokenizer_model_path to avoid silent crash when processing shards (in process_shard)	2023-08-13 17:49:10 +03:00
Tian Lin	27adb082f1	Update README.md	2023-08-13 21:58:14 +08:00
Andrej	8b472ded1f	Merge pull request #272 from karpathy/feature/customtokenizer Big Change: Custom Tokenizer training: add the ability to train custom tokenizers instead of using the pretrained Llama 2 tokenizer. This is useful in custom, narrow-domain LLMs because smaller vocab sizes make much smaller, faster, and potentially more capable models. For example, in tinystories a vocab size 4096 custom tokenizer compresses the input text sequences about as well as the Llama 2 tokenizer with vocab size 32000. The result is also "safer" because a badly trained model can't accidentally e.g. output some random chinese character and rapidly go "off the rails" in subsequent tokens.	2023-08-12 20:31:21 -07:00
Andrej Karpathy	9ff459b925	todo changes	2023-08-13 03:24:31 +00:00
Andrej Karpathy	1d14cb8dd8	add note about 4096 vs 32000 token size on tinystories	2023-08-13 03:19:35 +00:00
Andrej Karpathy	fe49eb222c	readme for custom tokenizers	2023-08-13 03:16:18 +00:00
Andrej Karpathy	9c3cfb46a3	make default be the llama2 tokenizer	2023-08-13 03:08:07 +00:00
Andrej Karpathy	00a61dc7f9	remove the tinyshakespeare dataset until i can bring it back later in a nicer form, otherwise right now we just have a ton of copy paste code here	2023-08-13 02:18:30 +00:00
Andrej Karpathy	f5fc0c245f	final piece: run.c support for new tokenizer, super ez	2023-08-13 02:12:13 +00:00
Andrej Karpathy	ea4cedc588	add ability to export custom tokenizer to .bin format for run.c file	2023-08-13 02:00:19 +00:00
Andrej Karpathy	b0cfa2458d	ok i can train and sample a model with a custom tokenizer	2023-08-11 16:47:29 +00:00
icpp	f96c7afb2d	Notable fork section for WebAssembly Added my repo `icpp-lmm` for running it on the Internet Computer	2023-08-11 10:11:32 -04:00
Andrej Karpathy	4c6f0af9ff	add the ability to train a custom sentencepiece tokenizer with a given vocab_size, and pretok with it. some more changes still needed to merge this branch, in train.py and ofc run.c. did this in a sadly bit ugly, but fully backwards compatible way. basically when we use custom tokenizer we create a whole new directory structure for that	2023-08-11 03:58:22 +00:00
Andrej Karpathy	c42641205f	turn off topp sampling by default because it is a bit too slow to be the default. it is likely that turning it on, e.g. -p 0.9 is mildly higher quality and safer samples, but this comes at a cost of too much performance in double digit percent sometimes, for it to be on by default i think...	2023-08-10 15:23:05 +00:00
Andrej Karpathy	3f69c6cdc4	change the default to use runfast, which imo works just fine	2023-08-10 05:06:49 +00:00
Andrej	5f8068fd43	Merge pull request #260 from madroidmaq/master Add Jupyter notebook for easier feel the magic	2023-08-09 22:03:36 -07:00
Andrej	f60285ee78	Merge pull request #264 from trrahul/master Added C# port information in readme	2023-08-09 22:00:23 -07:00
Andrej	04121d1b85	Merge pull request #256 from rdentato/patch-rng-seed Patch rng seed	2023-08-09 21:56:07 -07:00
Rahul TR	256e7f885b	Added C# port information in readme	2023-08-09 17:59:47 +05:30
Andrej Karpathy	e36e3fb50d	Merge branch 'master' of github.com:karpathy/llama2.c	2023-08-09 02:08:37 +00:00
Andrej Karpathy	96873b0274	refine todos section make more concrete and sort	2023-08-09 02:08:33 +00:00
madroid	9713609023	Add Colab GUI: select model/temperature/prompt/etc	2023-08-08 20:29:53 +08:00
madroid	27c5fc76b1	Add Google Colab button	2023-08-08 01:50:19 +08:00
madroid	57ca3c0401	Add run.ipynb for easier feel the magic	2023-08-08 01:32:51 +08:00
rdentato	ff6a2f0a7a	Reset the #include <omp.h>	2023-08-07 07:28:03 +00:00
rdentato	e49c16caa5	Changed how rng_seed is handled. Now 0 is treated as time(NULL).	2023-08-07 06:51:57 +00:00
Remo Dentato	2e5fad83da	Merge branch 'karpathy:master' into master	2023-08-07 07:57:42 +02:00
Andrej	3c3b19b14c	Merge pull request #242 from tairov/llama2-py Add a link to simple one file pure Python port	2023-08-06 19:51:30 -07:00
Andrej	f4f4cae4cb	Merge pull request #241 from danielgrittner/master add a Rust port	2023-08-06 19:51:13 -07:00
Andrej	09de2cc4ca	Merge pull request #250 from npinto/master-1 FIX: model.generate(); forward() only returns logits now.	2023-08-06 18:43:01 -07:00
Nicolas Pinto	98b515e44d	FIX: model.generate() This patch fixes a simple bug in `generate()` due to model's `forward()` only returning logits and not losses since `f2e34e6b0ac55accd6ba930a04c6f683f5158b29`.	2023-08-06 14:48:47 -07:00
rdentato	999b1bf776	Added conditional include of the OpenMP header.	2023-08-06 21:07:09 +00:00
Aydyn Tairov	2297d158e3	Fix link to a github profile	2023-08-06 21:47:05 +01:00
Daniel Grittner	512f039d5d	Merge branch 'master' into master	2023-08-06 19:55:43 +02:00
Aydyn Tairov	6734eaeff5	Rebase changes to master	2023-08-06 18:47:05 +01:00
Aydyn Tairov	7178facb75	Rebase changes to master	2023-08-06 18:45:47 +01:00
Andrej Karpathy	a7a3aa09b8	Merge branch 'master' of github.com:karpathy/llama2.c	2023-08-06 16:33:36 +00:00
Andrej Karpathy	79791f39b4	let's start respecting the BOS token. Don't print it explicitly, and terminate sequence if it appears. This makes sense especially after the recent addition of prompting. Also be careful with timings and making sure they come out right if we exit early in this data-dependent manner	2023-08-06 16:33:23 +00:00
Andrej Karpathy	4e8a3e8d5d	fix style issue space with stderr printing	2023-08-06 15:51:58 +00:00
Andrej	7af81ded7e	Merge pull request #244 from madroidmaq/master Update README.md: format notable forks	2023-08-06 08:43:24 -07:00
Andrej	a25958fd45	Merge pull request #245 from rdentato/patch-stderr Errors and info on stderr	2023-08-06 08:42:09 -07:00
Madroid Ma	1f53735d12	Merge branch 'karpathy:master' into master	2023-08-06 18:18:36 +08:00
rdentato	9cfb7efb85	Changed all the printf() for error/info messages so that they print on stderr.	2023-08-06 09:53:02 +00:00
madroid	baefaaaf76	Update README.md: add notable forks author's link	2023-08-06 17:42:31 +08:00
Daniel Grittner	fcb4cdef8b	add a Rust port	2023-08-06 10:44:48 +02:00

302 Commits