Merge branch 'master' of github.com:karpathy/llama2.c

2023-07-27 05:20:59 +00:00
parent b35e82f63b 5c55d59325
commit eff1c1b425
1 changed file with 3 additions and 1 deletion
@@ -1,6 +1,8 @@
 ## llama2.c

-<img src="assets/llama_cute.jpg" width="300" height="300">
+<p align="center">
+  <img src="assets/llama_cute.jpg" width="300" height="300" alt="Cute Llama">
+</p>

 With the code in this repo you can train the Llama 2 LLM architecture from scratch in PyTorch, export the weights to a binary file, and load that file into one simple ~500-line C file ([run.c](run.c)) that runs inference on the model. Alternatively, you can load, finetune, and run inference on Meta's Llama 2 (but this is still being actively fleshed out). Hence, this repo is a "fullstack" train + inference solution for the Llama 2 LLM, with a focus on minimalism and simplicity. You might think that you need LLMs with many billions of parameters to do anything useful, but in fact very small LLMs can perform surprisingly well if you make the domain narrow enough. I recommend looking at the [TinyStories](https://huggingface.co/datasets/roneneldan/TinyStories) paper for inspiration.