add note on code llama being a bit wrong

This commit is contained in:
Andrej Karpathy
2023-08-26 21:22:19 +00:00
parent 7b0017c6cd
commit 91d57db925
+1 -2
View File
@@ -95,8 +95,7 @@ Then chat with it by specifying the chat mode using the `-m` flag, e.g.:
./run llama2_7b_chat.bin -m chat
```
You can also try Meta's Code Llama models even if support for them is incomplete.
Make sure to build the tokenizer for the plain and instruct variants and pass it when doing inference.
You can also try Meta's Code Llama models even if support for them is incomplete. In particular, some hyperparameters changed (e.g. the constant in RoPE layer), so the inference is not exactly correct and a bit buggy right now. Looking into fixes. Make sure to build the tokenizer for the plain and instruct variants and pass it when doing inference.
```bash
python export.py codellama2_7b.bin --meta-llama /path/to/CodeLlama-7b