add note on code llama being a bit wrong

2023-08-26 21:22:19 +00:00
parent 7b0017c6cd
commit 91d57db925
1 changed files with 1 additions and 2 deletions
@@ -95,8 +95,7 @@ Then chat with it by specifying the chat mode using the `-m` flag, e.g.:
 ./run llama2_7b_chat.bin -m chat
 ```

-You can also try Meta's Code Llama models even if support for them is incomplete.
-Make sure to build the tokenizer for the plain and instruct variants and pass it when doing inference.
+You can also try Meta's Code Llama models even if support for them is incomplete. In particular, some hyperparameters changed (e.g. the constant in RoPE layer), so the inference is not exactly correct and a bit buggy right now. Looking into fixes. Make sure to build the tokenizer for the plain and instruct variants and pass it when doing inference.

 ```bash
 python export.py codellama2_7b.bin --meta-llama /path/to/CodeLlama-7b