Merge pull request #10 from luigifcruz/patch-1
Bigger number better with -funsafe-math-optimizations flag.
This commit is contained in:
@@ -19,7 +19,7 @@ Let's just run a baby Llama 2 model in C. You need a model checkpoint. Download
|
|||||||
wget https://karpathy.ai/llama2c/model.bin -P out
|
wget https://karpathy.ai/llama2c/model.bin -P out
|
||||||
```
|
```
|
||||||
|
|
||||||
(if that doesn't work try [google drive](https://drive.google.com/file/d/1aTimLdx3JktDXxcHySNrZJOOk8Vb1qBR/view?usp=share_link)). Compile and run the C code:
|
(if that doesn't work try [google drive](https://drive.google.com/file/d/1aTimLdx3JktDXxcHySNrZJOOk8Vb1qBR/view?usp=share_link)). Compile and run the C code (check [howto](#howto) for faster optimization flags):
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
gcc -O3 -o run run.c -lm
|
gcc -O3 -o run run.c -lm
|
||||||
@@ -68,6 +68,12 @@ Once we have the model.bin file, we can inference in C. Compile the C code first
|
|||||||
gcc -O3 -o run run.c -lm
|
gcc -O3 -o run run.c -lm
|
||||||
```
|
```
|
||||||
|
|
||||||
|
Alternatively, if you want to increase the inference performance and are confident in using unsafe math optimizations, which are probably fine for this application, you can compile the code with the `-funsafe-math-optimizations` flag as shown below:
|
||||||
|
|
||||||
|
```bash
|
||||||
|
gcc -O3 -funsafe-math-optimizations -o run run.c -lm
|
||||||
|
```
|
||||||
|
|
||||||
You can now run it simply as
|
You can now run it simply as
|
||||||
|
|
||||||
```bash
|
```bash
|
||||||
|
|||||||
Reference in New Issue
Block a user