Merge pull request #5 from danielgross/pleasantify-dx
Make sample.py work out of the box
This commit is contained in:
@@ -80,7 +80,7 @@ But note that this only emits the SentencePiece tokens. To decode the tokens int
|
||||
python run_wrap.py
|
||||
```
|
||||
|
||||
Watch the tokens stream by, fun! We can also run the PyTorch inference script for comparison:
|
||||
Watch the tokens stream by, fun! We can also run the PyTorch inference script for comparison (to run, add [model.ckpt](https://drive.google.com/file/d/1SM0rMxzy7babB-v4MfTg1GFqOCgWar5w/view?usp=share_link) to /out if you haven't already):
|
||||
|
||||
```bash
|
||||
python sample.py
|
||||
@@ -92,7 +92,7 @@ Which gives the same results. More detailed testing will be done in `test_all.py
|
||||
$ pytest
|
||||
```
|
||||
|
||||
Currently you will need two files to run the test: the [model.bin](https://drive.google.com/file/d/1aTimLdx3JktDXxcHySNrZJOOk8Vb1qBR/view?usp=share_link) file and the [model.ckpt](https://drive.google.com/file/d/1SM0rMxzy7babB-v4MfTg1GFqOCgWar5w/view?usp=share_link) file from PyTorch training I ran earlier. I have to think through running the tests without having to download 200MB of data.
|
||||
Currently you will need two files to test or sample: the [model.bin](https://drive.google.com/file/d/1aTimLdx3JktDXxcHySNrZJOOk8Vb1qBR/view?usp=share_link) file and the [model.ckpt](https://drive.google.com/file/d/1SM0rMxzy7babB-v4MfTg1GFqOCgWar5w/view?usp=share_link) file from PyTorch training I ran earlier. I have to think through running the tests without having to download 200MB of data.
|
||||
|
||||
## unsorted todos
|
||||
|
||||
|
||||
@@ -17,7 +17,7 @@ max_new_tokens = 100 # number of tokens generated in each sample
|
||||
temperature = 1.0 # 1.0 = no change, < 1.0 = less random, > 1.0 = more random, in predictions
|
||||
top_k = 300 # retain only the top_k most likely tokens, clamp others to have 0 probability
|
||||
seed = 1337
|
||||
device = 'cuda' # examples: 'cpu', 'cuda', 'cuda:0', 'cuda:1', etc.
|
||||
device = 'cuda' if torch.cuda.is_available() else 'cpu' # examples: 'cpu', 'cuda', 'cuda:0', 'cuda:1', etc.
|
||||
#dtype = 'bfloat16' if torch.cuda.is_available() and torch.cuda.is_bf16_supported() else 'float16' # 'float32' or 'bfloat16' or 'float16'
|
||||
dtype = "float32"
|
||||
compile = False # use PyTorch 2.0 to compile the model to be faster
|
||||
|
||||
Reference in New Issue
Block a user