todo changes
This commit is contained in:
@@ -292,12 +292,12 @@ If your candidate PRs have elements of these it doesn't mean they won't get merg
|
||||
|
||||
## unsorted todos
|
||||
|
||||
- revive tests; train a tiny Llama test model (committed to repo) and use it as reference in unit tests
|
||||
- make it easier to add a new dataset with not too much pain
|
||||
- add multiquery support into run.c
|
||||
- add custom bpe training code and the ability to train a smaller vocabulary (32K is to much)
|
||||
- should calculate freq_cis online in the script run.c instead of loading them
|
||||
- int4/8 quantization
|
||||
- export the model in a more sensible output format with a proper header, etc.
|
||||
- train a tiny Llama test model (committed to repo) and use it as reference in unit tests
|
||||
- support Llama 2 7B Chat models and tune run.c to Chat UI/UX
|
||||
- llama2.cu investigate and merge
|
||||
- (LoRA) finetuning and export of Llama 2 models
|
||||
|
||||
Reference in New Issue
Block a user