todos update
This commit is contained in:
@@ -308,11 +308,11 @@ If your candidate PRs have elements of these it doesn't mean they won't get merg
|
|||||||
|
|
||||||
## unsorted todos
|
## unsorted todos
|
||||||
|
|
||||||
- make it easier to add a new dataset with not too much pain
|
- delete the export_meta_llama_bin.py and export_meta_llama_hf_bin.py files. instead, import both of these into a proper model.py Transformer instance, and then export using the export script as usual.
|
||||||
- should calculate freq_cis online in the script run.c instead of loading them
|
- migrate the code to work with the new versions export and deprecate the original .bin files
|
||||||
- int4/8 quantization
|
|
||||||
- export the model in a more sensible output format with a proper header, etc.
|
|
||||||
- support Llama 2 7B Chat models and tune run.c to Chat UI/UX
|
- support Llama 2 7B Chat models and tune run.c to Chat UI/UX
|
||||||
|
- make it easier to add a new dataset with not too much pain
|
||||||
|
- int8 quantization
|
||||||
- llama2.cu investigate and merge
|
- llama2.cu investigate and merge
|
||||||
- (LoRA) finetuning and export of Llama 2 models
|
- (LoRA) finetuning and export of Llama 2 models
|
||||||
|
|
||||||
|
|||||||
Reference in New Issue
Block a user