prompt tokenizer improvements: utf8 support, add_dummy_prefix and byte_fallback options to match sentencepiece

This commit is contained in:
atamyrat
2023-08-04 04:18:20 +03:00
parent 3c3b19b14c
commit c02865df30
3 changed files with 38 additions and 10 deletions
BIN
View File
Binary file not shown.