prompt tokenizer improvements: utf8 support, add_dummy_prefix and byte_fallback options to match sentencepiece
This commit is contained in:
Binary file not shown.
Reference in New Issue
Block a user