From b2b5514a92a5e145b8109aaa3fb7fe6b7fdc76c8 Mon Sep 17 00:00:00 2001
From: Gottfried Haider
Date: Wed, 2 Aug 2023 17:58:15 +0800
Subject: [PATCH] Add link to Emscripten port in README

---
 README.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/README.md b/README.md
index 613f7c3..ef34318 100644
--- a/README.md
+++ b/README.md
@@ -211,12 +211,12 @@ If your candidate PRs have elements of these it doesn't mean they won't get merg
 - [llama2.c - Llama 2 Everywhere](https://github.com/trholding/llama2.c) by @trholding: Standalone, Bootable & Portable Binary Llama 2
 - [llama2.rs](https://github.com/leo-du/llama2.rs) by @leo-du: A Rust port of this project
 - [llama2.scala](https://github.com/jrudolph/llama2.scala) by @jrudolph: a Scala port of this project
+- [llama2.c-emscripten](https://github.com/gohai/llama2.c-emscripten) by @gohai: Emscripten (JavaScript) port, based on @ggerganov initial prototype
 
 ## unsorted todos
 
 - support Llama 2 7B Chat model and tune run.c to Chat UI/UX
 - speed up 7B Llama 2 models sufficiently to work at interactive rates on Apple Silicon MacBooks
-- possibly include emscripten / web backend (as seen in @gg PR)
 - currently the project only runs in fp32, how easy would it be to different precisions?
 - look into quantization and what would be involved
 - todo multiquery support? doesn't seem as useful for smaller models that run on CPU (?)