mirror of
https://github.com/bentoml/OpenLLM.git
synced 2026-03-14 13:06:09 -04:00
2a53faee9c26cbac78e2a7849c61d80bea882f83
since tokenizer are relatively light, all default LLM will bundle the tokenizer with itself. Maybe we can put the tokenizer in its own runner in the future Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
OpenLLM
REST/gRPC API server for running any Open Large-Language Model - StableLM, Llama, Alpaca, Dolly, Flan-T5, and more
Powered by BentoML 🍱 + HuggingFace 🤗
Description
Languages
Python
95.9%
Shell
4.1%