mirror of
https://github.com/bentoml/OpenLLM.git
synced 2026-01-24 07:17:53 -05:00
2a53faee9c26cbac78e2a7849c61d80bea882f83
Since tokenizers are relatively lightweight, every default LLM bundles its tokenizer with itself. The tokenizer could be moved into its own runner in the future.
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
OpenLLM
REST/gRPC API server for running any Open Large-Language Model - StableLM, Llama, Alpaca, Dolly, Flan-T5, and more
Powered by BentoML 🍱 + HuggingFace 🤗
Languages: Python 95.9%, Shell 4.1%