Aaron 2a53faee9c infra: add structure and cleanup separation of tokenizer
Since tokenizers are relatively lightweight, every default LLM bundles its
tokenizer with itself.

Maybe we can put the tokenizer in its own runner in the future

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-05 11:57:39 -07:00
2023-05-03 17:50:14 -07:00

OpenLLM


REST/gRPC API server for running any open large language model: StableLM, Llama, Alpaca, Dolly, Flan-T5, and more
Powered by BentoML 🍱 + HuggingFace 🤗
Readme Apache-2.0 49 MiB
Languages
Python 95.9%
Shell 4.1%