mirror of
https://github.com/bentoml/OpenLLM.git
synced 2026-06-12 02:20:32 -04:00
fix(doc): README.md
Signed-off-by: bojiang <5886138+bojiang@users.noreply.github.com>
This commit is contained in:
@@ -163,7 +163,7 @@ openllm deploy llama3:8b
|
||||
```
|
||||
|
||||
> [!NOTE]
|
||||
> If you are deploying a gated models, make sure to add `--env HF_TOKEN=$HF_TOKEN`
|
||||
> If you are deploying a gated models, make sure to set HF_TOKEN in enviroment variables
|
||||
|
||||
Once the deployment is complete, you can run model inference on the BentoCloud console:
|
||||
|
||||
|
||||
Reference in New Issue
Block a user