Chaoyu
e2b26adf2f
chore(docs): update README.md
...
See #12
2023-06-10 00:21:21 -04:00
Aaron
1597d5d4bb
chore(readme): update stablelm [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-10 00:21:21 -04:00
Aaron
bca133f389
revert: update metadata for Python 3.8 and 3.9
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-10 00:21:20 -04:00
Aaron Pham [bot]
11cedce974
infra: bump to dev version of 0.0.33.dev0 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-10 00:21:20 -04:00
Aaron Pham [bot]
03ac525949
infra: prepare for release 0.0.32 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
v0.0.32
2023-06-09 19:05:09 +00:00
Aaron
9bbe1ff4bf
chore(stablelm): make stablelm run explicitly with GPU
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-09 14:57:12 -04:00
Aaron
c51e944cb2
chore(version): remove support for 3.8 and 3.9 for now
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 22:47:57 -04:00
Aaron
b72317db67
fix(import): lazy load torch
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 22:05:41 -04:00
Aaron
16df0f4393
chore(infra): increase timeout to 60m
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 18:18:51 -04:00
Aaron Pham [bot]
d005760c68
infra: bump to dev version of 0.0.32.dev0 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-08 22:15:29 +00:00
Aaron Pham [bot]
e2813f843e
infra: prepare for release 0.0.31 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
v0.0.31
2023-06-08 22:04:19 +00:00
Aaron
ebe5ae797e
fix(script): avoid using private variable
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 17:59:06 -04:00
Aaron
f5edd4fcf4
feat(script): add easy script to release
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 17:52:39 -04:00
Aaron
f284c64370
docs: update release-notes run with ref for tags
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 17:18:23 -04:00
aarnphm-ec2-dev
acf78ce731
fix(saving): make sure to cleanup cuda cache after using default
...
import
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-08 21:11:07 +00:00
Aaron Pham [bot]
a451b03a0a
infra: bump to dev version of 0.0.31.dev0 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-08 21:10:01 +00:00
Aaron Pham [bot]
55d584a986
infra: prepare for release 0.0.30 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
v0.0.30
2023-06-08 20:55:39 +00:00
aarnphm-ec2-dev
2f9bd2f6fe
fix(packaging): make sure to add BENTOML_CONFIG_OPTIONS into
...
Dockerfile
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-08 20:33:20 +00:00
Aaron
71198b66cc
revert: move release-notes to separate actions
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 16:03:41 -04:00
Aaron
1902954463
infra: bump to dev version of 0.0.30.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 16:03:36 -04:00
Aaron Pham [bot]
2db7663ba5
infra: prepare for release 0.0.29 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
v0.0.29
2023-06-08 19:56:51 +00:00
aarnphm-ec2-dev
42f8d0271c
chore(model_name): shorten model name
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-08 19:41:59 +00:00
aarnphm-ec2-dev
d86fb322d0
fix(containerize): Install base openllm for non OpenLLM dev build
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-08 19:31:36 +00:00
aarnphm-ec2-dev
1c9c9645a7
fix(label): make sure to convert labels to all string
...
to avoid warning from bentoml
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-08 19:25:55 +00:00
aarnphm-ec2-dev
0f7840626d
fix(cli): make sure to allow user to pass endpointu
...
--endpoint http://0.0.0.0:3000
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-08 19:23:04 +00:00
aarnphm-ec2-dev
f84b975a55
fix(llm): build to include openllm_client
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-08 19:20:43 +00:00
Jian Shen
e6dd1b1c39
docs: Update README.md
...
Signed-off-by: Jian Shen <jianshen92@gmail.com >
2023-06-09 03:02:53 +08:00
aarnphm-ec2-dev
15cb13839d
fix(load_model): make sure to use correct implementation
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-08 19:01:01 +00:00
Aaron
a84661142c
chore(cli): remove --local for query
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 14:53:11 -04:00
Aaron
7a162402a1
fix(llm): make sure to use correct load_model
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 14:50:58 -04:00
Aaron
20bc9153b1
fix(ci): checkout version on actions
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 14:40:38 -04:00
Aaron
20416ab107
infra: bump to dev version of 0.0.29.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 14:40:29 -04:00
Aaron Pham [bot]
f6d6b08369
infra: prepare for release 0.0.28 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
v0.0.28
2023-06-08 13:25:55 +00:00
Aaron
400445da6f
fix(deps): broken name for bitsandbytes
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 09:19:05 -04:00
Aaron
067a7a8e81
chore(ci): add check script for README table update
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 09:16:28 -04:00
Aaron
c0418b76ec
feat(infra): add tools for managing optional-dependencies
...
based on llm config
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 08:57:19 -04:00
Aaron
23d98a2729
feat(tooling): add script to auto update readme table of supported
...
models
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 08:22:55 -04:00
Aaron
0680059a21
chore(ci): cleanup workflow
...
make it a pipeline for release now
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 05:28:56 -04:00
Aaron
5ecbc0017f
infra: bump to dev version of 0.0.28.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 05:28:44 -04:00
Aaron Pham [bot]
4c86f661ec
infra: prepare for release 0.0.27 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
v0.0.27
2023-06-08 09:11:30 +00:00
aarnphm-ec2-dev
e9e12a66a8
fix(falcon): custom load
...
This has to do with pipeline load is pretty magical and broken
on transformers
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-08 09:03:34 +00:00
Aaron
378b209d67
feat(llm): custom load_model
...
This has to with loading models that requires more attention
than the default bentoml.transformers.load_model
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 04:07:07 -04:00
Aaron
4369395520
chore(docs): running formatter
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 03:14:14 -04:00
aarnphm-ec2-dev
5060f22600
fix(stablelm): disable running on 8bit
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-08 06:40:18 +00:00
aarnphm-ec2-dev
e276b948f0
chore(stablelm): normalize keys name
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-08 06:29:11 +00:00
Aaron Pham
33d0af82a7
chore(readme): align badges to middle
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com >
2023-06-07 18:33:27 -04:00
Jian Shen
66104a017f
docs: Update Badge
...
Signed-off-by: Jian Shen <jianshen92@gmail.com >
2023-06-08 02:06:09 +08:00
Jian Shen
f4c0ef6d0c
docs: Add Badges
...
Signed-off-by: Jian Shen <jianshen92@gmail.com >
2023-06-08 01:58:20 +08:00
Jian Shen
de273a7dd2
doc: Update Readme with Integrations Section
2023-06-08 01:36:05 +08:00
Aaron
f2771bfe49
chore(cli): move back --version
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-07 03:41:50 -04:00