Aaron
378b209d67
feat(llm): custom load_model
...
This has to do with loading models that require more attention
than the default bentoml.transformers.load_model
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-08 04:07:07 -04:00
Aaron
4369395520
chore(docs): running formatter
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-08 03:14:14 -04:00
aarnphm-ec2-dev
5060f22600
fix(stablelm): disable running on 8bit
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-08 06:40:18 +00:00
aarnphm-ec2-dev
e276b948f0
chore(stablelm): normalize keys name
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-08 06:29:11 +00:00
Aaron Pham
33d0af82a7
chore(readme): align badges to middle
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-06-07 18:33:27 -04:00
Jian Shen
66104a017f
docs: Update Badge
...
Signed-off-by: Jian Shen <jianshen92@gmail.com>
2023-06-08 02:06:09 +08:00
Jian Shen
f4c0ef6d0c
docs: Add Badges
...
Signed-off-by: Jian Shen <jianshen92@gmail.com>
2023-06-08 01:58:20 +08:00
Jian Shen
de273a7dd2
doc: Update Readme with Integrations Section
2023-06-08 01:36:05 +08:00
Aaron
f2771bfe49
chore(cli): move back --version
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-07 03:41:50 -04:00
aarnphm-ec2-dev
afb5c7bead
chore(deps): lowerbound tabulate to 0.9.0
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-07 06:57:59 +00:00
aarnphm-ec2-dev
b794f75744
fix: move accelerate to fine-tune
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-07 06:51:24 +00:00
aarnphm-ec2-dev
170be0ebc8
fix(cli): make sure make_tag respects config trust_remote_code
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-07 04:35:15 +00:00
Aaron
ce7143060e
chore(ci): run release notes on all tags
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-07 00:24:25 -04:00
aarnphm-ec2-dev
c960b3edff
feat(client): add postprocess for processing client output call
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-07 00:24:20 -04:00
Aaron
5ed71e7121
infra: bump to dev version of 0.0.27.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-07 00:24:20 -04:00
Aaron Pham [bot]
3121576de6
infra: prepare for release 0.0.26 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
v0.0.26
2023-06-07 03:33:24 +00:00
Aaron
d6d2de6748
feat(cli): prune
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 23:24:50 -04:00
Aaron
aa50b5279e
fix(falcon): loading based on model registration
...
remove duplicate events
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 22:42:28 -04:00
Aaron
ffac6d8916
ui: improve usability with config
...
set load_in_mha to false by default
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 19:29:21 -04:00
Aaron
45f022eea3
fix(dolly): no need to pass do_sample to pipeline
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 18:46:01 -04:00
Aaron
8823c70e5a
chore: rename variants to pretrained for consistency
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 18:45:45 -04:00
Aaron
2a778a6fa6
fix(ci): update detached HEAD to main
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 18:23:50 -04:00
aarnphm-ec2-dev
14d702a34f
fix(dolly-v2): using pipeline for latest implementation
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-06 22:22:03 +00:00
Aaron
f9535d60e7
fix(deps): bound to 23.1
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 17:37:32 -04:00
Aaron
aab8bdba18
infra: bump to dev version of 0.0.26.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 17:37:32 -04:00
Aaron Pham [bot]
f0db182753
infra: prepare for release 0.0.25 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
v0.0.25
2023-06-06 21:23:30 +00:00
Aaron
f78d55f0fd
fix(cli): type handling for specific container types
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 17:18:25 -04:00
Aaron
b7a6f5cd2a
infra: bump to dev version of 0.0.25.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 17:18:08 -04:00
Aaron Pham [bot]
aa34f0f03b
infra: prepare for release 0.0.24 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
v0.0.24
2023-06-06 19:59:06 +00:00
Aaron
ed54a0b746
chore: fix type issue on 3.8
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 15:53:02 -04:00
Aaron
b446b65642
chore(cli): remove alias and use build to be consistent with BentoML
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 15:51:13 -04:00
Jian Shen
fcc67824cb
docs: Update README.md
...
Signed-off-by: Jian Shen <jianshen92@gmail.com>
2023-06-06 23:40:38 +08:00
Aaron
f7ba6208c3
infra: bump to dev version of 0.0.24dev0
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 09:18:14 -04:00
Aaron
f5ab01f2dd
infra(release): update logic on push
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 09:16:37 -04:00
Aaron Pham [bot]
80ce543311
infra: prepare for release 0.0.23 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
v0.0.23
2023-06-06 12:52:21 +00:00
Aaron
44ac29b9dd
infra: update release scripts to run on actions only
...
setup release notes to make sure it runs after pushing tag
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 08:45:51 -04:00
Aaron
a0749d0a80
chore: update version message
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 08:31:40 -04:00
Aaron Pham [bot]
06f501247c
infra: bump to dev version of 0.0.23.dev0 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
2023-06-06 12:22:41 +00:00
Aaron Pham [bot]
a1461fa39b
infra: prepare for release 0.0.22 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
v0.0.22
2023-06-06 12:22:33 +00:00
Aaron
dd7f1001d1
infra: set base BentoML to 1.0.21
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 08:15:44 -04:00
Jian Shen
f373e0ad4f
docs: chore and add section on integrating model
...
Signed-off-by: Jian Shen <jianshen92@gmail.com>
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 08:05:14 -04:00
Aaron
1707beb7aa
feat(cli): openllm query
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 08:05:13 -04:00
Jian Shen
41a6bd03a6
docs: Readme and Developer Guide
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 08:05:13 -04:00
Aaron
f840222d12
feat(service): add timeout to metadata
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-05 00:46:02 -07:00
Aaron
8ef4c9cb19
fix(types): broken import and add hints for client
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-05 00:10:44 -07:00
Aaron
b7420bf6ce
chore: add assert license headers
...
Current limitation is that it only prints copyright of BentoML team.
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-04 16:35:33 -07:00
Aaron
ec941c95d5
chore: add license header
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-04 16:22:37 -07:00
Aaron Pham [bot]
c5170c9c47
infra: bump to dev version of 0.0.22.dev0 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
2023-06-04 22:46:00 +00:00
Aaron Pham [bot]
c76ff987c3
infra: prepare for release 0.0.21 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
v0.0.21
2023-06-04 22:45:50 +00:00
Aaron
5ebe7e151a
fix: set AUTHOR_NAME and AUTHOR_EMAIL for automatic release
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-04 15:39:32 -07:00