aarnphm-ec2-dev
f84b975a55
fix(llm): build to include openllm_client
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-08 19:20:43 +00:00
aarnphm-ec2-dev
15cb13839d
fix(load_model): make sure to use correct implementation
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-08 19:01:01 +00:00
Aaron
a84661142c
chore(cli): remove --local for query
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 14:53:11 -04:00
Aaron
7a162402a1
fix(llm): make sure to use correct load_model
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 14:50:58 -04:00
Aaron
20416ab107
infra: bump to dev version of 0.0.29.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 14:40:29 -04:00
Aaron Pham [bot]
f6d6b08369
infra: prepare for release 0.0.28 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-08 13:25:55 +00:00
Aaron
400445da6f
fix(deps): broken name for bitsandbytes
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 09:19:05 -04:00
Aaron
c0418b76ec
feat(infra): add tools for managing optional-dependencies
...
based on llm config
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 08:57:19 -04:00
Aaron
23d98a2729
feat(tooling): add script to auto update readme table of supported
...
models
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 08:22:55 -04:00
Aaron
5ecbc0017f
infra: bump to dev version of 0.0.28.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 05:28:44 -04:00
Aaron Pham [bot]
4c86f661ec
infra: prepare for release 0.0.27 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-08 09:11:30 +00:00
aarnphm-ec2-dev
e9e12a66a8
fix(falcon): custom load
...
This has to do with pipeline load is pretty magical and broken
on transformers
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-08 09:03:34 +00:00
Aaron
378b209d67
feat(llm): custom load_model
...
This has to with loading models that requires more attention
than the default bentoml.transformers.load_model
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-08 04:07:07 -04:00
aarnphm-ec2-dev
5060f22600
fix(stablelm): disable running on 8bit
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-08 06:40:18 +00:00
aarnphm-ec2-dev
e276b948f0
chore(stablelm): normalize keys name
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-08 06:29:11 +00:00
Aaron
f2771bfe49
chore(cli): move back --version
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-07 03:41:50 -04:00
aarnphm-ec2-dev
b794f75744
fix: move accelerate to fine-tune
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-07 06:51:24 +00:00
aarnphm-ec2-dev
170be0ebc8
fix(cli): make sure make_tag to respect config trust_remote_code
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-07 04:35:15 +00:00
aarnphm-ec2-dev
c960b3edff
feat(client): add postprocess for processing client output call
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-07 00:24:20 -04:00
Aaron
5ed71e7121
infra: bump to dev version of 0.0.27.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-07 00:24:20 -04:00
Aaron Pham [bot]
3121576de6
infra: prepare for release 0.0.26 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-07 03:33:24 +00:00
Aaron
d6d2de6748
feat(cli): prune
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-06 23:24:50 -04:00
Aaron
aa50b5279e
fix(falcon): loading based on model registration
...
remove duplicate events
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-06 22:42:28 -04:00
Aaron
ffac6d8916
ui: improve usability with config
...
set load_in_mha to false by default
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-06 19:29:21 -04:00
Aaron
45f022eea3
fix(dolly): no need to pass do_sample to pipeline
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-06 18:46:01 -04:00
Aaron
8823c70e5a
chore: rename variants to pretrained for consistency
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-06 18:45:45 -04:00
aarnphm-ec2-dev
14d702a34f
fix(dolly-v2): using pipeline for latest implementation
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-06 22:22:03 +00:00
Aaron
aab8bdba18
infra: bump to dev version of 0.0.26.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-06 17:37:32 -04:00
Aaron Pham [bot]
f0db182753
infra: prepare for release 0.0.25 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-06 21:23:30 +00:00
Aaron
f78d55f0fd
fix(cli): type handling for specific container types
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-06 17:18:25 -04:00
Aaron
b7a6f5cd2a
infra: bump to dev version of 0.0.25.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-06 17:18:08 -04:00
Aaron Pham [bot]
aa34f0f03b
infra: prepare for release 0.0.24 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-06 19:59:06 +00:00
Aaron
ed54a0b746
chore: fix type issue on 3.8
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-06 15:53:02 -04:00
Aaron
b446b65642
chore(cli): remove alias and use build to be consistent with BentoML
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-06 15:51:13 -04:00
Aaron
f7ba6208c3
infra: bump to dev version of 0.0.24dev0
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-06 09:18:14 -04:00
Aaron Pham [bot]
80ce543311
infra: prepare for release 0.0.23 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-06 12:52:21 +00:00
Aaron
a0749d0a80
chore: update version message
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-06 08:31:40 -04:00
Aaron Pham [bot]
06f501247c
infra: bump to dev version of 0.0.23.dev0 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-06 12:22:41 +00:00
Aaron Pham [bot]
a1461fa39b
infra: prepare for release 0.0.22 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-06 12:22:33 +00:00
Aaron
1707beb7aa
feat(cli): openllm query
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-06 08:05:13 -04:00
Aaron
f840222d12
feat(service): add timeout to metadata
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-05 00:46:02 -07:00
Aaron
8ef4c9cb19
fix(types): broken import and add hints for client
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-05 00:10:44 -07:00
Aaron
ec941c95d5
chore: add license header
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-04 16:22:37 -07:00
Aaron Pham [bot]
c5170c9c47
infra: bump to dev version of 0.0.22.dev0 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-04 22:46:00 +00:00
Aaron Pham [bot]
c76ff987c3
infra: prepare for release 0.0.21 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-04 22:45:50 +00:00
Aaron
52e5f91bd1
infra: bump to dev version of 0.0.21.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-04 15:39:25 -07:00
Aaron Pham [bot]
414341d8d0
infra: prepare for release 0.0.20 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-04 22:31:13 +00:00
Aaron Pham [bot]
01e846ad95
infra: bump to dev version of 0.0.20.dev0 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-04 21:33:19 +00:00
Aaron Pham [bot]
7682ea928b
infra: prepare for release 0.0.19 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-04 21:33:11 +00:00
Aaron
e66340aaba
chore: using typing.get_origin
...
since we only support 3.8+, get_origin and get_args are available
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-04 14:24:08 -07:00