Commit Graph

189 Commits

Author SHA1 Message Date
Aaron
c0418b76ec feat(infra): add tools for managing optional-dependencies
based on llm config

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-08 08:57:19 -04:00
Aaron
23d98a2729 feat(tooling): add script to auto update readme table of supported
models

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-08 08:22:55 -04:00
Aaron
0680059a21 chore(ci): cleanup workflow
make it a pipeline for release now

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-08 05:28:56 -04:00
Aaron
5ecbc0017f infra: bump to dev version of 0.0.28.dev0 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-08 05:28:44 -04:00
Aaron Pham [bot]
4c86f661ec infra: prepare for release 0.0.27 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
v0.0.27
2023-06-08 09:11:30 +00:00
aarnphm-ec2-dev
e9e12a66a8 fix(falcon): custom load
This has to do with pipeline load is pretty magical and broken
on transformers

Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-08 09:03:34 +00:00
Aaron
378b209d67 feat(llm): custom load_model
This has to with loading models that requires more attention
than the default bentoml.transformers.load_model

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-08 04:07:07 -04:00
Aaron
4369395520 chore(docs): running formatter
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-08 03:14:14 -04:00
aarnphm-ec2-dev
5060f22600 fix(stablelm): disable running on 8bit
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-08 06:40:18 +00:00
aarnphm-ec2-dev
e276b948f0 chore(stablelm): normalize keys name
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-08 06:29:11 +00:00
Aaron Pham
33d0af82a7 chore(readme): align badges to middle
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-06-07 18:33:27 -04:00
Jian Shen
66104a017f docs: Update Badge
Signed-off-by: Jian Shen <jianshen92@gmail.com>
2023-06-08 02:06:09 +08:00
Jian Shen
f4c0ef6d0c docs: Add Badges
Signed-off-by: Jian Shen <jianshen92@gmail.com>
2023-06-08 01:58:20 +08:00
Jian Shen
de273a7dd2 doc: Update Readme with Integrations Section 2023-06-08 01:36:05 +08:00
Aaron
f2771bfe49 chore(cli): move back --version
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-07 03:41:50 -04:00
aarnphm-ec2-dev
afb5c7bead chore(deps): lowerbound tabulate to 0.9.0
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-07 06:57:59 +00:00
aarnphm-ec2-dev
b794f75744 fix: move accelerate to fine-tune
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-07 06:51:24 +00:00
aarnphm-ec2-dev
170be0ebc8 fix(cli): make sure make_tag to respect config trust_remote_code
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-07 04:35:15 +00:00
Aaron
ce7143060e chore(ci): to run release note on all tag
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-07 00:24:25 -04:00
aarnphm-ec2-dev
c960b3edff feat(client): add postprocess for processing client output call
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-07 00:24:20 -04:00
Aaron
5ed71e7121 infra: bump to dev version of 0.0.27.dev0 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-07 00:24:20 -04:00
Aaron Pham [bot]
3121576de6 infra: prepare for release 0.0.26 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
v0.0.26
2023-06-07 03:33:24 +00:00
Aaron
d6d2de6748 feat(cli): prune
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 23:24:50 -04:00
Aaron
aa50b5279e fix(falcon): loading based on model registration
remove duplicate events

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 22:42:28 -04:00
Aaron
ffac6d8916 ui: improve usability with config
set load_in_mha to false by default

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 19:29:21 -04:00
Aaron
45f022eea3 fix(dolly): no need to pass do_sample to pipeline
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 18:46:01 -04:00
Aaron
8823c70e5a chore: rename variants to pretrained for consistency
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 18:45:45 -04:00
Aaron
2a778a6fa6 fix(ci): update detached HEAD to main
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 18:23:50 -04:00
aarnphm-ec2-dev
14d702a34f fix(dolly-v2): using pipeline for latest implementation
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-06 22:22:03 +00:00
Aaron
f9535d60e7 fix(deps): bound to 23.1
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 17:37:32 -04:00
Aaron
aab8bdba18 infra: bump to dev version of 0.0.26.dev0 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 17:37:32 -04:00
Aaron Pham [bot]
f0db182753 infra: prepare for release 0.0.25 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
v0.0.25
2023-06-06 21:23:30 +00:00
Aaron
f78d55f0fd fix(cli): type handling for specific container types
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 17:18:25 -04:00
Aaron
b7a6f5cd2a infra: bump to dev version of 0.0.25.dev0 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 17:18:08 -04:00
Aaron Pham [bot]
aa34f0f03b infra: prepare for release 0.0.24 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
v0.0.24
2023-06-06 19:59:06 +00:00
Aaron
ed54a0b746 chore: fix type issue on 3.8
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 15:53:02 -04:00
Aaron
b446b65642 chore(cli): remove alias and use build to be consistent with BentoML
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 15:51:13 -04:00
Jian Shen
fcc67824cb docs: Update README.md
Signed-off-by: Jian Shen <jianshen92@gmail.com>
2023-06-06 23:40:38 +08:00
Aaron
f7ba6208c3 infra: bump to dev version of 0.0.24dev0
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 09:18:14 -04:00
Aaron
f5ab01f2dd infra(release): update logic on push
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 09:16:37 -04:00
Aaron Pham [bot]
80ce543311 infra: prepare for release 0.0.23 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
v0.0.23
2023-06-06 12:52:21 +00:00
Aaron
44ac29b9dd infra: update release scripts to run on actions only
setup release notes to make sure it runs after pushing tag

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 08:45:51 -04:00
Aaron
a0749d0a80 chore: update version message
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 08:31:40 -04:00
Aaron Pham [bot]
06f501247c infra: bump to dev version of 0.0.23.dev0 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
2023-06-06 12:22:41 +00:00
Aaron Pham [bot]
a1461fa39b infra: prepare for release 0.0.22 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
v0.0.22
2023-06-06 12:22:33 +00:00
Aaron
dd7f1001d1 infra: set base BentoML to 1.0.21
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 08:15:44 -04:00
Jian Shen
f373e0ad4f docs: chore and add section on integrating model
Signed-off-by: Jian Shen <jianshen92@gmail.com>
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 08:05:14 -04:00
Aaron
1707beb7aa feat(cli): openllm query
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 08:05:13 -04:00
Jian Shen
41a6bd03a6 docs: Readme and Developer Guide
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-06 08:05:13 -04:00
Aaron
f840222d12 feat(service): add timeout to metadata
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-05 00:46:02 -07:00