Commit Graph

97 Commits

Author SHA1 Message Date
Aaron
c33a90a0cc chore: add annotations for attrs and eval correct annotation type
eval will be here once I find a different way to parse types into
python

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-31 17:11:20 -07:00
Aaron
e910b6d3bd infra: bump to dev version of 0.0.15.dev0 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-31 14:12:57 -07:00
Aaron
feb7265c80 infra: prepare for release 0.0.14 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
v0.0.14
2023-05-31 14:12:37 -07:00
Aaron
c2d620c772 fix(load): Make sure we safely load environment variables
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-31 14:12:19 -07:00
Aaron
4e2d5e330c refactor(cli): move CLI to address anti-pattern
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-31 13:53:40 -07:00
Aaron
a72f4d7117 infra: bump to dev version of 0.0.14.dev0 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-30 14:56:59 -07:00
Aaron
169a1ca9eb infra: prepare for release 0.0.13 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
v0.0.13
2023-05-30 14:56:41 -07:00
Aaron
33e7004e66 format: consistent CLI outputs
vendored type-related module from bentoml._internal.types

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-30 14:56:11 -07:00
Aaron
5b1b7d6ab8 chore(llm): expose langchain API to the runner
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-30 14:37:00 -07:00
Aaron
36ba176bd5 infra: bump to dev version of 0.0.13.dev0 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-30 12:02:36 -07:00
Aaron
21eb852242 infra: prepare for release 0.0.12 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
v0.0.12
2023-05-30 12:02:18 -07:00
Aaron
fa16c67131 fix(cli): remove debug print
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-30 12:02:00 -07:00
Aaron
e836218053 infra: bump to dev version of 0.0.12.dev0 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-30 12:00:16 -07:00
Aaron
cf1d3b4d55 infra: prepare for release 0.0.11 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
v0.0.11
2023-05-30 11:59:55 -07:00
Aaron Pham
01517e37c6 migration: attrs (#7)
Move configuration to attrs

Depends on https://github.com/bentoml/BentoML/pull/3906
2023-05-30 11:59:21 -07:00
Aaron
3e69c49ff0 infra: bump to dev version of 0.0.11.dev0 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-28 19:15:41 -07:00
Aaron
ad5d126d95 infra: prepare for release 0.0.10 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
v0.0.10
2023-05-28 19:15:15 -07:00
Aaron
d5f213e49c compat: provide shim for pydantic 1 and 2
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-28 19:14:51 -07:00
Aaron
aff23584a5 docs: jot down some thoughts for API design
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-28 17:04:15 -07:00
Aaron
41706eee5b feat(save_model): passing tag
update with upstream bentoml

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-28 17:04:15 -07:00
aarnphm-ec2-dev
f68c764f9b chore: disable bettertransformer when bitsandbytes is used
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-05-28 20:09:57 +00:00
aarnphm-ec2-dev
b4b65fc609 chore(stablelm): tuning half when 8bit is not loaded
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-05-28 19:55:26 +00:00
Aaron Pham
ceb9f7bded feat: BetterTransformer during inference (#6)
2023-05-28 11:56:49 -07:00
Aaron
ac710dfd54 revert(perf): remove group alias
There is no need for this feature

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-28 10:04:33 -07:00
Aaron
454b1edc32 infra: bump to dev version of 0.0.10.dev0 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-28 10:00:34 -07:00
Aaron
ef0ddebd60 infra: prepare for release 0.0.9 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
v0.0.9
2023-05-28 10:00:08 -07:00
Aaron
435129372e perf(cli): lazily cached start commands
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-28 09:59:43 -07:00
aarnphm-ec2-dev
9c1c4ca0bf perf(cli): using click instead of rich console
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-28 09:32:56 -07:00
aarnphm-ec2-dev
3f36d81744 infra: docs and normalize formatting
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-05-28 15:00:17 +00:00
aarnphm-ec2-dev
8ca488d8fc fix(stablelm): Ensure passing EOS_TOKEN_ID for generation
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-05-28 14:43:00 +00:00
Aaron
b4403c24b0 fix(model): Make sure we download the model before starting the
service

This will ensure we don't deadlock among processes

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-05-28 14:01:49 +00:00
Aaron
8e8d95cae9 feat: StableLM tuned and alpha
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-28 06:36:06 -07:00
Aaron
99d8e450c2 infra: cleanup unused bazel rules
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-28 06:09:32 -07:00
Aaron
78358dbb8d fix(type): configuration and dependencies
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-28 06:01:30 -07:00
Aaron
0df8d8b9a6 perf: reduce unnecessary object creation for config class
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-28 05:22:22 -07:00
Aaron
3fb1e5338a feat(dependencies): add optional for model
pretty print failed models loading due to missing dependencies

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-28 00:11:36 -07:00
Aaron
86b35105c1 deps: add beginning requirements for fine-tune
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-27 23:55:16 -07:00
Aaron
89a714b8c2 infra: bump to dev version of 0.0.9.dev0 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-27 23:42:56 -07:00
Aaron
ffc441a227 infra: prepare for release 0.0.8 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
v0.0.8
2023-05-27 23:42:39 -07:00
Aaron
6aca092bfc infra: prepare for 0.0.8 release
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-27 23:42:06 -07:00
aarnphm-ec2-dev
1e254a4cf3 feat: FalconLM
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-05-28 06:39:37 +00:00
Aaron
c84f653b77 feat(cli): add output for build and bundle
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-27 22:37:10 -07:00
Aaron
f24f13e6e4 chore(cli): consistency between table and json format
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-27 22:29:54 -07:00
Aaron
186658be63 docs: delay GPU to model check,
allow users to package and interact with models that require GPU even on devices without GPU

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-27 22:24:18 -07:00
Aaron
e0fc37e47f fix(docs): update docs about saving custom fine-tuned
and update annotations for client

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-27 21:15:44 -07:00
Aaron
fd48cbdeb2 infra: bump to dev version of 0.0.8.dev0 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-27 20:57:57 -07:00
Aaron
98cd06a686 infra: prepare for release 0.0.7 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
v0.0.7
2023-05-27 20:57:38 -07:00
Aaron
52d65f999f feat(telemetry): add support for usage tracking
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-27 20:39:13 -07:00
Aaron
a55817d647 feat(cli): update nicely formatted commands with shared output logics
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-27 15:51:03 -07:00
Aaron
fa895c329c feat: pre-commit setup
also sync JS release with Python version

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-27 06:54:22 -07:00