Aaron
fd48cbdeb2
infra: bump to dev version of 0.0.8.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-27 20:57:57 -07:00
Aaron
98cd06a686
infra: prepare for release 0.0.7 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
v0.0.7
2023-05-27 20:57:38 -07:00
Aaron
52d65f999f
feat(telemetry): add support for usage tracking
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-27 20:39:13 -07:00
Aaron
a55817d647
feat(cli): update nicely formatted commands with shared output logics
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-27 15:51:03 -07:00
Aaron
fa895c329c
feat: pre-commit setup
...
also sync JS release with Python version
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-27 06:54:22 -07:00
Aaron
775ddab941
fix(generation): correct type of top_k to int instead of float
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-27 05:23:44 -07:00
Aaron
c595b63736
infra: bump to dev version of 0.0.7.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-27 05:20:45 -07:00
Aaron
b9e1ca5514
infra: prepare for release 0.0.6 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
v0.0.6
2023-05-27 05:20:27 -07:00
Aaron
cf4e55c36f
fix(client): implement per client framework and model_name getters
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-27 05:20:07 -07:00
Aaron
aa72c16651
docs: update quickstart and remove unused yarn.lock
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-27 05:01:50 -07:00
Aaron
4da7aa4911
infra: bump to dev version of 0.0.6.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-27 04:48:52 -07:00
Aaron
6439ad55fc
infra: prepare for release 0.0.5 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
v0.0.5
2023-05-27 04:48:28 -07:00
Aaron Pham
0a37bac25d
feat(codegen): using black parser ( #5 )
2023-05-27 04:46:54 -07:00
Aaron
d64407150c
feat(script): Added ability to tag release
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-27 01:26:29 -07:00
Aaron
70bdaf9c05
infra: bump to dev version of 0.0.5.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-27 01:26:19 -07:00
Aaron
8ab7e87083
infra: prepare for release 0.0.4 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
v0.0.4
2023-05-27 01:21:34 -07:00
Aaron
f7c4142606
fix(infra): make sure release scripts will handle correct release
...
cycle
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-27 01:19:15 -07:00
Aaron
c73732db6f
fix(configuration): Make sure GenerationInput dumped the correct
...
dictionary for llm_config
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-27 01:01:32 -07:00
aarnphm-ec2-dev
8ee5b048f3
feat(client): Async and Sync client
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-26 22:51:21 -07:00
aarnphm-ec2-dev
7b5ddc1efa
fix(configuration): make sure we don't override per-request
...
configuration
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-05-27 03:13:40 +00:00
Aaron
b7f3a10910
refactor: migrate __init_subclass__ to Metaclass
...
LLMMetaclass will now responsible for generate internal attributes
add llm_type and identifying_params to Runnable class
subclass of openllm.LLM now can set a class attribute
__openllm_internal__ to let openllm knows that this is an internal class
implementation, instead of providing a _internal in the class
initialization.
support for preprocess_parameters and postprocess_parameters on client
side for better client UX
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-05-27 03:09:45 +00:00
aarnphm-ec2-dev
150ea33d2d
infra: bump to dev version [generated]
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-05-26 23:18:30 +00:00
aarnphm-ec2-dev
26df699a03
feat(tools): add release script
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-05-26 21:29:57 +00:00
Aaron
ee3d17fdfa
infra: prepare for 0.0.3 release
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-26 14:10:06 -07:00
Aaron
85252f13c4
fix(cli): simplify register code for start
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-26 01:45:03 -07:00
aarnphm-ec2-dev
4127961c5c
feat: openllm.client
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-05-26 07:17:28 +00:00
aarnphm-ec2-dev
ac933d60f1
fix(cli): Make sure to skips models that only runs on GPU
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-05-26 00:43:14 +00:00
dependabot[bot]
9cdc3545aa
build(deps): Bump bufbuild/buf-setup-action from 1.17.0 to 1.19.0 ( #2 )
...
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-05-25 16:23:04 -07:00
aarnphm-ec2-dev
fed17fafdc
migrate(configuration): remove deprecated max_length in favor of
...
max_new_tokens
Preparation for transformers 5
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-25 16:22:07 -07:00
aarnphm-ec2-dev
b502703f67
fix(chatglm): make sure to check for required dependencies cpm_kernels
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-25 16:22:07 -07:00
aarnphm-ec2-dev
5c416fa218
feat: StarCoder
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-25 16:22:07 -07:00
aarnphm-ec2-dev
3fe6b14dbf
fix(cli): make sure __main__ is not convoluted
...
CLI should lives under openllm.cli, and the actual click.Group can be
created from create_cli lazily
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-25 16:22:06 -07:00
aarnphm-ec2-dev
765c1a6e5c
feat: requires_gpu for specific LLM.
...
This will determine the behaviour of SUPPORTED_RESOURCES
TODO: Support TPU
supports for requirements for specific LLM
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-25 16:22:06 -07:00
aarnphm-ec2-dev
83a8a7cb4f
docs(codegen): make sure the generated dostring is correct
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-25 16:22:06 -07:00
aarnphm-ec2-dev
20b3a0260f
refactor: move Prompt object to client specific attributes
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-25 16:22:06 -07:00
aarnphm-ec2-dev
545515c01f
infra: Install BentoML from main and its auxilary dependencies
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-25 16:22:06 -07:00
aarnphm-ec2-dev
73d152fc77
feat(gpu): Make sure that we run models on GPU if available
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-24 19:31:22 -07:00
Aaron
135bafacaf
fix(chatglm): support MacOS deployment
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-24 05:06:06 -07:00
aarnphm-ec2-dev
9139360426
fix(coverage): Make sure to exclude the correct TYPE_CHECKING in
...
openllm
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-05-24 11:32:06 +00:00
Aaron
2676085b59
feat: chatglm and configuration naming type
...
by default, it is dasherize, but for cases like chatglm, it can be
lowercase as well
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-24 04:20:24 -07:00
Aaron Pham
11c7783a0e
fix(infra): feature_request.yml missing title
2023-05-23 16:54:38 -07:00
Aaron Pham
427106df98
fix(infra): bug_report.yml missing title
2023-05-23 16:54:09 -07:00
Aaron
162c021cae
feat(timeout): support server_timeout and LLM timeout
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-23 16:48:01 -07:00
Aaron
b1c07946c1
feat: dolly-v2 and general cleanup
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-23 14:27:27 -07:00
Aaron
a63cec8fa3
improve(flan-t5): update default generation config
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-16 12:20:38 -07:00
Aaron
602294b782
fix(start): silence error logs for now
...
respect BENTOML_HOME and BENTOML_DO_NOT_TRACK
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-16 12:19:23 -07:00
Aaron
549b0c54e9
feat: codegen and bundle build
...
fix configuration generation for runnable
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-15 18:22:25 -07:00
Aaron
d31d450526
feat: Adding central service definition and init openllm_client
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-15 00:33:05 -07:00
Aaron
2a53faee9c
infra: add structure and cleanup separation of tokenizer
...
since tokenizer are relatively light, all default LLM will bundle the
tokenizer with itself.
Maybe we can put the tokenizer in its own runner in the future
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-05 11:57:39 -07:00
Aaron
426a61713f
feat: start and start_grpc API
...
with_options listen from environment variable for said models.
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-05-05 11:07:52 -07:00