pre-commit-ci[bot]
c2ed1d56da
chore(release): update base container restriction (#173)
...
Prepare for 0.2.12 release
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-08-01 15:25:17 -04:00
Aaron Pham
ef94c6b98a
feat(container): vLLM build and base image strategies (#142)
2023-07-31 02:44:52 -04:00
Aaron Pham
693631958a
feat(service): provisional API (#133)
2023-07-23 02:15:39 -04:00
Aaron Pham
c1ddb9ed7c
feat: GPTQ + vLLM and LLaMA (#113)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-19 18:12:12 -04:00
aarnphm-ec2-dev
dbca689c65
chore(stubs): delete invalid stubs [skip ci]
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-18 16:28:04 +00:00
dependabot[bot]
9833d2f46f
fix(ci): correct setup tests and auto-bot (#118)
...
Bump pypa/gh-action-pypi-publish from 1.8.7 to 1.8.8
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-17 14:37:46 -04:00
Aaron Pham
fc963c42ce
fix: build isolation (#116)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-16 01:52:21 -04:00
aarnphm-ec2-dev
d37d14e52b
fix(tests): mark package on CI to xfail
...
XXX: @aarnphm to solve build isolation when bandwidth allows. Currently
this is not a problem when running locally.
`openllm build` just works, whereas `openllm.build` won't work
sequentially.
Address some type stubs for jupytext
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-15 12:48:28 +00:00
Aaron Pham
b2dba6143f
fix(resource): correctly parse CUDA_VISIBLE_DEVICES (#114)
2023-07-15 07:19:35 -04:00
Aaron Pham
c7f4dc7bb2
feat(test): snapshot testing (#107)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-10 17:23:19 -04:00
Aaron Pham
d6303d306a
perf: fixing import custom paths and cleanup serialisation (#102)
2023-07-04 12:49:14 -04:00
Aaron Pham
e52045eda6
fix: running MPT on CPU (#92)
2023-06-29 10:54:12 -04:00
Aaron Pham
d544764386
feat: cascading resource strategies (#72)
2023-06-26 17:38:49 -04:00
pre-commit-ci[bot]
5a0d324134
ci: pre-commit autoupdate (#74)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-26 17:21:44 -04:00
Aaron Pham
dfca956fad
feat: serve adapter layers (#52)
2023-06-23 10:07:15 -04:00
Aaron
1ed0ae7787
fix(log): make sure to configure OpenLLM logs correctly
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-19 06:19:06 -04:00
Aaron Pham
03758a5487
fix(tools): adhere to style guidelines (#31)
2023-06-18 20:03:17 -04:00
Aaron Pham
f8ebb36e15
tests: fastpath (#17)
...
added fastpath cases for configuration and Flan-T5
fixes respecting model_id into lifecycle hooks.
update CLI to cleanup models info
2023-06-12 14:18:26 -04:00
Aaron
f8e99dd8f5
chore(configuration): clean house implementation
...
Using Attrs implementation
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-11 18:46:15 -04:00
aarnphm-ec2-dev
2e453fb005
refactor(configuration): __config__ and perf
...
move model_ids and default_id to config class declaration,
cleanup dependencies between config and LLM implementation
lazy load module during LLM creation to llm_post_init
fix post_init hooks to run load_in_mha.
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-11 12:53:15 +00:00
Aaron
d70530cb0e
chore: add stubs for deepmerge [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-10 03:04:56 -04:00
Aaron
49cb02d2f2
perf(cli): improve printing speed that respect terminal_size
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-02 06:58:11 -07:00
Aaron
873edc4121
chore(types): improve annotation for specified CLI
...
improve: Faster CLI improvement, cached hints to __openllm_hints__
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-02 01:01:00 -07:00
aarnphm-ec2-dev
a94294bc65
fix: generate attrs class internally to conform with interface
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-01 19:06:06 +00:00
Aaron
c33a90a0cc
chore: add annotations for attrs and eval correct annotation type
...
eval will be here once I find a different way to parse types into
python
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-31 17:11:20 -07:00
Aaron
fa895c329c
feat: pre-commit setup
...
also sync JS release with Python version
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-27 06:54:22 -07:00
Aaron
549b0c54e9
feat: codegen and bundle build
...
fix configuration generation for runnable
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-15 18:22:25 -07:00
Aaron
d31d450526
feat: Adding central service definition and init openllm_client
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-05-15 00:33:05 -07:00