Aaron
9ccbd60584
revert: include configuration to labels
...
This is used for starting up the bento
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-07-20 23:37:25 -04:00
aarnphm-ec2-dev
f91e750fcd
fix(build): remove configuration from labels
...
labels will only include model_id for it to work with bentocloud
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-21 03:30:59 +00:00
Aaron
347ffaadbe
chore(playground): generate default dir to not set
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-07-20 21:25:03 -04:00
aarnphm-ec2-dev
f5b1c8ec1b
fix(ft): correct set epochs args for TrainingArguments
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-21 01:20:56 +00:00
Aaron Pham
ee7fa63a50
ci: using tokens for publishing ( #129 )
2023-07-20 21:14:14 -04:00
Aaron
11f88b24ca
infra: bump to dev version of 0.2.3.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-07-20 21:03:02 -04:00
Aaron Pham
16118dd28f
infra: prepare for release 0.2.2 [generated]
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com >
v0.2.2
2023-07-21 00:46:36 +00:00
Aaron Pham
f56f8ee782
feat: fine-tuning script for LlaMA 2 ( #128 )
2023-07-20 20:44:51 -04:00
Aaron
c101103d37
infra: bump to dev version of 0.2.2.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-07-20 18:51:00 -04:00
Aaron Pham
804b30adc4
infra: prepare for release 0.2.1 [generated]
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com >
v0.2.1
2023-07-20 22:38:27 +00:00
Aaron
5189d2e721
fix(script): correct patch version for __about__.py
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-07-20 18:35:31 -04:00
aarnphm-ec2-dev
ea07ff6ce9
fix(llama): loose requirements for running llama in container
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-20 22:07:14 +00:00
aarnphm-ec2-dev
c88950655c
fix(ci): make sure to run publish and dev prep correctly
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-20 22:05:05 +00:00
aarnphm-ec2-dev
b31cd0460b
fix: correct tag inference for model-id
...
in the case of build, the model_id is passed as a full valid tag under
bento store
XXX: We will need to fix this later
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-20 21:40:56 +00:00
aarnphm-ec2-dev
3e50f0a851
fix(cli): implement latest bentoml 1.0.25 features
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-20 20:51:27 +00:00
Aaron Pham
858c2007c3
feat: revision parsed via model_id ( #126 )
2023-07-20 14:36:53 -04:00
aarnphm-ec2-dev
a056365d48
fix(ci): always run create coverage
...
this is to stop evergreen to fail on main
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-20 13:01:32 +00:00
Aaron Pham
1b3508619e
feat(llama): add default prompt for LlaMA-2 ( #122 )
2023-07-20 07:46:33 -04:00
aarnphm-ec2-dev
ee3a00514a
infra: bump to dev version of 0.2.1dev0 [generated]
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-20 00:02:45 +00:00
aarnphm-ec2-dev
5f874da4e2
fix: broken logics for upload
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-19 23:55:35 +00:00
Aaron Pham
f9ca164e73
infra: prepare for release 0.2.0 [generated]
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com >
v0.2.0
2023-07-19 23:43:52 +00:00
aarnphm-ec2-dev
292bca68c7
fix(ci): releas script correctly parse version
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-19 23:41:33 +00:00
aarnphm-ec2-dev
4beb040cfd
chore(ci): remove macos packaging
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-19 23:35:49 +00:00
aarnphm-ec2-dev
e69042a8e9
fix(script): not gpg signing tag
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-19 23:12:04 +00:00
aarnphm-ec2-dev
9d28e0a1e6
fix(ci): disable gpg push
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-19 23:09:27 +00:00
aarnphm-ec2-dev
b747e3b4b8
fix(ci): remove signing
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-19 23:01:53 +00:00
Aaron
d92b136780
chore(llama): remove decapoda vairants
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-07-19 18:58:04 -04:00
Aaron
b6679be301
fix: release script not to signed
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-07-19 18:48:01 -04:00
aarnphm-ec2-dev
8b340559aa
fix(tests): skip running models tests on CI
...
The runners don't have enough space to run all tests
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-19 22:40:40 +00:00
aarnphm-ec2-dev
e319a2977f
fix(ci): editable install
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-19 22:29:29 +00:00
Aaron Pham
c1ddb9ed7c
feat: GPTQ + vLLM and LlaMA ( #113 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-19 18:12:12 -04:00
aarnphm-ec2-dev
dbca689c65
chore(stubs): delete invalid stubs [skip ci]
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-18 16:28:04 +00:00
pre-commit-ci[bot]
b297ec1109
ci: pre-commit autoupdate [pre-commit.ci] ( #119 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com >
2023-07-17 16:29:29 -04:00
dependabot[bot]
9833d2f46f
fix(ci): correct setup tests and auto-bot ( #118 )
...
Bump pypa/gh-action-pypi-publish from 1.8.7 to 1.8.8
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-07-17 14:37:46 -04:00
Aaron
674d0c7c10
chore(ci): ignore pdm [skip ci]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-07-16 14:26:41 -04:00
aarnphm-ec2-dev
5bb95652db
chore(ci): skip large models
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-16 05:53:53 +00:00
Aaron Pham
fc963c42ce
fix: build isolation ( #116 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-16 01:52:21 -04:00
HeTaoPKU
fd9ae56812
fix(baichuan): add "cpm-kernel" as additional requirements ( #117 )
...
This is to support the 13b variant of baichuan
Co-authored-by: the <tao.he@hulu.com >
Co-authored-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-07-15 23:16:05 -04:00
HeTaoPKU
09b0787306
feat(models): Baichuan ( #115 )
...
Co-authored-by: the <tao.he@hulu.com >
Co-authored-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-07-15 22:01:37 -04:00
aarnphm-ec2-dev
d37d14e52b
fix(tests): mark package on CI to xfail
...
XXX: @aarnphm to solve build isolation when have bandwidth. Currently
this is not a problem when running locally.
`openllm build` just works, where as `openllm.build` won't work
sequentially.
Address some type stubs for jupytext
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-15 12:48:28 +00:00
Aaron Pham
b2dba6143f
fix(resource): correctly parse CUDA_VISIBLE_DEVICES ( #114 )
2023-07-15 07:19:35 -04:00
Aaron Pham
b291526248
revert: remove badges
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com >
2023-07-11 14:44:36 -04:00
aarnphm-ec2-dev
e2ae24b74c
fix(tests): building not being isolated
...
We will need to fix this from BentoML
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-11 17:28:00 +00:00
aarnphm-ec2-dev
2950cffd5b
fix(save): set bento store and model store
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-11 01:30:39 +00:00
aarnphm-ec2-dev
cea082e7bd
fix(cli): correct prune based on metadata
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-11 00:34:22 +00:00
aarnphm-ec2-dev
c2bb29b4f3
fix: building mpt dependencies
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-11 00:21:23 +00:00
aarnphm-ec2-dev
61bfd64bd5
chore: decouple logs
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-11 00:08:47 +00:00
aarnphm-ec2-dev
cb2f030b5e
fix: add bento tag for default --format container
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-10 23:45:41 +00:00
aarnphm-ec2-dev
7824332a01
chore: remove auto workers
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-07-10 21:51:12 +00:00
Aaron Pham
c7f4dc7bb2
feat(test): snapshot testing ( #107 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-10 17:23:19 -04:00