Commit Graph

465 Commits

Author SHA1 Message Date
aarnphm-ec2-dev
5f874da4e2 fix: broken logics for upload
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 23:55:35 +00:00
Aaron Pham
f9ca164e73 infra: prepare for release 0.2.0 [generated]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.2.0
2023-07-19 23:43:52 +00:00
aarnphm-ec2-dev
292bca68c7 fix(ci): releas script correctly parse version
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 23:41:33 +00:00
aarnphm-ec2-dev
4beb040cfd chore(ci): remove macos packaging
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 23:35:49 +00:00
aarnphm-ec2-dev
e69042a8e9 fix(script): not gpg signing tag
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 23:12:04 +00:00
aarnphm-ec2-dev
9d28e0a1e6 fix(ci): disable gpg push
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 23:09:27 +00:00
aarnphm-ec2-dev
b747e3b4b8 fix(ci): remove signing
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 23:01:53 +00:00
Aaron
d92b136780 chore(llama): remove decapoda vairants
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-19 18:58:04 -04:00
Aaron
b6679be301 fix: release script not to signed
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-19 18:48:01 -04:00
aarnphm-ec2-dev
8b340559aa fix(tests): skip running models tests on CI
The runners don't have enough space to run all tests

Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 22:40:40 +00:00
aarnphm-ec2-dev
e319a2977f fix(ci): editable install
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 22:29:29 +00:00
Aaron Pham
c1ddb9ed7c feat: GPTQ + vLLM and LlaMA (#113)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-19 18:12:12 -04:00
aarnphm-ec2-dev
dbca689c65 chore(stubs): delete invalid stubs [skip ci]
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-18 16:28:04 +00:00
pre-commit-ci[bot]
b297ec1109 ci: pre-commit autoupdate [pre-commit.ci] (#119)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-07-17 16:29:29 -04:00
dependabot[bot]
9833d2f46f fix(ci): correct setup tests and auto-bot (#118)
Bump pypa/gh-action-pypi-publish from 1.8.7 to 1.8.8

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-17 14:37:46 -04:00
Aaron
674d0c7c10 chore(ci): ignore pdm [skip ci]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-16 14:26:41 -04:00
aarnphm-ec2-dev
5bb95652db chore(ci): skip large models
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-16 05:53:53 +00:00
Aaron Pham
fc963c42ce fix: build isolation (#116)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-16 01:52:21 -04:00
HeTaoPKU
fd9ae56812 fix(baichuan): add "cpm-kernel" as additional requirements (#117)
This is to support the 13b variant of baichuan

Co-authored-by: the <tao.he@hulu.com>
Co-authored-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-15 23:16:05 -04:00
HeTaoPKU
09b0787306 feat(models): Baichuan (#115)
Co-authored-by: the <tao.he@hulu.com>
Co-authored-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-15 22:01:37 -04:00
aarnphm-ec2-dev
d37d14e52b fix(tests): mark package on CI to xfail
XXX: @aarnphm to solve build isolation when have bandwidth. Currently
this is not a problem when running locally.

`openllm build` just works, where as `openllm.build` won't work
sequentially.

Address some type stubs for jupytext

Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-15 12:48:28 +00:00
Aaron Pham
b2dba6143f fix(resource): correctly parse CUDA_VISIBLE_DEVICES (#114) 2023-07-15 07:19:35 -04:00
Aaron Pham
b291526248 revert: remove badges
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-07-11 14:44:36 -04:00
aarnphm-ec2-dev
e2ae24b74c fix(tests): building not being isolated
We will need to fix this from BentoML

Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-11 17:28:00 +00:00
aarnphm-ec2-dev
2950cffd5b fix(save): set bento store and model store
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-11 01:30:39 +00:00
aarnphm-ec2-dev
cea082e7bd fix(cli): correct prune based on metadata
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-11 00:34:22 +00:00
aarnphm-ec2-dev
c2bb29b4f3 fix: building mpt dependencies
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-11 00:21:23 +00:00
aarnphm-ec2-dev
61bfd64bd5 chore: decouple logs
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-11 00:08:47 +00:00
aarnphm-ec2-dev
cb2f030b5e fix: add bento tag for default --format container
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-10 23:45:41 +00:00
aarnphm-ec2-dev
7824332a01 chore: remove auto workers
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-10 21:51:12 +00:00
Aaron Pham
c7f4dc7bb2 feat(test): snapshot testing (#107)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-10 17:23:19 -04:00
aarnphm-ec2-dev
d3e4b95e84 ci: use trusted pypi publisher
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-07 07:18:13 +00:00
Aaron Pham
fb849a384e feat: GPTNeoX (#106) 2023-07-07 03:05:40 -04:00
aarnphm-ec2-dev
f9af643479 fix(llm): auto import models for first time running
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-06 13:42:03 +00:00
Aaron
ec4293091d ci: wait for auto-bot to run check
only run evergreen on PR

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-05 10:53:45 -04:00
Aaron
ddb4fdeb87 infra: bump to dev version of 0.1.21.dev0 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-05 10:53:44 -04:00
Aaron
4e35172f05 infra: prepare for release 0.1.20 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
v0.1.20
2023-07-05 09:26:27 -04:00
Aaron
0892d10436 ci: increase postBuffer for release action
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-05 09:08:13 -04:00
Aaron
183019616b fix: remove editable install
Setup via hatch already

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-05 08:50:06 -04:00
Aaron Pham
9f6b254086 qa: improvements and agents log (#105) 2023-07-05 08:39:31 -04:00
Aaron
44a225d2fd fix(mpt): trust remote code
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-05 06:26:29 -04:00
Aaron
775c8c15a5 fix(tests): make sure the model is available on runner
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-04 15:11:58 -04:00
Aaron
f2f9b0a944 fix(bettertransformer): correctly set boolean
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-04 14:38:21 -04:00
aarnphm-ec2-dev
4c5b27495c fix: bettertransformer check to bool already
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-04 17:58:02 +00:00
Aaron Pham
d6303d306a perf: fixing import custom paths and cleanup serialisation (#102) 2023-07-04 12:49:14 -04:00
Aaron Pham
0e45ff7bb7 feat: add citation (#103) 2023-07-03 18:44:43 -04:00
Aaron Pham
8ac2755de4 feat(llm): fine-tuning Falcon (#98)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-06-30 21:25:16 -04:00
Aaron Pham
59b1d89971 feat: custom dockerfile templates (#95)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-06-30 13:44:11 -04:00
aarnphm-ec2-dev
e81203884b fix(nightly-requirements): missing new lines [skip ci]
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-29 16:23:46 +00:00
aarnphm-ec2-dev
d3633a9430 chore(ci): update correct submodules for compiling triton [skip ci]
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-29 16:22:09 +00:00