Aaron Pham
fc963c42ce
fix: build isolation (#116)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-16 01:52:21 -04:00
HeTaoPKU
fd9ae56812
fix(baichuan): add "cpm-kernel" as additional requirements (#117)
...
This is to support the 13b variant of baichuan
Co-authored-by: the <tao.he@hulu.com>
Co-authored-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-15 23:16:05 -04:00
HeTaoPKU
09b0787306
feat(models): Baichuan (#115)
...
Co-authored-by: the <tao.he@hulu.com>
Co-authored-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-15 22:01:37 -04:00
aarnphm-ec2-dev
d37d14e52b
fix(tests): mark package on CI to xfail
...
XXX: @aarnphm to solve build isolation when bandwidth allows. Currently
this is not a problem when running locally.
`openllm build` just works, whereas `openllm.build` won't work
sequentially.
Address some type stubs for jupytext
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-15 12:48:28 +00:00
Aaron Pham
b2dba6143f
fix(resource): correctly parse CUDA_VISIBLE_DEVICES (#114)
2023-07-15 07:19:35 -04:00
Aaron Pham
b291526248
revert: remove badges
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-07-11 14:44:36 -04:00
aarnphm-ec2-dev
e2ae24b74c
fix(tests): building not being isolated
...
We will need to fix this from BentoML
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-11 17:28:00 +00:00
aarnphm-ec2-dev
2950cffd5b
fix(save): set bento store and model store
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-11 01:30:39 +00:00
aarnphm-ec2-dev
cea082e7bd
fix(cli): correct prune based on metadata
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-11 00:34:22 +00:00
aarnphm-ec2-dev
c2bb29b4f3
fix: building mpt dependencies
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-11 00:21:23 +00:00
aarnphm-ec2-dev
61bfd64bd5
chore: decouple logs
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-11 00:08:47 +00:00
aarnphm-ec2-dev
cb2f030b5e
fix: add bento tag for default --format container
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-10 23:45:41 +00:00
aarnphm-ec2-dev
7824332a01
chore: remove auto workers
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-10 21:51:12 +00:00
Aaron Pham
c7f4dc7bb2
feat(test): snapshot testing (#107)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-10 17:23:19 -04:00
aarnphm-ec2-dev
d3e4b95e84
ci: use trusted pypi publisher
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-07 07:18:13 +00:00
Aaron Pham
fb849a384e
feat: GPTNeoX (#106)
2023-07-07 03:05:40 -04:00
aarnphm-ec2-dev
f9af643479
fix(llm): auto import models for first time running
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-06 13:42:03 +00:00
Aaron
ec4293091d
ci: wait for auto-bot to run check
...
only run evergreen on PR
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-05 10:53:45 -04:00
Aaron
ddb4fdeb87
infra: bump to dev version of 0.1.21.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-05 10:53:44 -04:00
Aaron
4e35172f05
infra: prepare for release 0.1.20 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
v0.1.20
2023-07-05 09:26:27 -04:00
Aaron
0892d10436
ci: increase postBuffer for release action
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-05 09:08:13 -04:00
Aaron
183019616b
fix: remove editable install
...
Setup via hatch already
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-05 08:50:06 -04:00
Aaron Pham
9f6b254086
qa: improvements and agents log (#105)
2023-07-05 08:39:31 -04:00
Aaron
44a225d2fd
fix(mpt): trust remote code
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-05 06:26:29 -04:00
Aaron
775c8c15a5
fix(tests): make sure the model is available on runner
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-04 15:11:58 -04:00
Aaron
f2f9b0a944
fix(bettertransformer): correctly set boolean
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-04 14:38:21 -04:00
aarnphm-ec2-dev
4c5b27495c
fix: bettertransformer check to bool already
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-04 17:58:02 +00:00
Aaron Pham
d6303d306a
perf: fixing import custom paths and cleanup serialisation (#102)
2023-07-04 12:49:14 -04:00
Aaron Pham
0e45ff7bb7
feat: add citation (#103)
2023-07-03 18:44:43 -04:00
Aaron Pham
8ac2755de4
feat(llm): fine-tuning Falcon (#98)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-06-30 21:25:16 -04:00
Aaron Pham
59b1d89971
feat: custom dockerfile templates (#95)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-06-30 13:44:11 -04:00
aarnphm-ec2-dev
e81203884b
fix(nightly-requirements): missing new lines [skip ci]
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-29 16:23:46 +00:00
aarnphm-ec2-dev
d3633a9430
chore(ci): update correct submodules for compiling triton [skip ci]
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-29 16:22:09 +00:00
Aaron Pham
f2457fcdaf
tests: add sanity check for openllm.client (#93)
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-29 12:07:00 -04:00
Aaron
e922bcb5b3
revert: regression
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-29 11:43:27 -04:00
Aaron
7e8ca79c2d
chore: style [skip ci]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-29 10:54:58 -04:00
Aaron Pham
e52045eda6
fix: running MPT on CPU (#92)
2023-06-29 10:54:12 -04:00
Aaron Pham
3a66b1f5ba
docs: add notes about fine-tuning
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-06-29 10:20:45 -04:00
Aaron
c5977c8fdd
chore(ci): add checkout version for 3.11 and sign push
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-29 02:55:33 -04:00
Aaron
c76dea468e
chore(ci): fixes release script to correct version
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-29 02:38:44 -04:00
Aaron
179b570181
infra: bump to dev version of 0.1.20.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-29 02:38:35 -04:00
Aaron Pham
07cc170866
infra: prepare for release 0.1.19 [generated]
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.1.19
2023-06-29 04:48:02 +00:00
Aaron
03bc04216b
fix(ci): fetch correct tags for branch
...
This reverts commit 47e49186e1d2b2bcca0153db1cd47efecbbbf82b.
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-29 00:41:05 -04:00
Aaron
7de2298f21
revert: "fix(ci): release and correct fetch tags in workflow_call"
...
This reverts commit 583e11196efb318e95a459a46d630d3d92ae42bf.
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-29 00:41:05 -04:00
Aaron
a411509ee2
fix(ci): release and correct fetch tags in workflow_call
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-29 00:41:05 -04:00
Aaron
5603e2cccc
infra: bump to dev version of 0.1.19.dev0 [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-29 00:41:04 -04:00
Aaron Pham
0297be6e01
infra: prepare for release 0.1.18 [generated]
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.1.18
2023-06-29 03:20:28 +00:00
Aaron Pham
01db504e7d
feat: MPT (#91)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-06-28 23:12:15 -04:00
Aaron Pham
5a4df53490
fix(load): tokenizer and adapter within a BentoLLM (#88)
2023-06-28 15:45:25 -04:00
Aaron Pham
bd4cc9b3ff
fix: loading local (#87)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-06-28 11:25:54 -04:00