Commit Graph

316 Commits

Author SHA1 Message Date
aarnphm-ec2-dev
c2bb29b4f3 fix: building mpt dependencies
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-11 00:21:23 +00:00
aarnphm-ec2-dev
61bfd64bd5 chore: decouple logs
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-11 00:08:47 +00:00
aarnphm-ec2-dev
cb2f030b5e fix: add bento tag for default --format container
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-10 23:45:41 +00:00
Aaron Pham
c7f4dc7bb2 feat(test): snapshot testing (#107)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-10 17:23:19 -04:00
Aaron Pham
fb849a384e feat: GPTNeoX (#106) 2023-07-07 03:05:40 -04:00
aarnphm-ec2-dev
f9af643479 fix(llm): auto import models for first time running
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-06 13:42:03 +00:00
Aaron
ddb4fdeb87 infra: bump to dev version of 0.1.21.dev0 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-05 10:53:44 -04:00
Aaron
4e35172f05 infra: prepare for release 0.1.20 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-05 09:26:27 -04:00
Aaron Pham
9f6b254086 qa: improvements and agents log (#105) 2023-07-05 08:39:31 -04:00
Aaron
44a225d2fd fix(mpt): trust remote code
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-05 06:26:29 -04:00
Aaron
f2f9b0a944 fix(bettertransformer): correctly set boolean
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-04 14:38:21 -04:00
aarnphm-ec2-dev
4c5b27495c fix: bettertransformer check to bool already
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-04 17:58:02 +00:00
Aaron Pham
d6303d306a perf: fixing import custom paths and cleanup serialisation (#102) 2023-07-04 12:49:14 -04:00
Aaron Pham
8ac2755de4 feat(llm): fine-tuning Falcon (#98)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-06-30 21:25:16 -04:00
Aaron Pham
59b1d89971 feat: custom dockerfile templates (#95)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-06-30 13:44:11 -04:00
Aaron
e922bcb5b3 revert: regression
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-29 11:43:27 -04:00
Aaron Pham
e52045eda6 fix: running MPT on CPU (#92) 2023-06-29 10:54:12 -04:00
Aaron
179b570181 infra: bump to dev version of 0.1.20.dev0 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-29 02:38:35 -04:00
Aaron Pham
07cc170866 infra: prepare for release 0.1.19 [generated]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-06-29 04:48:02 +00:00
Aaron
5603e2cccc infra: bump to dev version of 0.1.19.dev0 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-29 00:41:04 -04:00
Aaron Pham
0297be6e01 infra: prepare for release 0.1.18 [generated]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-06-29 03:20:28 +00:00
Aaron Pham
01db504e7d feat: MPT (#91)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-06-28 23:12:15 -04:00
Aaron Pham
5a4df53490 fix(load): tokenizer and adapter within a BentoLLM (#88) 2023-06-28 15:45:25 -04:00
Aaron Pham
bd4cc9b3ff fix: loading local (#87)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-06-28 11:25:54 -04:00
Aaron Pham
c0a9aa4314 feat: dict() protocol (#85)
* feat: dict() protocol

Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>

* ci: auto fixes from pre-commit.ci
For more information, see https://pre-commit.ci

---------

Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-06-28 04:00:29 +00:00
Aaron
868046cae5 infra: bump to dev version of 0.1.18.dev0 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-27 14:07:42 -04:00
Aaron Pham
8dd8acbb32 infra: prepare for release 0.1.17 [generated]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-06-27 17:48:07 +00:00
Aaron Pham
db1494a6ae feat(start): starting bento and fix load (#80) 2023-06-27 12:45:17 -04:00
aarnphm-ec2-dev
5d4b8ab449 fix(models): save from local path
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-27 03:35:32 +00:00
aarnphm-ec2-dev
861af508ec infra: bump to dev version of 0.1.17.dev0 [generated]
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-27 01:46:16 +00:00
Aaron Pham
ee7a754cfd infra: prepare for release 0.1.16 [generated]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-06-27 00:45:05 +00:00
Aaron
1c892826cc fix(chatglm): remove do_sample
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-27 00:07:55 +00:00
aarnphm-ec2-dev
828fec3575 infra: bump to dev version of 0.1.16.dev0 [generated]
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-27 00:07:54 +00:00
Aaron Pham
4a4d91a8d0 infra: prepare for release 0.1.15 [generated]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-06-26 22:40:45 +00:00
aarnphm-ec2-dev
fc103be6e8 fix(dolly): remove loading as pipeline
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-26 22:05:40 +00:00
aarnphm-ec2-dev
3bad48bfa2 chore(cli): add aliases for start with start-http
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-26 21:54:02 +00:00
Aaron Pham
d544764386 feat: cascading resource strategies (#72) 2023-06-26 17:38:49 -04:00
aarnphm-ec2-dev
e12505f70d fix: CLi when no models is available
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-26 10:08:19 +00:00
Aaron
c1556642ae infra: bump to dev version of 0.1.15.dev0 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-25 14:00:14 -04:00
Aaron Pham
e733fd55dc infra: prepare for release 0.1.14 [generated]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-06-25 17:55:43 +00:00
Aaron Pham
74fdd5e259 feat: release binary distribution (#66) 2023-06-25 10:38:03 -04:00
Aaron Pham
acb6a3cb32 fix: converting envvar to string (#68) 2023-06-25 03:40:45 -04:00
Aaron Pham
3593c764f0 fix(test): robustness (#64) 2023-06-24 11:10:07 -04:00
Aaron Pham
98328be394 peft(models): improve implementation (#60)
If you have a local Dolly-V2 version, please do `openllm prune`
2023-06-24 05:22:18 -04:00
Aaron Pham [bot]
3d9cc935d4 infra: bump to dev version of 0.1.14.dev0 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
2023-06-24 07:33:03 +00:00
Aaron Pham [bot]
91624580f3 infra: prepare for release 0.1.13 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
2023-06-24 07:29:20 +00:00
aarnphm-ec2-dev
3eac8cb12d fix(build): set null for first pass of service import
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-06-24 07:27:00 +00:00
Aaron Pham [bot]
0639c61941 infra: bump to dev version of 0.1.13.dev0 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
2023-06-24 01:30:31 +00:00
Aaron Pham [bot]
6268d00837 infra: prepare for release 0.1.12 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
2023-06-24 01:27:07 +00:00
Aaron Pham
1435478f6c fix(cli): ensure we parse tag for download (#58) 2023-06-23 21:24:53 -04:00