Aaron Pham
|
bbd9aa7646
|
refactor(contrib): similar namespace [clojure-ui build] (#251)
|
2023-08-23 00:21:59 -04:00 |
|
aarnphm-ec2-dev
|
1488fbb167
|
chore(style): enable yapf to match with style guidelines
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-08-22 14:03:06 +00:00 |
|
Aaron Pham
|
3ffb25a872
|
refactor: packages (#249)
|
2023-08-22 08:55:46 -04:00 |
|
Aaron
|
9fb46e1676
|
chore(release): add manual workflow dispatch run on new release tag
[skip ci]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-08-17 16:38:58 -04:00 |
|
Aaron Pham
|
4140d160b8
|
feat(embedding): Adding generic endpoint (#227)
|
2023-08-17 15:17:00 -04:00 |
|
Aaron Pham
|
665233c30f
|
chore: conditional commit for running jobs (#232)
|
2023-08-17 10:13:53 -04:00 |
|
Aaron Pham
|
d7a6859c40
|
chore(gh): use setup-bentoml-action (#230)
|
2023-08-17 08:34:35 -04:00 |
|
GutZuFusss
|
4cad367ab5
|
feat(contrib): ClojureScript UI (#89)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-08-16 03:30:44 -04:00 |
|
Aaron Pham
|
58527032e0
|
feat: add default python version for development [skip ci] (#212)
|
2023-08-15 02:39:43 -04:00 |
|
Aaron Pham
|
cd872ef631
|
refactor: monorepo (#203)
|
2023-08-15 02:11:14 -04:00 |
|
Aaron Pham
|
f6317d8003
|
infra: enable compiled wheels for all supported Python (#201)
|
2023-08-12 04:54:50 -04:00 |
|
Aaron Pham
|
5329853b10
|
perf: compiled modules and enable lazyeval (#200)
|
2023-08-11 05:53:45 -04:00 |
|
Aaron Pham
|
c083990edd
|
infra: migrate to initial openllm-node library (#199)
|
2023-08-10 18:54:00 -04:00 |
|
aarnphm-ec2-dev
|
dfc4b489c5
|
feat(build): notes on compiled wheels for Bento
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-08-09 21:52:34 +00:00 |
|
Aaron Pham
|
b1445c6516
|
refactor(cli): compiled wheels and extension modules (#191)
|
2023-08-09 17:10:15 -04:00 |
|
Aaron
|
ae11e487d9
|
fix(brew): specific installation from gzip [skip ci]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-08-08 22:32:11 -04:00 |
|
Aaron
|
21143fdfab
|
fix(brew): set correct url for release
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-08-08 22:18:26 -04:00 |
|
Aaron Pham
|
b9dd54f634
|
feat: homebrew tap (#190)
|
2023-08-08 22:11:48 -04:00 |
|
Aaron Pham
|
21ea7e493f
|
feat(generation): initial work for generating tokens (#186)
|
2023-08-06 20:04:40 -04:00 |
|
Aaron Pham
|
2541a0f8dc
|
infra: initial work on compiling mypyc wheels (#182)
|
2023-08-04 10:20:03 -04:00 |
|
pre-commit-ci[bot]
|
c2ed1d56da
|
chore(release): update base container restriction (#173)
Prepare for 0.2.12 release
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-08-01 15:25:17 -04:00 |
|
Aaron Pham
|
8c2867d26d
|
style: define experimental guidelines (#168)
|
2023-07-31 07:54:26 -04:00 |
|
Aaron Pham
|
ef94c6b98a
|
feat(container): vLLM build and base image strategies (#142)
|
2023-07-31 02:44:52 -04:00 |
|
Aaron Pham
|
c391717226
|
feat(ci): automatic release semver + git archival installation (#143)
|
2023-07-25 04:18:49 -04:00 |
|
aarnphm-ec2-dev
|
084786c898
|
fix(cli): `openllm models` for showing available
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-24 23:00:03 +00:00 |
|
Aaron Pham
|
7eabcd4355
|
feat: vLLM integration for PagedAttention (#134)
|
2023-07-24 15:42:17 -04:00 |
|
aarnphm-ec2-dev
|
e4ac0ed8b7
|
fix(cuda): support loading in single GPU
add available_devices for getting # of available GPUs
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-21 08:10:01 +00:00 |
|
Aaron Pham
|
f56f8ee782
|
feat: fine-tuning script for LlaMA 2 (#128)
|
2023-07-20 20:44:51 -04:00 |
|
aarnphm-ec2-dev
|
3e50f0a851
|
fix(cli): implement latest bentoml 1.0.25 features
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-20 20:51:27 +00:00 |
|
Aaron Pham
|
1b3508619e
|
feat(llama): add default prompt for LlaMA-2 (#122)
|
2023-07-20 07:46:33 -04:00 |
|
Aaron Pham
|
c1ddb9ed7c
|
feat: GPTQ + vLLM and LlaMA (#113)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2023-07-19 18:12:12 -04:00 |
|
Aaron Pham
|
fc963c42ce
|
fix: build isolation (#116)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2023-07-16 01:52:21 -04:00 |
|
HeTaoPKU
|
fd9ae56812
|
fix(baichuan): add "cpm-kernel" as additional requirements (#117)
This is to support the 13b variant of baichuan
Co-authored-by: the <tao.he@hulu.com>
Co-authored-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-07-15 23:16:05 -04:00 |
|
HeTaoPKU
|
09b0787306
|
feat(models): Baichuan (#115)
Co-authored-by: the <tao.he@hulu.com>
Co-authored-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-07-15 22:01:37 -04:00 |
|
Aaron Pham
|
b2dba6143f
|
fix(resource): correctly parse CUDA_VISIBLE_DEVICES (#114)
|
2023-07-15 07:19:35 -04:00 |
|
aarnphm-ec2-dev
|
c2bb29b4f3
|
fix: building mpt dependencies
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-11 00:21:23 +00:00 |
|
Aaron Pham
|
c7f4dc7bb2
|
feat(test): snapshot testing (#107)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2023-07-10 17:23:19 -04:00 |
|
Aaron Pham
|
fb849a384e
|
feat: GPTNeoX (#106)
|
2023-07-07 03:05:40 -04:00 |
|
Aaron Pham
|
d6303d306a
|
perf: fixing import custom paths and cleanup serialisation (#102)
|
2023-07-04 12:49:14 -04:00 |
|
Aaron Pham
|
8ac2755de4
|
feat(llm): fine-tuning Falcon (#98)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2023-06-30 21:25:16 -04:00 |
|
aarnphm-ec2-dev
|
e81203884b
|
fix(nightly-requirements): missing new lines [skip ci]
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-06-29 16:23:46 +00:00 |
|
aarnphm-ec2-dev
|
d3633a9430
|
chore(ci): update correct submodules for compiling triton [skip ci]
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-06-29 16:22:09 +00:00 |
|
Aaron Pham
|
e52045eda6
|
fix: running MPT on CPU (#92)
|
2023-06-29 10:54:12 -04:00 |
|
Aaron Pham
|
01db504e7d
|
feat: MPT (#91)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2023-06-28 23:12:15 -04:00 |
|
Aaron Pham
|
bd4cc9b3ff
|
fix: loading local (#87)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2023-06-28 11:25:54 -04:00 |
|
Aaron Pham
|
db1494a6ae
|
feat(start): starting bento and fix load (#80)
|
2023-06-27 12:45:17 -04:00 |
|
Aaron
|
6e281cd4cd
|
chore: simplify actions
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-25 11:05:16 -04:00 |
|
Aaron
|
bcf3ef76f3
|
revert: "chore: script to exit on error"
This reverts commit e1ce8f9c20.
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-25 10:43:46 -04:00 |
|
Aaron
|
e1ce8f9c20
|
chore: script to exit on error
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-25 10:41:48 -04:00 |
|
Aaron Pham
|
74fdd5e259
|
feat: release binary distribution (#66)
|
2023-06-25 10:38:03 -04:00 |
|