Aaron Pham
|
2bb97f8ba2
|
chore: update discord link (#838)
* Update pyproject.toml
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* Update pyproject.toml
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* Update pyproject.toml
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* Update pyproject.toml
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-01-08 19:09:51 -05:00 |
|
Aaron Pham
|
79da419d87
|
chore(deps): bump vllm to 0.2.7 (#837)
* chore(deps): bump vllm to 0.2.7
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: update changelog
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2024-01-08 14:41:58 -05:00 |
|
Aaron Pham
|
b09bd20750
|
infra: prepare for release 0.4.41 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-12-18 18:08:46 +00:00 |
|
Aaron Pham
|
8d63afc9ce
|
feat(vllm): support GPTQ with 0.2.6 (#797)
* feat(vllm): GPTQ support passthrough
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: run scripts
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* fix(install): set order of xformers before vllm
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* feat: support GPTQ with vLLM
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-12-18 12:41:19 -05:00 |
|
Aaron Pham
|
2e8fc284f5
|
infra: prepare for release 0.4.40 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-12-15 16:46:12 +00:00 |
|
Aaron Pham
|
88b6d3d6de
|
perf: upgrade mixtral to use expert parallelism (#783)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-12-15 11:45:08 -05:00 |
|
Aaron Pham
|
d4fbbcee34
|
infra: prepare for release 0.4.39 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-12-14 19:20:01 +00:00 |
|
Aaron Pham
|
1dbae67172
|
infra: prepare for release 0.4.38 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-12-13 23:27:41 +00:00 |
|
Aaron Pham
|
8d9d212d61
|
infra: prepare for release 0.4.37 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-12-13 14:07:33 +00:00 |
|
Aaron Pham
|
9cd1e44b1e
|
infra: prepare for release 0.4.36 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-12-12 06:34:39 +00:00 |
|
Aaron Pham
|
d3328343d7
|
feat: mixtral support (#770)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-12-12 01:33:13 -05:00 |
|
Aaron
|
59e8ef93dc
|
chore(deps): lock vLLM to 0.2.4
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-12-12 00:17:18 -05:00 |
|
Aaron Pham
|
8019fd84c8
|
infra: prepare for release 0.4.35 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-12-07 08:38:13 +00:00 |
|
Aaron Pham
|
81688e0949
|
infra: prepare for release 0.4.34 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-30 12:17:48 +00:00 |
|
yansheng
|
3cb7f14fc1
|
feat(models): Support qwen (#742)
* support qwen
* support qwen
* ci: auto fixes from pre-commit.ci
For more information, see https://pre-commit.ci
* Update openllm-core/src/openllm_core/config/configuration_qwen.py
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* chore: update correct readme and supports qwen models
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Co-authored-by: root <yansheng105@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-30 06:54:17 -05:00 |
|
Aaron Pham
|
9fa0dee406
|
infra: prepare for release 0.4.33 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-29 18:01:13 +00:00 |
|
Aaron Pham
|
69deedd9b8
|
infra: prepare for release 0.4.32 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-29 07:39:42 +00:00 |
|
Aaron Pham
|
77af72ed2a
|
infra: prepare for release 0.4.31 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-26 23:49:04 +00:00 |
|
Aaron Pham
|
e157d3aa9e
|
infra: prepare for release 0.4.30 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-26 09:54:42 +00:00 |
|
Aaron Pham
|
f7a803dfa2
|
infra: prepare for release 0.4.29 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-26 07:50:48 +00:00 |
|
Aaron Pham
|
e27764fe6b
|
infra: prepare for release 0.4.28 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-24 07:09:06 +00:00 |
|
Aaron Pham
|
d8a783772d
|
infra: prepare for release 0.4.27 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-24 06:25:16 +00:00 |
|
Aaron
|
d0e12b1fb8
|
fix(metadata): remove unused packages
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-24 01:19:09 -05:00 |
|
Aaron Pham
|
5442d9cd10
|
fix(trust_remote_code): handle args correctly (#727)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-22 17:03:13 -05:00 |
|
Aaron Pham
|
7eae50377d
|
infra: prepare for release 0.4.26 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-22 11:50:50 +00:00 |
|
Aaron Pham
|
0189342730
|
infra: prepare for release 0.4.25 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-22 09:22:45 +00:00 |
|
Aaron Pham
|
7f09f9daf2
|
infra: prepare for release 0.4.24 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-22 06:34:30 +00:00 |
|
Aaron Pham
|
85e03a4b92
|
infra: prepare for release 0.4.23 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-22 06:16:49 +00:00 |
|
Aaron Pham
|
f3fd32d596
|
infra: prepare for release 0.4.22 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-21 01:38:46 +00:00 |
|
Aaron Pham
|
4c4bc82a47
|
infra: prepare for release 0.4.21 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-20 22:32:44 +00:00 |
|
Aaron Pham
|
204cbd43d2
|
infra: prepare for release 0.4.20 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-20 22:09:47 +00:00 |
|
Aaron Pham
|
46d6fcca98
|
infra: prepare for release 0.4.19 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-20 08:06:53 +00:00 |
|
Aaron Pham
|
c1f86bda16
|
infra: prepare for release 0.4.18 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-20 05:15:14 +00:00 |
|
Aaron Pham
|
6505abdb44
|
chore: update lower bound version of bentoml to avoid breakage (#703)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-19 23:09:14 -05:00 |
|
Aaron Pham
|
d1915d7a9e
|
infra: prepare for release 0.4.17 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-20 03:43:21 +00:00 |
|
Aaron Pham
|
e9207ff683
|
infra: prepare for release 0.4.16 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-19 15:41:03 +00:00 |
|
Aaron
|
cb4386b013
|
fix(release): remove unecessary check for client dependencies [skip ci]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-19 10:39:38 -05:00 |
|
Aaron Pham
|
539f250c0f
|
feat(vllm): bump to 0.2.2 (#695)
* feat(vllm): bump to 0.2.2
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: update changelog
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: move up to CUDA 12.1
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* fix: remove auto-gptq installation
since the builder image doesn't have access to GPU
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* fix: update containerization warning
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-19 02:52:32 -05:00 |
|
Aaron Pham
|
206521e02d
|
feat(ctranslate): initial infrastructure support (#694)
* perf: compact and improve speed and agility
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* --wip--
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: cleanup infrastructure
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: update styles notes and autogen mypy configuration
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-19 01:48:33 -05:00 |
|
Aaron Pham
|
c19654adf3
|
infra: prepare for release 0.4.15 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-19 00:47:18 +00:00 |
|
Aaron Pham
|
5402db1e61
|
infra: prepare for release 0.4.14 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 21:54:10 +00:00 |
|
Aaron Pham
|
e14f3ffed5
|
infra: prepare for release 0.4.13 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 21:06:56 +00:00 |
|
Aaron Pham
|
80ed400646
|
fix(build): lock lower version based on each release and update infra (#686)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 15:57:31 -05:00 |
|
Aaron Pham
|
44f6db982d
|
infra: remove codegolf (#671)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 17:38:47 -05:00 |
|
Aaron Pham
|
6102a67a83
|
infra: makes huggingface-hub requirements on fine-tune (#665)
infra: makes huggingface-hub core deps
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 03:12:52 -05:00 |
|
Aaron Pham
|
4a6f13ddd2
|
feat(type): provide structured annotations stubs (#663)
* feat(type): provide client stubs
separation of concern for more brevity code base
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* docs: update changelog
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 02:58:45 -05:00 |
|
Aaron Pham
|
103156cd71
|
chore(cli): move playground to CLI components (#655)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-14 23:20:50 -05:00 |
|
Aaron Pham
|
0bf6ec7537
|
fix(dependencies): lock build < 1 for now (#643)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-14 00:36:08 -05:00 |
|
Zhao Shenyang
|
ae69524749
|
doc: update adding new model guide (#637)
* update
* Update openllm-python/ADDING_NEW_MODEL.md
Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Zhao Shenyang <dev@zsy.im>
* Update openllm-python/ADDING_NEW_MODEL.md
Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Zhao Shenyang <dev@zsy.im>
* Update openllm-python/ADDING_NEW_MODEL.md
Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Zhao Shenyang <dev@zsy.im>
* move ADDING_NEW_MODEL.md to git root directory
---------
Signed-off-by: Zhao Shenyang <dev@zsy.im>
Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-13 18:30:44 -05:00 |
|
Aaron Pham
|
e0632a85ed
|
refactor(cli): move out to its own packages (#619)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-12 18:25:44 -05:00 |
|