Aaron Pham
|
072b3e97ec
|
feat: 1.2 APIs (#821)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2024-03-15 03:49:19 -04:00 |
|
Zhao Shenyang
|
16d8caf2ee
|
chore: bump up bentoml version to 1.1.11 (#883)
|
2024-02-04 21:31:14 +08:00 |
|
Zhao Shenyang
|
9d0e292076
|
fix: limit BentoML version range (#881)
|
2024-02-04 16:59:21 +08:00 |
|
Aaron Pham
|
2bb97f8ba2
|
chore: update discord link (#838)
* Update pyproject.toml
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* Update pyproject.toml
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* Update pyproject.toml
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* Update pyproject.toml
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-01-08 19:09:51 -05:00 |
|
Aaron Pham
|
e9a89b7a7e
|
fix(cattrs): strictly lock <23.2 until we upgrade validation logic (#690)
fix(cattrs): strictly lock <23.2 until we move converter to upper version
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 17:11:15 -05:00 |
|
Aaron Pham
|
0891cde0b6
|
fix(dependencies): ignore broken cattrs release (#689)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 16:52:58 -05:00 |
|
Aaron Pham
|
80ed400646
|
fix(build): lock lower version based on each release and update infra (#686)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 15:57:31 -05:00 |
|
Aaron Pham
|
7398ae0486
|
refactor(strategies): move logics into openllm-python (#578)
fix(strategies): move to openllm
Strategies shouldn't be a part of openllm-core
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-08 02:23:08 -05:00 |
|
Aaron Pham
|
e2029c934b
|
perf: unify LLM interface (#518)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2023-11-06 20:39:43 -05:00 |
|
Aaron Pham
|
956b3a53bc
|
fix(gptq): use upstream integration (#297)
* wip
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
* feat: GPTQ transformers integration
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
* fix: only load if variable is available and add changelog
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
* chore: remove boilerplate check
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-09-04 14:05:50 -04:00 |
|
Aaron Pham
|
3ffb25a872
|
refactor: packages (#249)
|
2023-08-22 08:55:46 -04:00 |
|