Aaron Pham
|
97d76eec85
|
tests: add additional basic testing (#982)
* chore: update rebase tests
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* chore: update partial clients before removing
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* fix: update clients parsing logics to work with 0.5
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* chore: ignore ci runs as to run locally
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* chore: update async client tests
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* chore: update pre-commit
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-23 10:02:23 -04:00 |
|
Aaron Pham
|
072b3e97ec
|
feat: 1.2 APIs (#821)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2024-03-15 03:49:19 -04:00 |
|
Zhao Shenyang
|
16d8caf2ee
|
chore: bump up bentoml version to 1.1.11 (#883)
|
2024-02-04 21:31:14 +08:00 |
|
Zhao Shenyang
|
9d0e292076
|
fix: limit BentoML version range (#881)
|
2024-02-04 16:59:21 +08:00 |
|
Aaron Pham
|
2bb97f8ba2
|
chore: update discord link (#838)
* Update pyproject.toml
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* Update pyproject.toml
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* Update pyproject.toml
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* Update pyproject.toml
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-01-08 19:09:51 -05:00 |
|
Aaron Pham
|
80ed400646
|
fix(build): lock lower version based on each release and update infra (#686)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 15:57:31 -05:00 |
|
Aaron Pham
|
fad4186dbc
|
feat(server): helpers endpoints for conversation format (#613)
* feat: add support for helpers conversation conversion endpoint
also correct schema generation for openllm client
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: update clients to reuse `openllm-core` logics
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: add changelog
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-12 01:02:27 -05:00 |
|
Aaron Pham
|
c41828f68f
|
feat(client): support authentication token and shim implementation (#605)
* chore: synch generate_iterator to be the same as server
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* --wip--
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* wip
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* feat: cleanup shim implementation
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* ci: auto fixes from pre-commit.ci
For more information, see https://pre-commit.ci
* chore: fix pre-commit
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: update changelog
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: update check with tuple
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2023-11-10 17:44:31 -05:00 |
|
Aaron Pham
|
97d7c38fea
|
refactor: cleanup typing to expose correct API (#576)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-08 01:24:03 -05:00 |
|
Aaron Pham
|
1539c3f7dc
|
feat(client): simple implementation and streaming (#256)
|
2023-10-12 17:21:54 -04:00 |
|
Aaron Pham
|
956b3a53bc
|
fix(gptq): use upstream integration (#297)
* wip
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
* feat: GPTQ transformers integration
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
* fix: only load if variable is available and add changelog
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
* chore: remove boilerplate check
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-09-04 14:05:50 -04:00 |
|
Aaron Pham
|
3ffb25a872
|
refactor: packages (#249)
|
2023-08-22 08:55:46 -04:00 |
|