paperspace
|
a93da12084
|
chore: upgrade to new vLLM schema
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-06-02 15:52:45 +00:00 |
|
Aaron Pham
|
bf28f977bc
|
feat(models): command-r (#1005)
* feat(models): add support for command-r
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* feat(models): support command-r and remove deadcode and extensions
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* chore: update local.sh script
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-06-02 10:16:08 -04:00 |
|
Aaron Pham
|
9649073713
|
infra: prepare for release 0.5.4 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-06-01 00:37:27 +00:00 |
|
Aaron Pham
|
162458ffe6
|
infra: prepare for release 0.5.3 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-30 21:30:21 +00:00 |
|
Aaron Pham
|
49908ec289
|
infra: prepare for release 0.5.2 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-29 04:44:59 +00:00 |
|
paperspace
|
ef11e54a6d
|
chore: update docs and base instruction [skip ci]
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-29 03:19:47 +00:00 |
|
Aaron Pham
|
5ff77d1f6e
|
infra: prepare for release 0.5.1 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-29 02:44:23 +00:00 |
|
Aaron Pham
|
2314a3667e
|
infra: prepare for release 0.5.0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-27 18:20:30 +00:00 |
|
Aaron Pham
|
a4a6060f69
|
infra: prepare for release 0.5.0-alpha.15 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-27 17:53:45 +00:00 |
|
Aaron Pham
|
3f048d8a5b
|
chore(qol): update CLI options and performance upgrade for build cache (#997)
* chore(qol): update CLI options and performance upgrade for build cache
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* chore: update default python version for dev
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* fix: install custom tar.gz models
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-26 04:17:23 -04:00 |
|
Aaron Pham
|
fa850bafeb
|
infra: prepare for release 0.5.0-alpha.14 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-23 14:43:49 +00:00 |
|
Aaron Pham
|
97d76eec85
|
tests: add additional basic testing (#982)
* chore: update rebase tests
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* chore: update partial clients before removing
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* fix: update clients parsing logics to work with 0.5
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* chore: ignore ci runs as to run locally
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* chore: update async client tests
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* chore: update pre-commit
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-23 10:02:23 -04:00 |
|
Aaron Pham
|
5cb5203eea
|
infra: prepare for release 0.5.0-alpha.13 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-22 15:17:25 +00:00 |
|
Aaron Pham
|
f97f5108c4
|
infra: prepare for release 0.5.0-alpha.12 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-14 06:44:07 +00:00 |
|
Aaron Pham
|
d5447240ef
|
infra: prepare for release 0.5.0-alpha.11 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-12 04:22:35 +00:00 |
|
Aaron Pham
|
a335312ce2
|
infra: prepare for release 0.5.0-alpha.10 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-09 22:37:31 +00:00 |
|
Aaron Pham
|
03610d652d
|
infra: prepare for release 0.5.0-alpha.9 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-09 21:16:45 +00:00 |
|
Aaron Pham
|
642cf6aaf2
|
infra: prepare for release 0.5.0-alpha.8 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-09 01:38:22 +00:00 |
|
Aaron Pham
|
5e444381bb
|
infra: prepare for release 0.5.0-alpha.7 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-09 01:18:05 +00:00 |
|
Aaron Pham
|
de5e01cb47
|
infra: prepare for release 0.5.0-alpha.6 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-09 00:08:18 +00:00 |
|
Aaron Pham
|
d02f267fc7
|
infra: prepare for release 0.5.0-alpha.5 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-08 23:37:42 +00:00 |
|
Aaron Pham
|
0a1bcacbc4
|
infra: prepare for release 0.5.0-alpha.4 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-08 23:22:33 +00:00 |
|
paperspace
|
526a770a06
|
chore: update base requirements to 0.4.2
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-08 18:46:13 +00:00 |
|
Aaron Pham
|
43b635fbfd
|
fix: update correct CompletionOutput object (#973)
* fix: update correct CompletionOutput object
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* fix: revert to correct version
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-04-30 15:06:46 -04:00 |
|
Aaron Pham
|
135503017d
|
infra: prepare for release 0.5.0-alpha.3 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-04-02 04:36:56 +00:00 |
|
Aaron Pham
|
5c0d2787c0
|
feat: add dbrx support
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-04-02 04:10:19 +00:00 |
|
Aaron Pham
|
e9e6434012
|
infra: prepare for release 0.5.0-alpha.2 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-04-02 03:49:09 +00:00 |
|
Aaron Pham
|
12ac99867f
|
infra: prepare for release 0.5.0-alpha.1 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-03-21 01:37:56 +00:00 |
|
Aaron Pham
|
f0ab6d44fa
|
fix: make sure to include new implementation in bundle build
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-03-20 22:11:53 +00:00 |
|
Aaron Pham
|
824ff68818
|
chore: update local script and update service
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-03-15 20:29:49 +00:00 |
|
Aaron Pham
|
58c741c5aa
|
infra: prepare for release 0.5.0-alpha [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-03-15 08:46:18 +00:00 |
|
Aaron Pham
|
072b3e97ec
|
feat: 1.2 APIs (#821)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2024-03-15 03:49:19 -04:00 |
|
Aaron Pham
|
1b54d64eb0
|
infra: prepare for release 0.4.44 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-02-06 03:07:09 +00:00 |
|
Aaron Pham
|
fe44c843ec
|
infra: prepare for release 0.4.43 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-02-05 10:48:05 +00:00 |
|
Zhao Shenyang
|
16d8caf2ee
|
chore: bump up bentoml version to 1.1.11 (#883)
|
2024-02-04 21:31:14 +08:00 |
|
Aaron Pham
|
d1583cc1bb
|
infra: prepare for release 0.4.42 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-02-02 12:21:09 +00:00 |
|
Aaron Pham
|
2bb97f8ba2
|
chore: update discord link (#838)
* Update pyproject.toml
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* Update pyproject.toml
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* Update pyproject.toml
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* Update pyproject.toml
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-01-08 19:09:51 -05:00 |
|
Aaron Pham
|
79da419d87
|
chore(deps): bump vllm to 0.2.7 (#837)
* chore(deps): bump vllm to 0.2.7
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: update changelog
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2024-01-08 14:41:58 -05:00 |
|
Aaron Pham
|
b09bd20750
|
infra: prepare for release 0.4.41 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-12-18 18:08:46 +00:00 |
|
Aaron Pham
|
8d63afc9ce
|
feat(vllm): support GPTQ with 0.2.6 (#797)
* feat(vllm): GPTQ support passthrough
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: run scripts
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* fix(install): set order of xformers before vllm
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* feat: support GPTQ with vLLM
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-12-18 12:41:19 -05:00 |
|
Aaron Pham
|
2e8fc284f5
|
infra: prepare for release 0.4.40 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-12-15 16:46:12 +00:00 |
|
Aaron Pham
|
88b6d3d6de
|
perf: upgrade mixtral to use expert parallelism (#783)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-12-15 11:45:08 -05:00 |
|
Aaron Pham
|
d4fbbcee34
|
infra: prepare for release 0.4.39 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-12-14 19:20:01 +00:00 |
|
Aaron Pham
|
1dbae67172
|
infra: prepare for release 0.4.38 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-12-13 23:27:41 +00:00 |
|
Aaron Pham
|
8d9d212d61
|
infra: prepare for release 0.4.37 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-12-13 14:07:33 +00:00 |
|
Aaron Pham
|
9cd1e44b1e
|
infra: prepare for release 0.4.36 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-12-12 06:34:39 +00:00 |
|
Aaron Pham
|
d3328343d7
|
feat: mixtral support (#770)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-12-12 01:33:13 -05:00 |
|
Aaron
|
59e8ef93dc
|
chore(deps): lock vLLM to 0.2.4
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-12-12 00:17:18 -05:00 |
|
Aaron Pham
|
8019fd84c8
|
infra: prepare for release 0.4.35 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-12-07 08:38:13 +00:00 |
|
Aaron Pham
|
81688e0949
|
infra: prepare for release 0.4.34 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-30 12:17:48 +00:00 |
|