Aaron Pham
|
10a60307c1
|
infra: prepare for release 0.5.5 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-06-03 22:14:58 +00:00 |
|
paperspace
|
15cada079a
|
fix(models): make sure to use private-tag name for the generated service
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-06-03 20:45:17 +00:00 |
|
Aaron Pham
|
c60398c45b
|
chore: add more info to metadata
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-06-02 17:57:51 -04:00 |
|
Aaron Pham
|
3193190b94
|
chore: update configuration to yield objects instead
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-06-02 17:48:03 -04:00 |
|
paperspace
|
7d563ee121
|
chore(ci): update scripts [skip ci]
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-06-02 16:12:36 +00:00 |
|
paperspace
|
a93da12084
|
chore: upgrade to new vLLM schema
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-06-02 15:52:45 +00:00 |
|
paperspace
|
8fea50dfdb
|
feat: update ROCm check for syspath
See #950 for more information
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-06-02 14:20:23 +00:00 |
|
Aaron Pham
|
bf28f977bc
|
feat(models): command-r (#1005)
* feat(models): add support for command-r
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* feat(models): support command-r and remove deadcode and extensions
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* chore: update local.sh script
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-06-02 10:16:08 -04:00 |
|
Aaron Pham
|
9649073713
|
infra: prepare for release 0.5.4 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-06-01 00:37:27 +00:00 |
|
Aaron Pham
|
45aceb172f
|
feat(API): add light support for batch inference (#1004)
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-31 20:36:12 -04:00 |
|
Aaron Pham
|
162458ffe6
|
infra: prepare for release 0.5.3 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-30 21:30:21 +00:00 |
|
Aaron Pham
|
49908ec289
|
infra: prepare for release 0.5.2 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-29 04:44:59 +00:00 |
|
paperspace
|
7fca472a66
|
chore: update readme [skip ci]
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-29 04:43:50 +00:00 |
|
Aaron Pham
|
e9e46b2cc7
|
chore: update examples and readme
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-29 00:41:32 -04:00 |
|
paperspace
|
02010d3499
|
fix: synchronize into llm_config dict
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-29 04:31:34 +00:00 |
|
paperspace
|
ef11e54a6d
|
chore: update docs and base instruction [skip ci]
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-29 03:19:47 +00:00 |
|
Aaron Pham
|
5ff77d1f6e
|
infra: prepare for release 0.5.1 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-29 02:44:23 +00:00 |
|
paperspace
|
c820cececb
|
fix(generate): make sure to only pass prompt_token_ids if it is a valid
mutable
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-29 02:42:13 +00:00 |
|
Aaron Pham
|
2314a3667e
|
infra: prepare for release 0.5.0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-27 18:20:30 +00:00 |
|
paperspace
|
9da0b4134c
|
chore(qol): make envvar private
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-27 18:07:38 +00:00 |
|
Aaron Pham
|
a4a6060f69
|
infra: prepare for release 0.5.0-alpha.15 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-27 17:53:45 +00:00 |
|
paperspace
|
07655c9ba8
|
chore(build): remove vllm_version envvar and lock into templates
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-27 17:49:58 +00:00 |
|
paperspace
|
ba5a5da720
|
chore: udpate docstring
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-27 17:02:26 +00:00 |
|
paperspace
|
0f32290606
|
chore(packages): ready for 0.5 releases
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-27 16:54:53 +00:00 |
|
Aaron Pham (mbp16)
|
f4f7f16e81
|
chore(releases): remove deadcode
Signed-off-by: Aaron Pham (mbp16) <29749331+aarnphm@users.noreply.github.com>
|
2024-05-27 12:37:50 -04:00 |
|
Aaron Pham
|
f248ea25cd
|
feat(ci): running CI on paperspace (#998)
* chore: update tiny script
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* feat(ci): running on paperspace machines
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: update models and increase timeout readiness
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* fix: schema validation for inputs and update client supporting stop
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* chore: update coverage config
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* chore: remove some non-essentials
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* chore: update locks
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2024-05-26 13:14:54 -04:00 |
|
Aaron Pham
|
3f048d8a5b
|
chore(qol): update CLI options and performance upgrade for build cache (#997)
* chore(qol): update CLI options and performance upgrade for build cache
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* chore: update default python version for dev
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* fix: install custom tar.gz models
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-26 04:17:23 -04:00 |
|
Aaron Pham
|
5e97329bcb
|
infra: prepare 0.5 releases (#996)
* chore: prepare for 0.5
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: update changelogs
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: fix to lowest python version supported
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: update scripts
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2024-05-23 12:50:01 -04:00 |
|
paperspace
|
a410b9cfe8
|
infra: update README
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-23 16:44:03 +00:00 |
|
Aaron Pham
|
fa850bafeb
|
infra: prepare for release 0.5.0-alpha.14 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-23 14:43:49 +00:00 |
|
paperspace
|
cec0aa5487
|
fix(memory): correctly recommend instance types for cloud
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-23 14:42:39 +00:00 |
|
Aaron Pham
|
97d76eec85
|
tests: add additional basic testing (#982)
* chore: update rebase tests
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* chore: update partial clients before removing
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* fix: update clients parsing logics to work with 0.5
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* chore: ignore ci runs as to run locally
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* chore: update async client tests
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* chore: update pre-commit
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-23 10:02:23 -04:00 |
|
Aaron Pham
|
5cb5203eea
|
infra: prepare for release 0.5.0-alpha.13 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-22 15:17:25 +00:00 |
|
paperspace
|
b7193511e6
|
fix: correct update default value for dict unpacking
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-22 15:15:23 +00:00 |
|
Aaron Pham
|
f97f5108c4
|
infra: prepare for release 0.5.0-alpha.12 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-14 06:44:07 +00:00 |
|
paperspace
|
e9246e7772
|
fix: make sure to only update fields when correct type is parse
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-14 06:42:46 +00:00 |
|
Aaron Pham
|
d5447240ef
|
infra: prepare for release 0.5.0-alpha.11 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-12 04:22:35 +00:00 |
|
paperspace
|
806308eed6
|
fix: one-shot generation not to concatenate duplicates
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-12 04:21:06 +00:00 |
|
paperspace
|
1d2e554a94
|
chore: disable progressbar for cleaner log trace
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-10 03:11:47 +00:00 |
|
Aaron Pham
|
a335312ce2
|
infra: prepare for release 0.5.0-alpha.10 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-09 22:37:31 +00:00 |
|
paperspace
|
9a961d9070
|
perf(build): improve preheating layers for caching dependencies
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-09 22:34:39 +00:00 |
|
Aaron Pham
|
03610d652d
|
infra: prepare for release 0.5.0-alpha.9 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-09 21:16:45 +00:00 |
|
paperspace
|
8e82bd9600
|
chore: update streaming logics to respect cursor
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-09 21:13:28 +00:00 |
|
paperspace
|
dd79779e0e
|
fix: update generate_stream to yield delta based on given index
generation
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-09 20:18:00 +00:00 |
|
Aaron Pham
|
642cf6aaf2
|
infra: prepare for release 0.5.0-alpha.8 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-09 01:38:22 +00:00 |
|
paperspace
|
c9f8dbc767
|
feat: set options for 'gpu' for building recommendation
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-09 01:37:29 +00:00 |
|
Aaron Pham
|
5e444381bb
|
infra: prepare for release 0.5.0-alpha.7 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-09 01:18:05 +00:00 |
|
paperspace
|
852b82d25b
|
fix: make sure to export correct json config
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-09 01:14:12 +00:00 |
|
Aaron Pham
|
de5e01cb47
|
infra: prepare for release 0.5.0-alpha.6 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-05-09 00:08:18 +00:00 |
|
paperspace
|
6726f6ae3e
|
fix: make sure to add cpu to number
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-05-09 00:06:10 +00:00 |
|