Aaron Pham
|
43b635fbfd
|
fix: update correct CompletionOutput object (#973)
* fix: update correct CompletionOutput object
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
* fix: revert to correct version
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
|
2024-04-30 15:06:46 -04:00 |
|
Aaron
|
66de54eae7
|
chore: update default params as pydantic fields
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2024-04-03 15:50:37 -04:00 |
|
Aaron Pham
|
32f4dff83b
|
fix: explicitly pass only non-null value
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-04-02 04:35:47 +00:00 |
|
Aaron Pham
|
1d817a7e01
|
fix: add support for min_tokens
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-04-02 04:22:14 +00:00 |
|
Aaron Pham
|
5c0d2787c0
|
feat: add dbrx support
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-04-02 04:10:19 +00:00 |
|
Aaron Pham
|
4661838964
|
chore: move out the template to separate files
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-04-02 03:24:26 +00:00 |
|
Aaron Pham
|
67ab9b5762
|
fix: swagger showing for subpath
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-03-22 02:24:22 +00:00 |
|
Aaron Pham
|
3ef93fe371
|
chore: update support development_mode as DEBUG and support for RELOAD envvar
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-03-22 01:19:32 +00:00 |
|
Aaron Pham
|
80b35f0d72
|
revert: correct type for openapi schema generation
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-03-21 07:51:00 +00:00 |
|
Aaron Pham
|
51bec78ee9
|
fix(load): make sure to respect MAX_MODEL_LEN from env
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-03-21 07:44:49 +00:00 |
|
Aaron
|
295a3b1061
|
chore(codegen): update generated var to read from envvar
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2024-03-20 21:51:39 -04:00 |
|
Aaron Pham
|
f0ab6d44fa
|
fix: make sure to include new implementation in bundle build
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-03-20 22:11:53 +00:00 |
|
Aaron
|
5c8c30a70b
|
fix: uses --pre for alpha releases for now
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2024-03-20 13:38:10 -04:00 |
|
Aaron
|
2ddbe4eb22
|
fix(service): remove mounting ASGI app
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2024-03-20 11:51:09 -04:00 |
|
Aaron Pham
|
824ff68818
|
chore: update local script and update service
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2024-03-15 20:29:49 +00:00 |
|
Aaron
|
c34db550a6
|
fix(build): explicit set to use alpha version
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2024-03-15 05:33:18 -04:00 |
|
Aaron
|
0274fb4c11
|
fix: don't lock openllm to support alpha release
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2024-03-15 05:29:35 -04:00 |
|
Aaron Pham
|
072b3e97ec
|
feat: 1.2 APIs (#821)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2024-03-15 03:49:19 -04:00 |
|