Aaron Pham
|
ad4f388c98
|
refactor: update runner helpers and add max_model_len (#712)
* chore(runner): cleanup unecessary checks for runnable backend
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: saving llm reference to runner
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: correct inject item
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: update support for max_seq_len
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* fix: correct max_model_len
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: update and warning backward compatibility
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: remove unused sets
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-20 20:37:15 -05:00 |
|
Aaron
|
f753662ae6
|
fix(build): only load model when eager is True
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-20 17:06:25 -05:00 |
|
Aaron
|
5b92e848e2
|
fix: raises error if backend is not supported
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-20 17:03:30 -05:00 |
|
Aaron Pham
|
816c1ee80e
|
feat(engine): CTranslate2 (#698)
* chore: update instruction for dependencies
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* feat(experimental): CTranslate2
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-19 10:25:08 -05:00 |
|
Aaron Pham
|
14b3ceb436
|
fix(torch_dtype): correctly infer based on options (#682)
Users should be able to set the dtype during build, as we it doesn't effect start time
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 10:52:05 -05:00 |
|
Aaron Pham
|
bce273ad47
|
fix(env): correct format environment on docker (#680)
* fix(env): correct format environment on docker
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* docs: changelog
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 09:51:17 -05:00 |
|
Aaron Pham
|
c1e0e3eae7
|
fix(build): correctly parse default env for container (#679)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 09:35:26 -05:00 |
|
Aaron Pham
|
8fdfd0491f
|
perf(build): locking and improve build speed (#669)
* revert(build): not locking packages
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* perf: improve svars generation and unifying envvar parsing
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* docs: update changelog
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* chore: update stubs check for mypy
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 06:27:45 -05:00 |
|
Aaron Pham
|
9e3f0fea15
|
types: update stubs for remaining entrypoints (#667)
* perf(type): static OpenAI types definition
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* feat: add hf types
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* types: update remaining missing stubs
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 04:26:13 -05:00 |
|
Aaron Pham
|
4a6f13ddd2
|
feat(type): provide structured annotations stubs (#663)
* feat(type): provide client stubs
separation of concern for more brevity code base
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* docs: update changelog
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 02:58:45 -05:00 |
|
Aaron Pham
|
a58d947bc8
|
perf: improve build logics and cleanup speed (#657)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-15 00:18:31 -05:00 |
|
Aaron Pham
|
6a6d689a77
|
feat: Yi models (#651)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-14 21:55:24 -05:00 |
|
Aaron Pham
|
b4b70e2f20
|
fix(cli): update context name parsing correctly (#652)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-14 21:53:56 -05:00 |
|
pre-commit-ci[bot]
|
52367d1e8b
|
ci: pre-commit autoupdate [pre-commit.ci] (#629)
* ci: pre-commit autoupdate [pre-commit.ci]
updates:
- [github.com/pre-commit/mirrors-prettier: v3.0.3 → v3.1.0](https://github.com/pre-commit/mirrors-prettier/compare/v3.0.3...v3.1.0)
* ci: auto fixes from pre-commit.ci
For more information, see https://pre-commit.ci
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2023-11-13 13:07:53 -05:00 |
|
Aaron Pham
|
852cd863a9
|
fix(cli): make sure to pass the dtype to subprocess service (#628)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-13 05:32:17 -05:00 |
|
Aaron Pham
|
099c0dc31b
|
feat(cli): --dtype arguments (#627)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-13 05:25:50 -05:00 |
|
Aaron Pham
|
126e6c9d63
|
fix(ruff): correct consistency between isort and formatter (#624)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-12 21:12:50 -05:00 |
|
Aaron Pham
|
e667dac82f
|
chore(cli): always show available models (#621)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-12 18:36:19 -05:00 |
|
Aaron Pham
|
c50a7db80d
|
fix(cli): correct set working_dir (#620)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-12 18:34:11 -05:00 |
|
Aaron Pham
|
e0632a85ed
|
refactor(cli): move out to its own packages (#619)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-12 18:25:44 -05:00 |
|