Aaron Pham
|
c19654adf3
|
infra: prepare for release 0.4.15 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-19 00:47:18 +00:00 |
|
Aaron Pham
|
1831d8f129
|
feat: heuristics logprobs (#692)
* fix(encoder): bring back T5 support on PyTorch
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* feat: support logprobs and prompt_logprobs
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* docs: update changelog
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-18 19:26:20 -05:00 |
|
Aaron Pham
|
4499469efb
|
fix(annotations): check library through find_spec (#691)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-18 02:02:16 -05:00 |
|
Aaron Pham
|
5402db1e61
|
infra: prepare for release 0.4.14 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 21:54:10 +00:00 |
|
Aaron Pham
|
e14f3ffed5
|
infra: prepare for release 0.4.13 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 21:06:56 +00:00 |
|
Aaron Pham
|
80ed400646
|
fix(build): lock lower version based on each release and update infra (#686)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 15:57:31 -05:00 |
|
Aaron Pham
|
381d740a7a
|
fix(llm): remove unnecessary check (#683)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 11:23:22 -05:00 |
|
Aaron Pham
|
65370f6919
|
infra: prepare for release 0.4.12 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 15:54:41 +00:00 |
|
Aaron Pham
|
14b3ceb436
|
fix(torch_dtype): correctly infer based on options (#682)
Users should be able to set the dtype during build, as we it doesn't effect start time
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 10:52:05 -05:00 |
|
Aaron Pham
|
7402408c5f
|
fix(envvar): explicitly set NVIDIA_DRIVER_CAPABILITIES (#681)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 10:40:45 -05:00 |
|
Aaron Pham
|
5752c3f0d8
|
infra: prepare for release 0.4.11 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 14:53:12 +00:00 |
|
Aaron Pham
|
bce273ad47
|
fix(env): correct format environment on docker (#680)
* fix(env): correct format environment on docker
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* docs: changelog
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 09:51:17 -05:00 |
|
Aaron Pham
|
c1e0e3eae7
|
fix(build): correctly parse default env for container (#679)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 09:35:26 -05:00 |
|
Aaron Pham
|
60b60ed29a
|
infra: update cbfmt options (#676)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 07:51:33 -05:00 |
|
Aaron Pham
|
f4de4a9f13
|
infra: prepare for release 0.4.10 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 06:16:58 +00:00 |
|
Aaron Pham
|
d60ca49d2f
|
perf: potentially reduce image size (#675)
* perf: potentially reduce image size
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* perf: use base python packages only
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* fix: typo
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* perf: Shave off 2GB
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 01:15:56 -05:00 |
|
Aaron Pham
|
09cc84a56c
|
chore(loading): include verbose warning about trust_remote_code (#674)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 20:09:50 -05:00 |
|
Aaron Pham
|
1a38de9b1f
|
fix(docs): chatglm support on vLLM (#673)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 17:54:06 -05:00 |
|
Aaron Pham
|
c850d76ccd
|
feat(models): Phi 1.5 (#672)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 17:48:10 -05:00 |
|
Aaron Pham
|
44f6db982d
|
infra: remove codegolf (#671)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 17:38:47 -05:00 |
|
Aaron Pham
|
8fdfd0491f
|
perf(build): locking and improve build speed (#669)
* revert(build): not locking packages
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* perf: improve svars generation and unifying envvar parsing
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* docs: update changelog
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* chore: update stubs check for mypy
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 06:27:45 -05:00 |
|
Aaron Pham
|
fce8f223f3
|
perf: reduce footprint (#668)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 04:45:49 -05:00 |
|
Aaron Pham
|
9e3f0fea15
|
types: update stubs for remaining entrypoints (#667)
* perf(type): static OpenAI types definition
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* feat: add hf types
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* types: update remaining missing stubs
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 04:26:13 -05:00 |
|
Aaron Pham
|
6102a67a83
|
infra: makes huggingface-hub requirements on fine-tune (#665)
infra: makes huggingface-hub core deps
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 03:12:52 -05:00 |
|
Aaron Pham
|
86d23fd6f5
|
feat(llm): respect warnings environment for dtype warning (#664)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 03:05:58 -05:00 |
|
Aaron Pham
|
4a6f13ddd2
|
feat(type): provide structured annotations stubs (#663)
* feat(type): provide client stubs
separation of concern for more brevity code base
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* docs: update changelog
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 02:58:45 -05:00 |
|
Kuan-Chun Wang
|
af88b9b077
|
fix(runner): remove keyword args for attrs.get() (#661)
|
2023-11-15 04:59:01 -05:00 |
|
Aaron Pham
|
68c6a9dac6
|
infra: prepare for release 0.4.9 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-15 07:51:30 +00:00 |
|
Aaron Pham
|
9e6df0df89
|
chore: update requirements in README.md (#659)
chore: update requirements
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-15 02:32:36 -05:00 |
|
Aaron Pham
|
034e08cf08
|
infra: update scripts to run update readme automatically (#658)
* infra: update scripts to run update readme automatically
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: cleanup mirror
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore(dropdown): correctly format noteblock and important block
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* fix: whitespace aware
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-15 02:22:49 -05:00 |
|
Aaron Pham
|
625afd0c3b
|
infra: prepare for release 0.4.8 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-15 05:19:22 +00:00 |
|
Aaron Pham
|
a58d947bc8
|
perf: improve build logics and cleanup speed (#657)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-15 00:18:31 -05:00 |
|
Aaron Pham
|
103156cd71
|
chore(cli): move playground to CLI components (#655)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-14 23:20:50 -05:00 |
|
Aaron Pham
|
b40874149e
|
infra: prepare for release 0.4.7 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-15 03:44:09 +00:00 |
|
xianxian.zhang
|
ea02aaaa23
|
fix: correct OPENLLM_DEV_BUILD check (#653)
|
2023-11-14 22:21:37 -05:00 |
|
Aaron Pham
|
6a6d689a77
|
feat: Yi models (#651)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-14 21:55:24 -05:00 |
|
Aaron Pham
|
b4b70e2f20
|
fix(cli): update context name parsing correctly (#652)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-14 21:53:56 -05:00 |
|
Aaron
|
9eddae83a6
|
infra: update cohere client
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-14 01:52:38 -05:00 |
|
Aaron Pham
|
31a799ff61
|
refactor: use DEBUG env-var instead of OPENLLMDEVDEBUG (#647)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-14 01:39:58 -05:00 |
|
Aaron Pham
|
145aafd386
|
infra: prepare for release 0.4.6 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-14 06:18:14 +00:00 |
|
Aaron Pham
|
00d6016bcb
|
chore(openapi): unify inject param (#645)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-14 01:16:20 -05:00 |
|
Aaron Pham
|
b0ab8ccdf6
|
experimental: Cohere compatible endpoints. (#644)
* feat: add generate endpoint
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: update generation
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* fix(cohere): generate endpoints
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: --wip--
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* feat: update testing clients and chat implementation
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: disable schemas for easter eggs
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-14 01:07:43 -05:00 |
|
Aaron Pham
|
0bf6ec7537
|
fix(dependencies): lock build < 1 for now (#643)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-14 00:36:08 -05:00 |
|
Aaron Pham
|
b30a412398
|
fix(cli): set default dtype to auto infer (#642)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-13 23:05:27 -05:00 |
|
Aaron Pham
|
99a5d26527
|
fix(service): to yield out correct JSON objects (#640)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-13 22:41:52 -05:00 |
|
Aaron Pham
|
2d428f12da
|
fix(cpu): more verbose definition for dtype casting (#639)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-13 20:40:50 -05:00 |
|
Aaron Pham
|
b20c7d1c1d
|
fix(generation): compatibility dtype with CPU (#638)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-13 20:32:07 -05:00 |
|
Zhao Shenyang
|
ae69524749
|
doc: update adding new model guide (#637)
* update
* Update openllm-python/ADDING_NEW_MODEL.md
Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Zhao Shenyang <dev@zsy.im>
* Update openllm-python/ADDING_NEW_MODEL.md
Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Zhao Shenyang <dev@zsy.im>
* Update openllm-python/ADDING_NEW_MODEL.md
Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Zhao Shenyang <dev@zsy.im>
* move ADDING_NEW_MODEL.md to git root directory
---------
Signed-off-by: Zhao Shenyang <dev@zsy.im>
Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-13 18:30:44 -05:00 |
|
Aaron Pham
|
a6387d1d15
|
chore: cleanup unused code path (#633)
we now rely on tokenizer.chat_templates to format prompts correctly
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-13 17:23:07 -05:00 |
|
Aaron Pham
|
a0d74017cd
|
infra: prepare for release 0.4.5 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-13 18:49:31 +00:00 |
|