Commit Graph

174 Commits

Author SHA1 Message Date
Aaron Pham
c19654adf3 infra: prepare for release 0.4.15 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-19 00:47:18 +00:00
Aaron Pham
1831d8f129 feat: heuristics logprobs (#692)
* fix(encoder): bring back T5 support on PyTorch

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* feat: support logprobs and prompt_logprobs

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* docs: update changelog

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-18 19:26:20 -05:00
Aaron Pham
4499469efb fix(annotations): check library through find_spec (#691)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-18 02:02:16 -05:00
Aaron Pham
5402db1e61 infra: prepare for release 0.4.14 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-17 21:54:10 +00:00
Aaron Pham
e14f3ffed5 infra: prepare for release 0.4.13 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-17 21:06:56 +00:00
Aaron Pham
80ed400646 fix(build): lock lower version based on each release and update infra (#686)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 15:57:31 -05:00
Aaron Pham
381d740a7a fix(llm): remove unnecessary check (#683)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 11:23:22 -05:00
Aaron Pham
65370f6919 infra: prepare for release 0.4.12 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-17 15:54:41 +00:00
Aaron Pham
14b3ceb436 fix(torch_dtype): correctly infer based on options (#682)
Users should be able to set the dtype during build, as it doesn't affect start time

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 10:52:05 -05:00
Aaron Pham
7402408c5f fix(envvar): explicitly set NVIDIA_DRIVER_CAPABILITIES (#681)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 10:40:45 -05:00
Aaron Pham
5752c3f0d8 infra: prepare for release 0.4.11 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-17 14:53:12 +00:00
Aaron Pham
bce273ad47 fix(env): correct format environment on docker (#680)
* fix(env): correct format environment on docker

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* docs: changelog

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 09:51:17 -05:00
Aaron Pham
c1e0e3eae7 fix(build): correctly parse default env for container (#679)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 09:35:26 -05:00
Aaron Pham
60b60ed29a infra: update cbfmt options (#676)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 07:51:33 -05:00
Aaron Pham
f4de4a9f13 infra: prepare for release 0.4.10 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-17 06:16:58 +00:00
Aaron Pham
d60ca49d2f perf: potentially reduce image size (#675)
* perf: potentially reduce image size

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* perf: use base python packages only

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* fix: typo

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* perf: Shave off 2GB

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 01:15:56 -05:00
Aaron Pham
09cc84a56c chore(loading): include verbose warning about trust_remote_code (#674)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-16 20:09:50 -05:00
Aaron Pham
1a38de9b1f fix(docs): chatglm support on vLLM (#673)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-16 17:54:06 -05:00
Aaron Pham
c850d76ccd feat(models): Phi 1.5 (#672)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-16 17:48:10 -05:00
Aaron Pham
44f6db982d infra: remove codegolf (#671)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-16 17:38:47 -05:00
Aaron Pham
8fdfd0491f perf(build): locking and improve build speed (#669)
* revert(build): not locking packages

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

* perf: improve svars generation and unifying envvar parsing

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

* docs: update changelog

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

* chore: update stubs check for mypy

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-16 06:27:45 -05:00
Aaron Pham
fce8f223f3 perf: reduce footprint (#668)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-16 04:45:49 -05:00
Aaron Pham
9e3f0fea15 types: update stubs for remaining entrypoints (#667)
* perf(type): static OpenAI types definition

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* feat: add hf types

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* types: update remaining missing stubs

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-16 04:26:13 -05:00
Aaron Pham
6102a67a83 infra: makes huggingface-hub requirements on fine-tune (#665)
infra: makes huggingface-hub core deps

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-16 03:12:52 -05:00
Aaron Pham
86d23fd6f5 feat(llm): respect warnings environment for dtype warning (#664)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-16 03:05:58 -05:00
Aaron Pham
4a6f13ddd2 feat(type): provide structured annotations stubs (#663)
* feat(type): provide client stubs

separation of concerns for a more concise code base

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* docs: update changelog

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-16 02:58:45 -05:00
Kuan-Chun Wang
af88b9b077 fix(runner): remove keyword args for attrs.get() (#661)
2023-11-15 04:59:01 -05:00
Aaron Pham
68c6a9dac6 infra: prepare for release 0.4.9 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-15 07:51:30 +00:00
Aaron Pham
9e6df0df89 chore: update requirements in README.md (#659)
chore: update requirements

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-15 02:32:36 -05:00
Aaron Pham
034e08cf08 infra: update scripts to run update readme automatically (#658)
* infra: update scripts to run update readme automatically

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore: cleanup mirror

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore(dropdown): correctly format noteblock and important block

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* fix: whitespace aware

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-15 02:22:49 -05:00
Aaron Pham
625afd0c3b infra: prepare for release 0.4.8 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-15 05:19:22 +00:00
Aaron Pham
a58d947bc8 perf: improve build logics and cleanup speed (#657)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-15 00:18:31 -05:00
Aaron Pham
103156cd71 chore(cli): move playground to CLI components (#655)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-14 23:20:50 -05:00
Aaron Pham
b40874149e infra: prepare for release 0.4.7 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-15 03:44:09 +00:00
xianxian.zhang
ea02aaaa23 fix: correct OPENLLM_DEV_BUILD check (#653)
2023-11-14 22:21:37 -05:00
Aaron Pham
6a6d689a77 feat: Yi models (#651)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-14 21:55:24 -05:00
Aaron Pham
b4b70e2f20 fix(cli): update context name parsing correctly (#652)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-14 21:53:56 -05:00
Aaron
9eddae83a6 infra: update cohere client
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-14 01:52:38 -05:00
Aaron Pham
31a799ff61 refactor: use DEBUG env-var instead of OPENLLMDEVDEBUG (#647)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-14 01:39:58 -05:00
Aaron Pham
145aafd386 infra: prepare for release 0.4.6 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-14 06:18:14 +00:00
Aaron Pham
00d6016bcb chore(openapi): unify inject param (#645)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-14 01:16:20 -05:00
Aaron Pham
b0ab8ccdf6 experimental: Cohere compatible endpoints. (#644)
* feat: add generate endpoint

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore: update generation

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* fix(cohere): generate endpoints

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore: --wip--

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* feat: update testing clients and chat implementation

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore: disable schemas for easter eggs

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-14 01:07:43 -05:00
Aaron Pham
0bf6ec7537 fix(dependencies): lock build < 1 for now (#643)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-14 00:36:08 -05:00
Aaron Pham
b30a412398 fix(cli): set default dtype to auto infer (#642)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-13 23:05:27 -05:00
Aaron Pham
99a5d26527 fix(service): to yield out correct JSON objects (#640)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-13 22:41:52 -05:00
Aaron Pham
2d428f12da fix(cpu): more verbose definition for dtype casting (#639)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-13 20:40:50 -05:00
Aaron Pham
b20c7d1c1d fix(generation): compatibility dtype with CPU (#638)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-13 20:32:07 -05:00
Zhao Shenyang
ae69524749 doc: update adding new model guide (#637)
* update

* Update openllm-python/ADDING_NEW_MODEL.md

Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Zhao Shenyang <dev@zsy.im>

* Update openllm-python/ADDING_NEW_MODEL.md

Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Zhao Shenyang <dev@zsy.im>

* Update openllm-python/ADDING_NEW_MODEL.md

Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Zhao Shenyang <dev@zsy.im>

* move ADDING_NEW_MODEL.md to git root directory

---------

Signed-off-by: Zhao Shenyang <dev@zsy.im>
Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-13 18:30:44 -05:00
Aaron Pham
a6387d1d15 chore: cleanup unused code path (#633)
we now rely on tokenizer.chat_templates to format prompts correctly

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-13 17:23:07 -05:00
Aaron Pham
a0d74017cd infra: prepare for release 0.4.5 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-13 18:49:31 +00:00