Commit Graph

1154 Commits

Author SHA1 Message Date
Aaron Pham
1831d8f129 feat: heuristics logprobs (#692)
* fix(encoder): bring back T5 support on PyTorch

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* feat: support logprobs and prompt_logprobs

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* docs: update changelog

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-18 19:26:20 -05:00
Aaron Pham
4499469efb fix(annotations): check library through find_spec (#691)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-18 02:02:16 -05:00
Aaron Pham
e9a89b7a7e fix(cattrs): strictly lock <23.2 until we upgrade validation logic (#690)
fix(cattrs): strictly lock <23.2 until we move converter to upper version

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 17:11:15 -05:00
Aaron Pham
3d204e9cea infra: bump to homebrew tap release to 0.4.14 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-17 22:07:26 +00:00
Aaron Pham
781ae72c6f infra: bump to dev version of 0.4.15.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-17 22:04:01 +00:00
Aaron Pham
5402db1e61 infra: prepare for release 0.4.14 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.14
2023-11-17 21:54:10 +00:00
Aaron Pham
0891cde0b6 fix(dependencies): ignore broken cattrs release (#689)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 16:52:58 -05:00
Aaron Pham
131f3f5dc3 infra: bump to homebrew tap release to 0.4.13 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-17 21:18:13 +00:00
Aaron Pham
1c5d07d60c infra: bump to dev version of 0.4.14.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-17 21:17:18 +00:00
Aaron Pham
e14f3ffed5 infra: prepare for release 0.4.13 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.13
2023-11-17 21:06:56 +00:00
Aaron Pham
c03e3bebb3 fix(infra): prepare correct dependencies for release [skip ci] (#687)
fix(infra): prepare correct dependencies for release

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 16:05:46 -05:00
Aaron Pham
80ed400646 fix(build): lock lower version based on each release and update infra (#686)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 15:57:31 -05:00
Aaron Pham
e01f93f0c3 examples: improve instructions and cleanup simple API server (#684)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-17 11:53:56 -05:00
Aaron Pham
381d740a7a fix(llm): remove unnecessary check (#683)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 11:23:22 -05:00
Aaron Pham
10471f7e4e infra: bump to homebrew tap release to 0.4.12 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-17 16:05:37 +00:00
Aaron Pham
89c49f3a4f infra: bump to dev version of 0.4.13.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-17 16:04:37 +00:00
Aaron Pham
65370f6919 infra: prepare for release 0.4.12 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.12
2023-11-17 15:54:41 +00:00
Aaron Pham
14b3ceb436 fix(torch_dtype): correctly infer based on options (#682)
Users should be able to set the dtype during build, as we it doesn't effect start time

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 10:52:05 -05:00
Aaron Pham
7402408c5f fix(envvar): explicitly set NVIDIA_DRIVER_CAPABILITIES (#681)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 10:40:45 -05:00
Aaron Pham
bd513e51a8 infra: bump to homebrew tap release to 0.4.11 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-17 15:04:54 +00:00
Aaron Pham
122c95dc31 infra: bump to dev version of 0.4.12.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-17 15:03:29 +00:00
Aaron Pham
5752c3f0d8 infra: prepare for release 0.4.11 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.11
2023-11-17 14:53:12 +00:00
Aaron Pham
bce273ad47 fix(env): correct format environment on docker (#680)
* fix(env): correct format environment on docker

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* docs: changelog

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 09:51:17 -05:00
Aaron Pham
c1e0e3eae7 fix(build): correctly parse default env for container (#679)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 09:35:26 -05:00
Aaron Pham
21a308538e fix: correct set item for attrs >23.1 (#678)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 09:16:52 -05:00
Aaron Pham
c9daf4b5cb fix(examples): add support for streaming feature (#677)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 08:32:28 -05:00
Aaron Pham
60b60ed29a infra: update cbfmt options (#676)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 07:51:33 -05:00
Aaron Pham
102072bd1c infra: bump to homebrew tap release to 0.4.10 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-17 06:33:21 +00:00
Aaron Pham
dd83030ac1 infra: bump to dev version of 0.4.11.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-17 06:26:19 +00:00
Aaron Pham
f4de4a9f13 infra: prepare for release 0.4.10 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.10
2023-11-17 06:16:58 +00:00
Aaron Pham
d60ca49d2f perf: potentially reduce image size (#675)
* perf: potentially reduce image size

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* perf: use base python packages only

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* fix: typo

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* perf: Shave off 2GB

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-17 01:15:56 -05:00
Aaron Pham
09cc84a56c chore(loading): include verbose warning about trust_remote_code (#674)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-16 20:09:50 -05:00
Aaron Pham
1a38de9b1f fix(docs): chatglm support on vLLM (#673)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-16 17:54:06 -05:00
Aaron Pham
c850d76ccd feat(models): Phi 1.5 (#672)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-16 17:48:10 -05:00
Aaron Pham
44f6db982d infra: remove codegolf (#671)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-16 17:38:47 -05:00
Aaron Pham
0fdfe786f3 docs: add LlamaIndex integration (#646)
* docs: add LlamaIndex integration

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

* ci: auto fixes from pre-commit.ci

For more information, see https://pre-commit.ci

---------

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-11-16 16:24:43 -05:00
Aaron Pham
8fdfd0491f perf(build): locking and improve build speed (#669)
* revert(build): not locking packages

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

* perf: improve svars generation and unifying envvar parsing

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

* docs: update changelog

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

* chore: update stubs check for mypy

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-16 06:27:45 -05:00
Aaron Pham
fce8f223f3 perf: reduce footprint (#668)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-16 04:45:49 -05:00
Aaron Pham
9e3f0fea15 types: update stubs for remaining entrypoints (#667)
* perf(type): static OpenAI types definition

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* feat: add hf types

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* types: update remaining missing stubs

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-16 04:26:13 -05:00
Aaron Pham
6102a67a83 infra: makes huggingface-hub requirements on fine-tune (#665)
infra: makes huggingface-hub core deps

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-16 03:12:52 -05:00
Aaron Pham
86d23fd6f5 feat(llm): respect warnings environment for dtype warning (#664)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-16 03:05:58 -05:00
Aaron Pham
4a6f13ddd2 feat(type): provide structured annotations stubs (#663)
* feat(type): provide client stubs

separation of concern for more brevity code base

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* docs: update changelog

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-16 02:58:45 -05:00
xianxian.zhang
c6264f3af7 fix(examples): update notebook with new API (#662) 2023-11-15 22:28:40 -05:00
Kuan-Chun Wang
af88b9b077 fix(runner): remove keyword args for attrs.get() (#661) 2023-11-15 04:59:01 -05:00
Aaron Pham
c05f405163 infra: bump to homebrew tap release to 0.4.9 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-15 08:02:38 +00:00
Aaron Pham
7c64ffea0f infra: bump to dev version of 0.4.10.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-15 08:01:40 +00:00
Aaron Pham
68c6a9dac6 infra: prepare for release 0.4.9 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.9
2023-11-15 07:51:30 +00:00
Aaron Pham
876586a30e fix(falcon): remove early_stopping default arguments (#660)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-15 02:49:54 -05:00
Aaron Pham
9e6df0df89 chore: update requirements in README.md (#659)
chore: update requirements

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-15 02:32:36 -05:00
Aaron Pham
034e08cf08 infra: update scripts to run update readme automatically (#658)
* infra: update scripts to run update readme automatically

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore: cleanup mirror

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore(dropdown): correctly format noteblock and important block

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* fix: whitespace aware

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-15 02:22:49 -05:00