Aaron Pham
|
1831d8f129
|
feat: heuristics logprobs (#692)
* fix(encoder): bring back T5 support on PyTorch
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* feat: support logprobs and prompt_logprobs
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* docs: update changelog
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-18 19:26:20 -05:00 |
|
Aaron Pham
|
4499469efb
|
fix(annotations): check library through find_spec (#691)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-18 02:02:16 -05:00 |
|
Aaron Pham
|
e9a89b7a7e
|
fix(cattrs): strictly lock <23.2 until we upgrade validation logic (#690)
fix(cattrs): strictly lock <23.2 until we move converter to upper version
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 17:11:15 -05:00 |
|
Aaron Pham
|
3d204e9cea
|
infra: bump to homebrew tap release to 0.4.14 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 22:07:26 +00:00 |
|
Aaron Pham
|
781ae72c6f
|
infra: bump to dev version of 0.4.15.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 22:04:01 +00:00 |
|
Aaron Pham
|
5402db1e61
|
infra: prepare for release 0.4.14 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.14
|
2023-11-17 21:54:10 +00:00 |
|
Aaron Pham
|
0891cde0b6
|
fix(dependencies): ignore broken cattrs release (#689)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 16:52:58 -05:00 |
|
Aaron Pham
|
131f3f5dc3
|
infra: bump to homebrew tap release to 0.4.13 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 21:18:13 +00:00 |
|
Aaron Pham
|
1c5d07d60c
|
infra: bump to dev version of 0.4.14.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 21:17:18 +00:00 |
|
Aaron Pham
|
e14f3ffed5
|
infra: prepare for release 0.4.13 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.13
|
2023-11-17 21:06:56 +00:00 |
|
Aaron Pham
|
c03e3bebb3
|
fix(infra): prepare correct dependencies for release [skip ci] (#687)
fix(infra): prepare correct dependencies for release
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 16:05:46 -05:00 |
|
Aaron Pham
|
80ed400646
|
fix(build): lock lower version based on each release and update infra (#686)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 15:57:31 -05:00 |
|
Aaron Pham
|
e01f93f0c3
|
examples: improve instructions and cleanup simple API server (#684)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 11:53:56 -05:00 |
|
Aaron Pham
|
381d740a7a
|
fix(llm): remove unnecessary check (#683)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 11:23:22 -05:00 |
|
Aaron Pham
|
10471f7e4e
|
infra: bump to homebrew tap release to 0.4.12 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 16:05:37 +00:00 |
|
Aaron Pham
|
89c49f3a4f
|
infra: bump to dev version of 0.4.13.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 16:04:37 +00:00 |
|
Aaron Pham
|
65370f6919
|
infra: prepare for release 0.4.12 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.12
|
2023-11-17 15:54:41 +00:00 |
|
Aaron Pham
|
14b3ceb436
|
fix(torch_dtype): correctly infer based on options (#682)
Users should be able to set the dtype during build, as it doesn't affect start time
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 10:52:05 -05:00 |
|
Aaron Pham
|
7402408c5f
|
fix(envvar): explicitly set NVIDIA_DRIVER_CAPABILITIES (#681)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 10:40:45 -05:00 |
|
Aaron Pham
|
bd513e51a8
|
infra: bump to homebrew tap release to 0.4.11 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 15:04:54 +00:00 |
|
Aaron Pham
|
122c95dc31
|
infra: bump to dev version of 0.4.12.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 15:03:29 +00:00 |
|
Aaron Pham
|
5752c3f0d8
|
infra: prepare for release 0.4.11 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.11
|
2023-11-17 14:53:12 +00:00 |
|
Aaron Pham
|
bce273ad47
|
fix(env): correct format environment on docker (#680)
* fix(env): correct format environment on docker
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* docs: changelog
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 09:51:17 -05:00 |
|
Aaron Pham
|
c1e0e3eae7
|
fix(build): correctly parse default env for container (#679)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 09:35:26 -05:00 |
|
Aaron Pham
|
21a308538e
|
fix: correct set item for attrs >23.1 (#678)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 09:16:52 -05:00 |
|
Aaron Pham
|
c9daf4b5cb
|
fix(examples): add support for streaming feature (#677)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 08:32:28 -05:00 |
|
Aaron Pham
|
60b60ed29a
|
infra: update cbfmt options (#676)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 07:51:33 -05:00 |
|
Aaron Pham
|
102072bd1c
|
infra: bump to homebrew tap release to 0.4.10 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 06:33:21 +00:00 |
|
Aaron Pham
|
dd83030ac1
|
infra: bump to dev version of 0.4.11.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 06:26:19 +00:00 |
|
Aaron Pham
|
f4de4a9f13
|
infra: prepare for release 0.4.10 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.10
|
2023-11-17 06:16:58 +00:00 |
|
Aaron Pham
|
d60ca49d2f
|
perf: potentially reduce image size (#675)
* perf: potentially reduce image size
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* perf: use base python packages only
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* fix: typo
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* perf: Shave off 2GB
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-17 01:15:56 -05:00 |
|
Aaron Pham
|
09cc84a56c
|
chore(loading): include verbose warning about trust_remote_code (#674)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 20:09:50 -05:00 |
|
Aaron Pham
|
1a38de9b1f
|
fix(docs): chatglm support on vLLM (#673)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 17:54:06 -05:00 |
|
Aaron Pham
|
c850d76ccd
|
feat(models): Phi 1.5 (#672)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 17:48:10 -05:00 |
|
Aaron Pham
|
44f6db982d
|
infra: remove codegolf (#671)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 17:38:47 -05:00 |
|
Aaron Pham
|
0fdfe786f3
|
docs: add LlamaIndex integration (#646)
* docs: add LlamaIndex integration
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* ci: auto fixes from pre-commit.ci
For more information, see https://pre-commit.ci
---------
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2023-11-16 16:24:43 -05:00 |
|
Aaron Pham
|
8fdfd0491f
|
perf(build): locking and improve build speed (#669)
* revert(build): not locking packages
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* perf: improve svars generation and unifying envvar parsing
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* docs: update changelog
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* chore: update stubs check for mypy
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 06:27:45 -05:00 |
|
Aaron Pham
|
fce8f223f3
|
perf: reduce footprint (#668)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 04:45:49 -05:00 |
|
Aaron Pham
|
9e3f0fea15
|
types: update stubs for remaining entrypoints (#667)
* perf(type): static OpenAI types definition
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* feat: add hf types
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* types: update remaining missing stubs
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 04:26:13 -05:00 |
|
Aaron Pham
|
6102a67a83
|
infra: makes huggingface-hub requirements on fine-tune (#665)
infra: makes huggingface-hub core deps
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 03:12:52 -05:00 |
|
Aaron Pham
|
86d23fd6f5
|
feat(llm): respect warnings environment for dtype warning (#664)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 03:05:58 -05:00 |
|
Aaron Pham
|
4a6f13ddd2
|
feat(type): provide structured annotations stubs (#663)
* feat(type): provide client stubs
separation of concerns for a more concise code base
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* docs: update changelog
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-16 02:58:45 -05:00 |
|
xianxian.zhang
|
c6264f3af7
|
fix(examples): update notebook with new API (#662)
|
2023-11-15 22:28:40 -05:00 |
|
Kuan-Chun Wang
|
af88b9b077
|
fix(runner): remove keyword args for attrs.get() (#661)
|
2023-11-15 04:59:01 -05:00 |
|
Aaron Pham
|
c05f405163
|
infra: bump to homebrew tap release to 0.4.9 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-15 08:02:38 +00:00 |
|
Aaron Pham
|
7c64ffea0f
|
infra: bump to dev version of 0.4.10.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-15 08:01:40 +00:00 |
|
Aaron Pham
|
68c6a9dac6
|
infra: prepare for release 0.4.9 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.9
|
2023-11-15 07:51:30 +00:00 |
|
Aaron Pham
|
876586a30e
|
fix(falcon): remove early_stopping default arguments (#660)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-15 02:49:54 -05:00 |
|
Aaron Pham
|
9e6df0df89
|
chore: update requirements in README.md (#659)
chore: update requirements
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-15 02:32:36 -05:00 |
|
Aaron Pham
|
034e08cf08
|
infra: update scripts to run update readme automatically (#658)
* infra: update scripts to run update readme automatically
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: cleanup mirror
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore(dropdown): correctly format noteblock and important block
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* fix: whitespace aware
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-15 02:22:49 -05:00 |
|