Aaron
|
2244cce5bd
|
fix(config): __getitem__ to get the value instead of member of class
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-19 05:35:11 -04:00 |
|
Aaron
|
622a2fb37d
|
fix: separate hatch config
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-19 03:39:05 -04:00 |
|
Aaron
|
e3fad40f21
|
fix(env): make tests with extra-dependencies
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-18 23:58:03 -04:00 |
|
Aaron Pham
|
03758a5487
|
fix(tools): adhere to style guidelines (#31)
|
2023-06-18 20:03:17 -04:00 |
|
Aaron
|
a7a6775c68
|
chore: add banner for OpenLLM
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-18 05:55:38 -04:00 |
|
Aaron
|
33d3523e5b
|
chore(readme): update docs and warning notes
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-18 01:39:15 -04:00 |
|
Aaron Pham
|
4fcd7c8ac9
|
integration: HuggingFace Agent (#29)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2023-06-18 00:13:53 -04:00 |
|
aarnphm-ec2-dev
|
fe8da4e8a9
|
fix(tests): ensure_available on tests
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-06-17 15:12:28 +00:00 |
|
Aaron
|
8bd7351d3c
|
chore: update new gif
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-17 10:56:32 -04:00 |
|
Aaron
|
5a6f42ee99
|
infra: fix generated release link for towncrier [skip ci]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-17 09:20:34 -04:00 |
|
Aaron Pham [bot]
|
9be65a813b
|
infra: bump to dev version of 0.1.7.dev0 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
|
2023-06-17 13:12:46 +00:00 |
|
Aaron Pham [bot]
|
ed398c38f8
|
infra: prepare for release 0.1.6 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
v0.1.6
|
2023-06-17 13:02:47 +00:00 |
|
Aaron Pham
|
6f724416c0
|
perf: build quantization and better transformer behaviour (#28)
Fixes quantization_config and low_cpu_mem_usage to be available on PyTorch implementation only
See changelog for more details on #28
|
2023-06-17 08:56:14 -04:00 |
|
Aaron
|
233d4697b5
|
chore: update __all__ to take into _extra_objects
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-16 18:13:35 -04:00 |
|
Aaron Pham
|
ded8a9f809
|
feat: quantization (#27)
|
2023-06-16 18:10:50 -04:00 |
|
Aaron Pham
|
19bc7e3116
|
feat: fine-tuning [part 1] (#23)
|
2023-06-16 00:19:01 -04:00 |
|
Aaron
|
b9ff4ab92a
|
chore: flatten examples llm-config
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-15 18:39:33 -04:00 |
|
Aaron
|
e4b7714756
|
chore(js): update metadata
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-15 13:18:05 -04:00 |
|
Aaron Pham
|
850cf791ef
|
chore: fix README.md
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-06-15 09:37:46 -04:00 |
|
Chaoyu
|
dc50a2e7e5
|
docs: add LangChain and BentoML Examples (#25)
Co-authored-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-06-15 06:14:37 -04:00 |
|
Aaron Pham
|
5e1445218b
|
refactor: toplevel CLI (#26)
Move up CLI outside of the factory function to simplify workflow
|
2023-06-15 02:32:46 -04:00 |
|
Aaron Pham [bot]
|
9a6a976ce1
|
infra: bump to dev version of 0.1.6.dev0 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
|
2023-06-15 06:16:12 +00:00 |
|
Aaron Pham [bot]
|
bb425b89d9
|
infra: prepare for release 0.1.5 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
v0.1.5
|
2023-06-15 06:05:35 +00:00 |
|
Aaron
|
528f76e1d0
|
fix(client): using httpx for running calls within async context
This is so that client.query works within a async context
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-15 01:58:49 -04:00 |
|
aarnphm-ec2-dev
|
b3d924e6d6
|
fix(dolly): make sure to use GPU when available
map device_map to auto when GPU is available
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-06-15 05:53:14 +00:00 |
|
aarnphm-ec2-dev
|
dfe71d7867
|
chore(cli): redirect download models into subcontext
utilise click subcontext for nicer CLI interaction
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-06-14 11:44:39 +00:00 |
|
Aaron
|
d7e92ae525
|
feat(cli): --device all --workers-per-resource
synonymous to the configuration arguments
add support for --device all
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-14 06:36:54 -04:00 |
|
Aaron
|
d07cc95ea0
|
ci: add hatch to dev envs
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-14 03:48:05 -04:00 |
|
Aaron Pham [bot]
|
123d9c442f
|
infra: bump to dev version of 0.1.5.dev0 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
|
2023-06-14 07:43:54 +00:00 |
|
Aaron Pham [bot]
|
f9c0a1093b
|
infra: prepare for release 0.1.4 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
v0.1.4
|
2023-06-14 07:33:16 +00:00 |
|
Aaron
|
be41c23c10
|
codegen: remove black as dependencies
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-14 03:22:05 -04:00 |
|
Aaron
|
50d59cdf8d
|
types: rename interface
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-14 02:45:34 -04:00 |
|
Aaron Pham [bot]
|
47da1916ad
|
infra: bump to dev version of 0.1.4.dev0 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
|
2023-06-14 05:56:49 +00:00 |
|
Aaron Pham [bot]
|
52d786edc7
|
infra: prepare for release 0.1.3 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
v0.1.3
|
2023-06-14 05:46:29 +00:00 |
|
Aaron
|
111d205f63
|
perf: faster LLM loading
using attrs for faster class creation opposed to metaclass
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-14 01:36:42 -04:00 |
|
Aaron
|
ebcedc35de
|
fix(exception): handle notfound explicitly
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-13 20:15:38 -04:00 |
|
Aaron
|
0ab7450e90
|
chore(types): add hints for LLMRunner
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-13 20:13:33 -04:00 |
|
Aaron
|
03c90c2a13
|
fix(llm): ensure we don't bleed runner options
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-13 20:05:33 -04:00 |
|
Aaron
|
e3ccf766d7
|
chore: expose LLMRunner for type
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-13 19:47:36 -04:00 |
|
aarnphm-ec2-dev
|
1194684658
|
fix(llm): cached load
Ensure we only load the llm once
fix falcon offloading load
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-06-13 14:22:09 +00:00 |
|
Aaron
|
74c8323e42
|
docs: update generated with href
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-13 07:30:43 -04:00 |
|
Aaron Pham [bot]
|
ece2b377c0
|
infra: bump to dev version of 0.1.3.dev0 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
|
2023-06-13 11:24:14 +00:00 |
|
Aaron Pham [bot]
|
398ed85b9b
|
infra: prepare for release 0.1.2 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
v0.1.2
|
2023-06-13 11:14:25 +00:00 |
|
Aaron
|
cb76a894cf
|
feat(metadata): add configuration to metadata endpoint
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-13 07:09:31 -04:00 |
|
Aaron Pham
|
dd20941050
|
chore: metadata (#19)
|
2023-06-13 04:09:33 -04:00 |
|
Aaron
|
764d86289c
|
chore(readme): update table with model_ids matrix
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-12 16:57:40 -04:00 |
|
Aaron Pham [bot]
|
b5547bbc97
|
infra: bump to dev version of 0.1.2.dev0 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
|
2023-06-12 20:30:48 +00:00 |
|
Aaron Pham [bot]
|
f85bbec147
|
infra: prepare for release 0.1.1 [generated]
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com>
v0.1.1
|
2023-06-12 20:19:34 +00:00 |
|
Aaron
|
71070b90b4
|
chore(metadata): fix model_id to be respected on service.py
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-12 16:04:52 -04:00 |
|
Aaron
|
4717989384
|
fix(tokenizers): allow forking by default
address message about forking in tokenizers
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-06-12 15:47:51 -04:00 |
|