Aaron Pham [bot]
54be563b3e
infra: bump to dev version of 0.1.11.dev0 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-21 17:59:01 +00:00
Aaron Pham [bot]
5a69345d29
infra: prepare for release 0.1.10 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-21 17:53:27 +00:00
Aaron
a85953d6bf
fix(build): make sure not to overwrite missing openllm
...
in build without dev mode
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-21 13:51:37 -04:00
Aaron Pham [bot]
703cb21e96
infra: bump to dev version of 0.1.10.dev0 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-21 11:34:24 +00:00
Aaron Pham [bot]
278757d58a
infra: prepare for release 0.1.9 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-21 11:30:44 +00:00
aarnphm-ec2-dev
ef40fdf5c8
fix(build): quote environment variables
...
Make sure that the config is quoted properly in generated Dockerfile
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-21 11:28:37 +00:00
Aaron
de665def5c
fix(cli): support loading model-id from local path
...
SDK should already support loading from local-path, but on CLI there was
a bug with start where it sets the choice for model-id to only
pretrained set of model-id
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-21 07:26:12 -04:00
aarnphm-ec2-dev
d33149d758
fix(log): repr the given LLMConfig in debug mode
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-19 18:20:18 +00:00
Aaron Pham
ca802d9d1a
fix: agent log ( #37 )
2023-06-19 14:11:39 -04:00
Aaron Pham [bot]
78a537079e
infra: bump to dev version of 0.1.9.dev0 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-19 18:08:32 +00:00
aarnphm-ec2-dev
70c7c0a9b7
fix(cli): use correct API for client
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-19 18:04:27 +00:00
Aaron Pham [bot]
6bbbefd06a
infra: prepare for release 0.1.8 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-19 18:02:08 +00:00
aarnphm-ec2-dev
6d43bdbcdb
fix(instruct): remove breakpoint
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-19 18:00:00 +00:00
Aaron Pham [bot]
0e3f8d2fba
infra: bump to dev version of 0.1.8.dev0 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-19 17:30:50 +00:00
Aaron Pham [bot]
9a6af97356
infra: prepare for release 0.1.7 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-19 17:27:52 +00:00
aarnphm-ec2-dev
feb0c53146
fix(timeout): increase default timeout to avoid asyncio error
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-19 17:02:42 +00:00
Aaron
58758f8241
fix(dolly_v2): gc collect after import
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-19 12:28:13 -04:00
Aaron
1ed0ae7787
fix(log): make sure to configure OpenLLM logs correctly
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-19 06:19:06 -04:00
Aaron
2244cce5bd
fix(config): __getitem__ to get the value instead of member of class
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-19 05:35:11 -04:00
Aaron Pham
03758a5487
fix(tools): adhere to style guidelines ( #31 )
2023-06-18 20:03:17 -04:00
Aaron Pham
4fcd7c8ac9
integration: HuggingFace Agent ( #29 )
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-06-18 00:13:53 -04:00
Aaron Pham [bot]
9be65a813b
infra: bump to dev version of 0.1.7.dev0 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-17 13:12:46 +00:00
Aaron Pham [bot]
ed398c38f8
infra: prepare for release 0.1.6 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-17 13:02:47 +00:00
Aaron Pham
6f724416c0
perf: build quantization and better transformer behaviour ( #28 )
...
Fixes quantization_config and low_cpu_mem_usage to be available on PyTorch implementation only
See changelog for more details on #28
2023-06-17 08:56:14 -04:00
Aaron
233d4697b5
chore: update __all__ to take into _extra_objects
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-16 18:13:35 -04:00
Aaron Pham
ded8a9f809
feat: quantization ( #27 )
2023-06-16 18:10:50 -04:00
Aaron Pham
19bc7e3116
feat: fine-tuning [part 1] ( #23 )
2023-06-16 00:19:01 -04:00
Aaron
b9ff4ab92a
chore: flatten examples llm-config
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-15 18:39:33 -04:00
Aaron
e4b7714756
chore(js): update metadata
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-15 13:18:05 -04:00
Aaron Pham
5e1445218b
refactor: toplevel CLI ( #26 )
...
Move up CLI outside of the factory function to simplify workflow
2023-06-15 02:32:46 -04:00
Aaron Pham [bot]
9a6a976ce1
infra: bump to dev version of 0.1.6.dev0 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-15 06:16:12 +00:00
Aaron Pham [bot]
bb425b89d9
infra: prepare for release 0.1.5 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-15 06:05:35 +00:00
Aaron
528f76e1d0
fix(client): using httpx for running calls within async context
...
This is so that client.query works within a async context
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-15 01:58:49 -04:00
aarnphm-ec2-dev
b3d924e6d6
fix(dolly): make sure to use GPU when available
...
map device_map to auto when GPU is available
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-15 05:53:14 +00:00
aarnphm-ec2-dev
dfe71d7867
chore(cli): redirect download models into subcontext
...
utilise click subcontext for nicer CLI interaction
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-14 11:44:39 +00:00
Aaron
d7e92ae525
feat(cli): --device all --workers-per-resource
...
synonymous to the configuration arguments
add support for --device all
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-14 06:36:54 -04:00
Aaron Pham [bot]
123d9c442f
infra: bump to dev version of 0.1.5.dev0 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-14 07:43:54 +00:00
Aaron Pham [bot]
f9c0a1093b
infra: prepare for release 0.1.4 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-14 07:33:16 +00:00
Aaron
be41c23c10
codegen: remove black as dependencies
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-14 03:22:05 -04:00
Aaron
50d59cdf8d
types: rename interface
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-14 02:45:34 -04:00
Aaron Pham [bot]
47da1916ad
infra: bump to dev version of 0.1.4.dev0 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-14 05:56:49 +00:00
Aaron Pham [bot]
52d786edc7
infra: prepare for release 0.1.3 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-14 05:46:29 +00:00
Aaron
111d205f63
perf: faster LLM loading
...
using attrs for faster class creation opposed to metaclass
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-14 01:36:42 -04:00
Aaron
ebcedc35de
fix(exception): handle notfound explicitly
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-13 20:15:38 -04:00
Aaron
0ab7450e90
chore(types): add hints for LLMRunner
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-13 20:13:33 -04:00
Aaron
03c90c2a13
fix(llm): ensure we don't bleed runner options
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-13 20:05:33 -04:00
Aaron
e3ccf766d7
chore: expose LLMRunner for type
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-06-13 19:47:36 -04:00
aarnphm-ec2-dev
1194684658
fix(llm): cached load
...
Ensure we only load the llm once
fix falcon offloading load
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-06-13 14:22:09 +00:00
Aaron Pham [bot]
ece2b377c0
infra: bump to dev version of 0.1.3.dev0 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-13 11:24:14 +00:00
Aaron Pham [bot]
398ed85b9b
infra: prepare for release 0.1.2 [generated]
...
Signed-off-by: Aaron Pham [bot] <29749331+aarnphm@users.noreply.github.com >
2023-06-13 11:14:25 +00:00