Aaron Pham
8c2867d26d
style: define experimental guidelines (#168)
2023-07-31 07:54:26 -04:00
Aaron Pham
4fbfb363bf
infra: update changelog and added readme badges [generated] (#162)
2023-07-31 04:02:02 -04:00
RichardScottOZ
001ff6b5ac
docs: update README.md typos (#155)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-29 05:40:40 -04:00
Aaron Pham
15640a85cd
feat: supports embeddings for T5 and ChatGLM family generation (#153)
2023-07-27 16:43:43 -04:00
Aaron Pham
e075bd25ea
chore: add NousResearch's as non-gated Llama (#152)
2023-07-27 15:30:56 -04:00
Aaron Pham
4fae00b68b
fix(ci): correct tag for checkout (#150)
2023-07-25 14:11:03 -04:00
Aaron Pham
1940086bec
feat(client): embeddings (#146)
2023-07-25 05:44:21 -04:00
Aaron Pham
afb2d34673
docs: update fine tuning model support (#145)
2023-07-25 04:21:52 -04:00
Aaron Pham
a80fb4635d
docs: remove extraneous whitespace (#144)
2023-07-25 04:19:55 -04:00
Aaron Pham
c391717226
feat(ci): automatic release semver + git archival installation (#143)
2023-07-25 04:18:49 -04:00
Chaoyu
0fe9d83ff3
docs: Update README.md (#139)
2023-07-24 18:11:20 -04:00
Aaron Pham
693631958a
feat(service): provisional API (#133)
2023-07-23 02:15:39 -04:00
Aaron Pham
1b3508619e
feat(llama): add default prompt for LlaMA-2 (#122)
2023-07-20 07:46:33 -04:00
Aaron
d92b136780
chore(llama): remove decapoda variants
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-19 18:58:04 -04:00
Aaron Pham
c1ddb9ed7c
feat: GPTQ + vLLM and LlaMA (#113)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-19 18:12:12 -04:00
HeTaoPKU
fd9ae56812
fix(baichuan): add "cpm-kernel" as additional requirements (#117)
...
This is to support the 13b variant of baichuan
Co-authored-by: the <tao.he@hulu.com>
Co-authored-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-15 23:16:05 -04:00
HeTaoPKU
09b0787306
feat(models): Baichuan (#115)
...
Co-authored-by: the <tao.he@hulu.com>
Co-authored-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-15 22:01:37 -04:00
Aaron Pham
b2dba6143f
fix(resource): correctly parse CUDA_VISIBLE_DEVICES (#114)
2023-07-15 07:19:35 -04:00
Aaron Pham
b291526248
revert: remove badges
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-07-11 14:44:36 -04:00
Aaron Pham
c7f4dc7bb2
feat(test): snapshot testing (#107)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-10 17:23:19 -04:00
Aaron Pham
fb849a384e
feat: GPTNeoX (#106)
2023-07-07 03:05:40 -04:00
Aaron Pham
d6303d306a
perf: fixing import custom paths and cleanup serialisation (#102)
2023-07-04 12:49:14 -04:00
Aaron Pham
0e45ff7bb7
feat: add citation (#103)
2023-07-03 18:44:43 -04:00
Aaron Pham
3a66b1f5ba
docs: add notes about fine-tuning
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-06-29 10:20:45 -04:00
Aaron Pham
01db504e7d
feat: MPT (#91)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-06-28 23:12:15 -04:00
Steve Guo
b96d944506
doc: Update bentocloud deployment (#75)
...
Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-06-26 20:24:39 -04:00
Aaron Pham
acb6a3cb32
fix: converting envvar to string (#68)
2023-06-25 03:40:45 -04:00
Aaron Pham
dfca956fad
feat: serve adapter layers (#52)
2023-06-23 10:07:15 -04:00
Aaron Pham
ae2474a017
docs: update typo and links
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-06-22 05:03:28 -04:00
Aaron
38ab523fc8
docs: update LangChain integration
...
Finally it is merged onto LangChain upstream
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-22 04:19:51 -04:00
Ikko Eltociear Ashimine
9c6b43b163
docs: rename camel case to official Hugging Face name (#39)
2023-06-20 12:02:03 -04:00
Aaron
9139e6f290
docs: update README to use OPT as example
...
This helps on systems with less memory and RAM
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-19 13:40:10 -04:00
Aaron
752c2e60a5
fix: remove direct url reference
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-19 13:25:29 -04:00
Aaron
2244cce5bd
fix(config): __getitem__ to get the value instead of member of class
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-19 05:35:11 -04:00
Aaron
a7a6775c68
chore: add banner for OpenLLM
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-18 05:55:38 -04:00
Aaron
33d3523e5b
chore(readme): update docs and warning notes
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-18 01:39:15 -04:00
Aaron Pham
4fcd7c8ac9
integration: HuggingFace Agent (#29)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-06-18 00:13:53 -04:00
Aaron Pham
19bc7e3116
feat: fine-tuning [part 1] (#23)
2023-06-16 00:19:01 -04:00
Aaron Pham
850cf791ef
chore: fix README.md
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-06-15 09:37:46 -04:00
Aaron
74c8323e42
docs: update generated with href
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-13 07:30:43 -04:00
Aaron Pham
dd20941050
chore: metadata (#19)
2023-06-13 04:09:33 -04:00
Aaron
764d86289c
chore(readme): update table with model_ids matrix
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-12 16:57:40 -04:00
Chaoyu
187a5f834f
docs: add --model-id command (#18)
...
Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-06-12 14:03:36 -04:00
Jian Shen
d3bbb727ea
doc: add gif to readme
2023-06-12 15:51:08 +08:00
Aaron
06c90c0ba3
docs: update matrix [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-11 00:48:09 -04:00
aarnphm-ec2-dev
c669d38dea
fix(flan-t5): casting model to CUDA
...
Add a note about GPU support for Flax
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-10 02:55:55 -04:00
Aaron
e90d90e9a0
feat(docs): copy button from table list
...
the script now generates an HTML table, which allows us to use the
copy button from the README.md
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-10 01:23:56 -04:00
Aaron
7d382ced4f
chore(docs): update notes about flan-t5
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-10 00:22:12 -04:00
Chaoyu
e2b26adf2f
chore(docs): update README.md
...
See #12
2023-06-10 00:21:21 -04:00
Aaron
1597d5d4bb
chore(readme): update stablelm [generated]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-10 00:21:21 -04:00