Aaron
af54ff299f
fix(ec2): increase subnet availability to all available zone with g5 instances
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-08-01 16:07:41 -04:00
pre-commit-ci[bot]
c2ed1d56da
chore(release): update base container restriction (#173)
...
Prepare for 0.2.12 release
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-08-01 15:25:17 -04:00
Aaron
6ba8899743
fix: remove invalid OPENLLMDEVDEBUG envvar
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-08-01 01:52:08 -04:00
Aaron
961455c762
fix(cli): always --force on --push
...
feat: add --bento-version for ``openllm build``
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-08-01 00:56:46 -04:00
Aaron
ca5e3c7ae5
fix: correct setup property for envvar instance
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-31 23:34:42 -04:00
Aaron
16f032417e
revert: "infra: reduce instance type for more lenient"
...
This reverts commit 4a1d849203.
2023-07-31 21:34:56 -04:00
Aaron
4a1d849203
infra: reduce instance type for more lenient
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-31 21:25:59 -04:00
Aaron
23c5aa5958
revert: remove unreleased changelog
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-31 21:07:00 -04:00
Aaron
fa0e947dd0
chore: add editorconfig [skip ci]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-31 20:22:14 -04:00
Aaron Pham
729e423b17
chore(bnb): filter warnings message on CPU (#170)
2023-07-31 15:48:59 -04:00
Aaron
19d88d4cb8
infra: ignore rev that update styling [skip ci]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-31 09:07:58 -04:00
Aaron
e01853a81c
chore(infra): disable update-changelog for now [skip ci]
...
Need to figure out how to update unreleased without adding it again
probably need to do with `--keep`
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-31 09:05:50 -04:00
Aaron Pham
ec3c381e8c
infra: add instruction for using docker images from release notes (#169)
2023-07-31 08:39:10 -04:00
Aaron Pham
8c2867d26d
style: define experimental guidelines (#168)
2023-07-31 07:54:26 -04:00
dependabot[bot]
2c2070f69f
chore(deps): bump docker/setup-qemu-action from 2.1.0 to 2.2.0 [skip ci] (#165)
...
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-07-31 07:52:16 -04:00
dependabot[bot]
94c949c22c
chore(deps): bump aws-actions/configure-aws-credentials from 1 to 2 [skip ci] (#167)
...
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-07-31 07:45:50 -04:00
dependabot[bot]
9592ca02fb
chore(deps): bump docker/setup-buildx-action from 2.5.0 to 2.9.1 [skip ci] (#164)
...
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-07-31 07:45:26 -04:00
dependabot[bot]
4d566fee09
chore(deps): bump peter-evans/create-pull-request from 4 to 5 [skip ci] (#166)
...
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-07-31 07:45:05 -04:00
Aaron
b5652e7d66
fix(ci): agree with signing
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-31 06:40:14 -04:00
dependabot[bot]
431b326dd3
chore(deps): bump docker/login-action from 2.1.0 to 2.2.0 (#163)
...
Bumps [docker/login-action](https://github.com/docker/login-action) from 2.1.0 to 2.2.0.
- [Release notes](https://github.com/docker/login-action/releases)
- [Commits](https://github.com/docker/login-action/compare/v2.1.0...v2.2.0)
---
updated-dependencies:
- dependency-name: docker/login-action
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-07-31 09:26:06 +00:00
Aaron
ae17322b73
fix(ci): correct set digest for signing images
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-31 04:27:15 -04:00
Aaron Pham
4fbfb363bf
infra: update changelog and added readme badges [generated] (#162)
2023-07-31 04:02:02 -04:00
Aaron Pham
fec68d732b
fix(ci): Correctly set signing for pushing container images (#161)
2023-07-31 03:43:07 -04:00
Aaron Pham
ef94c6b98a
feat(container): vLLM build and base image strategies (#142)
2023-07-31 02:44:52 -04:00
RichardScottOZ
001ff6b5ac
docs: update README.md typos (#155)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-29 05:40:40 -04:00
Aaron Pham
0c79fabd1a
chore(release): add darwin binary to release notes (#154)
2023-07-28 15:00:42 -04:00
Aaron Pham
4de0ca8a13
infra: bump to dev version of 0.2.12.dev0 [generated] [skip ci]
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-07-28 00:14:52 +00:00
Aaron Pham
7d9dcb5d40
infra: prepare for release 0.2.11 [generated] [skip ci]
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.2.11
2023-07-28 00:04:32 +00:00
aarnphm-ec2-dev
fc66ff275b
fix: make sure to add torch to dependencies
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-28 00:01:52 +00:00
Aaron Pham
15640a85cd
feat: supports embeddings for T5 and ChatGLM family generation (#153)
2023-07-27 16:43:43 -04:00
Aaron Pham
e075bd25ea
chore: add NousResearch's as non-gated Llama (#152)
2023-07-27 15:30:56 -04:00
Aaron Pham
eacd8d9f46
fix(pre-commit): disable auto fixes (#151)
2023-07-27 13:37:09 -04:00
Aaron Pham
4fae00b68b
fix(ci): correct tag for checkout (#150)
2023-07-25 14:11:03 -04:00
Aaron Pham
e0a90c8d7e
infra: bump to dev version of 0.2.11.dev0 [generated] [skip ci]
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-07-25 17:03:38 +00:00
Aaron Pham
c97d39380c
infra: prepare for release 0.2.10 [generated] [skip ci]
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.2.10
2023-07-25 16:49:50 +00:00
Aaron
e000e7d1c6
fix(ci): release correct version via git
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-25 12:48:19 -04:00
aarnphm-ec2-dev
6dc0bf0b12
fix: remove breakpoint on CLI
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-25 16:30:16 +00:00
aarnphm-ec2-dev
b23b59e1c9
fix(embeddings): correctly set JSON data via CLI client
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-25 16:26:01 +00:00
aarnphm-ec2-dev
56bf84a760
fix(ci): make sure to exclude generated _version.py
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-25 09:55:24 +00:00
Aaron Pham
1940086bec
feat(client): embeddings (#146)
2023-07-25 05:44:21 -04:00
Aaron Pham
dcd34bd381
fix(build): running bento insider container (#141)
...
Behaviour of `docker run` should be the same with `openllm start`
2023-07-25 04:24:28 -04:00
Aaron Pham
afb2d34673
docs: update fine tuning model support (#145)
2023-07-25 04:21:52 -04:00
Aaron Pham
a80fb4635d
docs: remove extraneous whitespace (#144)
2023-07-25 04:19:55 -04:00
Aaron Pham
c391717226
feat(ci): automatic release semver + git archival installation (#143)
2023-07-25 04:18:49 -04:00
Aaron Pham
5635ce8d87
infra: bump to dev version of 0.2.10.dev0 [generated] [skip ci]
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-07-24 23:35:04 +00:00
Aaron Pham
fb656164e1
infra: prepare for release 0.2.9 [generated] [skip ci]
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.2.9
2023-07-24 23:24:09 +00:00
aarnphm-ec2-dev
084786c898
fix(cli): `openllm models` for showing available
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-24 23:00:03 +00:00
Chaoyu
0fe9d83ff3
docs: Update README.md (#139)
2023-07-24 18:11:20 -04:00
Aaron Pham
60c725a21f
ci: release PyPI before building binary (#138)
2023-07-24 16:39:51 -04:00
Aaron Pham
e72f0d55f4
infra: bump to dev version of 0.2.9.dev0 [generated] [skip ci]
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-07-24 19:58:13 +00:00