Aaron
794719670e
chore: update README [skip ci]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-08-04 12:10:21 -04:00
Aaron Pham
cdc6bae0e9
infra: bump to dev version of ..1.dev0 [generated] [skip ci]
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com >
2023-08-04 15:47:20 +00:00
Aaron Pham
9d1476e360
infra: prepare for release 0.2.15 [generated] [skip ci]
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com >
v0.2.15
2023-08-04 15:32:47 +00:00
Aaron
287b7f9ab2
fix: releases issue when building new container [skip ci]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-08-04 11:31:02 -04:00
Aaron
20deb3354d
infra: bump to dev version of 0.2.15.dev0 [generated] [skip ci]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-08-04 11:11:14 -04:00
Aaron Pham
cb05446760
infra: prepare for release 0.2.14 [generated] [skip ci]
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com >
v0.2.14
2023-08-04 14:50:59 +00:00
Aaron
975a1d0349
fix: remove tokens for release [skip ci]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-08-04 10:49:06 -04:00
Aaron
1e74e967d1
fix(container): correct cache directory
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-08-04 10:31:06 -04:00
Aaron Pham
2541a0f8dc
infra: initial work on compiling mypyc wheels ( #182 )
2023-08-04 10:20:03 -04:00
Aaron Pham
2cc264aa72
fix(vllm): correctly load given model id from envvar ( #181 )
2023-08-03 16:34:35 -04:00
Aaron
db8e47bc5b
fix(build): correct module type for stubs and strip assert [skip ci]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-08-03 04:15:55 -04:00
Aaron
8f74e24c2f
fix: clone all for nightly strategy
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-08-03 03:17:18 -04:00
Aaron
b949106daf
fix(ci): rename runner name [skip ci]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-08-03 02:24:45 -04:00
Aaron Pham
e9eff70978
infra: bump to dev version of 0.2.14.dev0 [generated] [skip ci]
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com >
2023-08-03 06:18:57 +00:00
Aaron Pham
8428692d45
infra: prepare for release 0.2.13 [generated] [skip ci]
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com >
v0.2.13
2023-08-03 06:06:09 +00:00
Aaron
cac7a19be9
fix(build): to run on tags [skip ci]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-08-03 02:00:13 -04:00
aarnphm-ec2-dev
29ca9f398f
fix: add arch_list for cross compiling
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-08-03 04:33:48 +00:00
Aaron
f5eb21ede0
revert: "chore(aws): use g4dn for more availability"
...
This reverts commit a06464bdc7 .
2023-08-02 23:55:29 -04:00
aarnphm-ec2-dev
a01d867bc7
chore(base): add auto-gptq CUDA kernel
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-08-03 02:40:06 +00:00
aarnphm-ec2-dev
820b4991fa
chore(stubs): add generated for auto-gptq and vllm [skip ci]
...
This is to help with working on CPU machine
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-08-03 02:28:24 +00:00
aarnphm-ec2-dev
a06464bdc7
chore(aws): use g4dn for more availability
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-08-03 02:17:37 +00:00
Aaron
af64a6dfd5
chore(docs): update to obsidian README format
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-08-02 21:49:33 -04:00
aarnphm-ec2-dev
b349820429
fix(build): add `--device` into envvar
...
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com >
2023-08-03 00:44:40 +00:00
Aaron Pham
cfc7f3888d
chore(vllm): add all supported models ( #179 )
2023-08-02 17:42:02 -04:00
Aaron Pham
72337410cf
fix: nightly resolver for correct tag ( #177 )
2023-08-02 13:10:50 -04:00
Aaron
d4fbfa5e5c
fix: custom release strategy for correct naming
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-08-02 03:03:21 -04:00
Aaron Pham
acb81a6e1a
fix(build): dispatch container via workflow calls ( #174 )
...
add OPENLLM_USE_LOCAL_LATEST as default behaviour within container
2023-08-02 01:54:10 -04:00
Aaron
f989ebd4b9
infra: bump to dev version of 0.2.13.dev0 [generated] [skip ci]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-08-01 19:52:56 -04:00
Aaron Pham
57fdbda192
infra: prepare for release 0.2.12 [generated] [skip ci]
...
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com >
v0.2.12
2023-08-01 23:27:01 +00:00
Aaron
af54ff299f
fix(ec2): increase subnet availability to all available zone with g5
...
instances
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-08-01 16:07:41 -04:00
pre-commit-ci[bot]
c2ed1d56da
chore(release): update base container restriction ( #173 )
...
Prepare for 0.2.12 release
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-08-01 15:25:17 -04:00
Aaron
6ba8899743
fix: remove invalid OPENLLMDEVDEBUG envvar
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-08-01 01:52:08 -04:00
Aaron
961455c762
fix(cli): always --force on --push
...
feat: add --bento-version for ``openllm build``
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-08-01 00:56:46 -04:00
Aaron
ca5e3c7ae5
fix: correct setup property for envvar instance
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-07-31 23:34:42 -04:00
Aaron
16f032417e
revert: "infra: reduce instance type for more lenient"
...
This reverts commit 4a1d849203 .
2023-07-31 21:34:56 -04:00
Aaron
4a1d849203
infra: reduce instance type for more lenient
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-07-31 21:25:59 -04:00
Aaron
23c5aa5958
revert: remove unreleased changelog
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-07-31 21:07:00 -04:00
Aaron
fa0e947dd0
chore: add editorconfig [skip ci]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-07-31 20:22:14 -04:00
Aaron Pham
729e423b17
chore(bnb): filter warnings message on CPU ( #170 )
2023-07-31 15:48:59 -04:00
Aaron
19d88d4cb8
infra: ignore rev that update styling [skip ci]
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-07-31 09:07:58 -04:00
Aaron
e01853a81c
chore(infra): disable update-changelog for now [skip ci]
...
Need to figure out how to update unreleased without adding it again
probably need to do with `--keep`
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-07-31 09:05:50 -04:00
Aaron Pham
ec3c381e8c
infra: add instruction for using docker images from release notes ( #169 )
2023-07-31 08:39:10 -04:00
Aaron Pham
8c2867d26d
style: define experimental guidelines ( #168 )
2023-07-31 07:54:26 -04:00
dependabot[bot]
2c2070f69f
chore(deps): bump docker/setup-qemu-action from 2.1.0 to 2.2.0 [skip ci] ( #165 )
...
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-07-31 07:52:16 -04:00
dependabot[bot]
94c949c22c
chore(deps): bump aws-actions/configure-aws-credentials from 1 to 2 [skip ci] ( #167 )
...
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-07-31 07:45:50 -04:00
dependabot[bot]
9592ca02fb
chore(deps): bump docker/setup-buildx-action from 2.5.0 to 2.9.1 [skip ci] ( #164 )
...
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com >
2023-07-31 07:45:26 -04:00
dependabot[bot]
4d566fee09
chore(deps): bump peter-evans/create-pull-request from 4 to 5 [skip ci] ( #166 )
...
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com >
2023-07-31 07:45:05 -04:00
Aaron
b5652e7d66
fix(ci): agree with signing
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-07-31 06:40:14 -04:00
dependabot[bot]
431b326dd3
chore(deps): bump docker/login-action from 2.1.0 to 2.2.0 ( #163 )
...
Bumps [docker/login-action](https://github.com/docker/login-action ) from 2.1.0 to 2.2.0.
- [Release notes](https://github.com/docker/login-action/releases )
- [Commits](https://github.com/docker/login-action/compare/v2.1.0...v2.2.0 )
---
updated-dependencies:
- dependency-name: docker/login-action
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-07-31 09:26:06 +00:00
Aaron
ae17322b73
fix(ci): correct set digest for signing images
...
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com >
2023-07-31 04:27:15 -04:00