Commit Graph

104 Commits

Author SHA1 Message Date
Aaron
af54ff299f fix(ec2): increase subnet availability to all available zone with g5
instances

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-08-01 16:07:41 -04:00
Aaron
16f032417e revert: "infra: reduce instance type for more lenient"
This reverts commit 4a1d849203.
2023-07-31 21:34:56 -04:00
Aaron
4a1d849203 infra: reduce instance type for more lenient
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-31 21:25:59 -04:00
Aaron
e01853a81c chore(infra): disable update-changelog for now [skip ci]
Need to figure out how to update unreleased without adding it again

probably need to do with `--keep`

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-31 09:05:50 -04:00
Aaron Pham
ec3c381e8c infra: add instruction for using docker images from release notes (#169) 2023-07-31 08:39:10 -04:00
dependabot[bot]
2c2070f69f chore(deps): bump docker/setup-qemu-action from 2.1.0 to 2.2.0 [skip ci] (#165)
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-07-31 07:52:16 -04:00
dependabot[bot]
94c949c22c chore(deps): bump aws-actions/configure-aws-credentials from 1 to 2 [skip ci] (#167)
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-07-31 07:45:50 -04:00
dependabot[bot]
9592ca02fb chore(deps): bump docker/setup-buildx-action from 2.5.0 to 2.9.1 [skip ci] (#164)
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-07-31 07:45:26 -04:00
dependabot[bot]
4d566fee09 chore(deps): bump peter-evans/create-pull-request from 4 to 5 [skip ci] (#166)
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-07-31 07:45:05 -04:00
Aaron
b5652e7d66 fix(ci): agree with signing
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-31 06:40:14 -04:00
dependabot[bot]
431b326dd3 chore(deps): bump docker/login-action from 2.1.0 to 2.2.0 (#163)
Bumps [docker/login-action](https://github.com/docker/login-action) from 2.1.0 to 2.2.0.
- [Release notes](https://github.com/docker/login-action/releases)
- [Commits](https://github.com/docker/login-action/compare/v2.1.0...v2.2.0)

---
updated-dependencies:
- dependency-name: docker/login-action
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-07-31 09:26:06 +00:00
Aaron
ae17322b73 fix(ci): correct set digest for signing images
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-31 04:27:15 -04:00
Aaron Pham
4fbfb363bf infra: update changelog and added readme badges [generated] (#162) 2023-07-31 04:02:02 -04:00
Aaron Pham
fec68d732b fix(ci): Correctly set signing for pushing container images (#161) 2023-07-31 03:43:07 -04:00
Aaron Pham
ef94c6b98a feat(container): vLLM build and base image strategies (#142) 2023-07-31 02:44:52 -04:00
Aaron Pham
0c79fabd1a chore(release): add darwin binary to release notes (#154) 2023-07-28 15:00:42 -04:00
Aaron Pham
4fae00b68b fix(ci): correct tag for checkout (#150) 2023-07-25 14:11:03 -04:00
Aaron
e000e7d1c6 fix(ci): release correct version via git
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-25 12:48:19 -04:00
Aaron Pham
c391717226 feat(ci): automatic release semver + git archival installation (#143) 2023-07-25 04:18:49 -04:00
Aaron Pham
60c725a21f ci: release PyPI before building binary (#138) 2023-07-24 16:39:51 -04:00
aarnphm-ec2-dev
d4f3cf8b75 fix(llm): ignore quantization config when --quantize int4 is passed
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-22 22:45:46 +00:00
Aaron Pham
31bd2fe31b chore(ci): better release flow (#131) 2023-07-21 21:53:57 -04:00
Aaron
8a42832360 chore(ci): simplify bump dev workflow [skip ci]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-21 14:20:30 -04:00
aarnphm-ec2-dev
39f7725870 fix(ci): skip running actions on generated commit
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-21 15:48:22 +00:00
aarnphm-ec2-dev
8eb143ff60 fix(ci): remove unecessary semver check for release notes [skip ci]
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-21 06:31:49 +00:00
Aaron Pham
ee7fa63a50 ci: using tokens for publishing (#129) 2023-07-20 21:14:14 -04:00
Aaron
5189d2e721 fix(script): correct patch version for __about__.py
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-20 18:35:31 -04:00
aarnphm-ec2-dev
c88950655c fix(ci): make sure to run publish and dev prep correctly
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-20 22:05:05 +00:00
aarnphm-ec2-dev
a056365d48 fix(ci): always run create coverage
this is to stop evergreen to fail on main

Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-20 13:01:32 +00:00
Aaron Pham
1b3508619e feat(llama): add default prompt for LlaMA-2 (#122) 2023-07-20 07:46:33 -04:00
aarnphm-ec2-dev
5f874da4e2 fix: broken logics for upload
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 23:55:35 +00:00
aarnphm-ec2-dev
292bca68c7 fix(ci): releas script correctly parse version
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 23:41:33 +00:00
aarnphm-ec2-dev
4beb040cfd chore(ci): remove macos packaging
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 23:35:49 +00:00
aarnphm-ec2-dev
e69042a8e9 fix(script): not gpg signing tag
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 23:12:04 +00:00
aarnphm-ec2-dev
9d28e0a1e6 fix(ci): disable gpg push
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 23:09:27 +00:00
aarnphm-ec2-dev
b747e3b4b8 fix(ci): remove signing
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 23:01:53 +00:00
Aaron
b6679be301 fix: release script not to signed
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-19 18:48:01 -04:00
aarnphm-ec2-dev
e319a2977f fix(ci): editable install
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 22:29:29 +00:00
Aaron Pham
c1ddb9ed7c feat: GPTQ + vLLM and LlaMA (#113)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-19 18:12:12 -04:00
dependabot[bot]
9833d2f46f fix(ci): correct setup tests and auto-bot (#118)
Bump pypa/gh-action-pypi-publish from 1.8.7 to 1.8.8

Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-17 14:37:46 -04:00
aarnphm-ec2-dev
d37d14e52b fix(tests): mark package on CI to xfail
XXX: @aarnphm to solve build isolation when have bandwidth. Currently
this is not a problem when running locally.

`openllm build` just works, where as `openllm.build` won't work
sequentially.

Address some type stubs for jupytext

Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-15 12:48:28 +00:00
Aaron Pham
b2dba6143f fix(resource): correctly parse CUDA_VISIBLE_DEVICES (#114) 2023-07-15 07:19:35 -04:00
Aaron Pham
c7f4dc7bb2 feat(test): snapshot testing (#107)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-10 17:23:19 -04:00
aarnphm-ec2-dev
d3e4b95e84 ci: use trusted pypi publisher
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-07 07:18:13 +00:00
Aaron
ec4293091d ci: wait for auto-bot to run check
only run evergreen on PR

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-05 10:53:45 -04:00
Aaron
0892d10436 ci: increase postBuffer for release action
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-05 09:08:13 -04:00
Aaron
183019616b fix: remove editable install
Setup via hatch already

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-05 08:50:06 -04:00
Aaron Pham
59b1d89971 feat: custom dockerfile templates (#95)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-06-30 13:44:11 -04:00
Aaron
c5977c8fdd chore(ci): add checkout version for 3.11 and sign push
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-29 02:55:33 -04:00
Aaron
c76dea468e chore(ci): fixes release script to correct version
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-06-29 02:38:44 -04:00