Commit Graph

1522 Commits

Author SHA1 Message Date
Aaron Pham
45aceb172f feat(API): add light support for batch inference (#1004)
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-31 20:36:12 -04:00
Aaron Pham
162458ffe6 infra: prepare for release 0.5.3 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.5.3
2024-05-30 21:30:21 +00:00
Aaron Pham
12f0d45a9d fix(client): make sure to initialised helpers class correctly
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2024-05-30 17:26:09 -04:00
Aaron Pham
49908ec289 infra: prepare for release 0.5.2 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.5.2
2024-05-29 04:44:59 +00:00
paperspace
7fca472a66 chore: update readme [skip ci]
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-29 04:43:50 +00:00
Aaron Pham
e9e46b2cc7 chore: update examples and readme
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2024-05-29 00:41:32 -04:00
paperspace
02010d3499 fix: synchronize into llm_config dict
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-29 04:31:34 +00:00
paperspace
ef11e54a6d chore: update docs and base instruction [skip ci]
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-29 03:19:47 +00:00
Aaron Pham
439f10c786 chore(ci): make sure binary depends on publish to avoid race condition
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2024-05-28 22:48:08 -04:00
Aaron Pham
5ff77d1f6e infra: prepare for release 0.5.1 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.5.1
2024-05-29 02:44:23 +00:00
paperspace
c820cececb fix(generate): make sure to only pass prompt_token_ids if it is a valid
mutable

Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-29 02:42:13 +00:00
Aaron Pham
fd527352b2 chore(ci): cleanup unnecessary dev refresh cycle
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2024-05-27 14:34:33 -04:00
Aaron Pham
2314a3667e infra: prepare for release 0.5.0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.5.0
2024-05-27 18:20:30 +00:00
paperspace
cc1b23da57 ci: final check for releases [skip ci]
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-27 18:19:03 +00:00
pre-commit-ci[bot]
fc01ad71ea ci: pre-commit autoupdate [pre-commit.ci] [skip ci] (#1002)
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.4.4 → v0.4.5](https://github.com/astral-sh/ruff-pre-commit/compare/v0.4.4...v0.4.5)

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-05-27 14:09:26 -04:00
Aaron Pham (mbp16)
ed868bad5c chore: update releases script
Signed-off-by: Aaron Pham (mbp16) <29749331+aarnphm@users.noreply.github.com>
2024-05-27 14:07:46 -04:00
paperspace
9da0b4134c chore(qol): make envvar private
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-27 18:07:38 +00:00
Aaron Pham
a4a6060f69 infra: prepare for release 0.5.0-alpha.15 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.5.0-alpha.15
2024-05-27 17:53:45 +00:00
paperspace
07655c9ba8 chore(build): remove vllm_version envvar and lock into templates
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-27 17:49:58 +00:00
paperspace
ba5a5da720 chore: udpate docstring
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-27 17:02:26 +00:00
paperspace
0f32290606 chore(packages): ready for 0.5 releases
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-27 16:54:53 +00:00
Aaron Pham (mbp16)
20ac656018 fix(deps): make sure core deps are available on setup
Signed-off-by: Aaron Pham (mbp16) <29749331+aarnphm@users.noreply.github.com>
2024-05-27 12:44:22 -04:00
Aaron Pham (mbp16)
f4f7f16e81 chore(releases): remove deadcode
Signed-off-by: Aaron Pham (mbp16) <29749331+aarnphm@users.noreply.github.com>
2024-05-27 12:37:50 -04:00
Aaron Pham (mbp16)
da42c269c9 fix(ci): remove checking for hatch
since we don't use it on the script any longer.

Signed-off-by: Aaron Pham (mbp16) <29749331+aarnphm@users.noreply.github.com>
2024-05-27 12:12:40 -04:00
dependabot[bot]
5fcfa51573 chore(deps): bump taiki-e/install-action from 2.33.22 to 2.33.34 (#1000)
Bumps [taiki-e/install-action](https://github.com/taiki-e/install-action) from 2.33.22 to 2.33.34.
- [Release notes](https://github.com/taiki-e/install-action/releases)
- [Changelog](https://github.com/taiki-e/install-action/blob/main/CHANGELOG.md)
- [Commits](c2927f0c5b...60784cb1f4)

---
updated-dependencies:
- dependency-name: taiki-e/install-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-05-27 11:59:51 -04:00
paperspace
e84cd815e4 fix(releases): make sure to use correct hash
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-27 15:59:09 +00:00
paperspace
1eea284810 chore: update actions to v5
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-27 15:52:38 +00:00
Aaron
6d870c06a0 fix(ci): make sure to use distinct name pattern for v4 upload
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2024-05-26 22:25:03 -04:00
paperspace
6fad74e510 fix: setup python for binary distribution
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-26 17:29:14 +00:00
paperspace
479f44ce04 fix: make sure to separate PR and main run
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-26 17:24:16 +00:00
Aaron Pham
f248ea25cd feat(ci): running CI on paperspace (#998)
* chore: update tiny script

Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>

* feat(ci): running on paperspace machines

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore: update models and increase timeout readiness

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* fix: schema validation for inputs and update client supporting stop

Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>

* chore: update coverage config

Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>

* chore: remove some non-essentials

Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>

* chore: update locks

Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2024-05-26 13:14:54 -04:00
paperspace
a58e12d116 fix: ci
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-26 08:38:24 +00:00
Aaron Pham
3f048d8a5b chore(qol): update CLI options and performance upgrade for build cache (#997)
* chore(qol): update CLI options and performance upgrade for build cache

Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>

* chore: update default python version for dev

Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>

* fix: install custom tar.gz models

Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-26 04:17:23 -04:00
Aaron
bc0be036d5 infra: enable releases cycle for next dev versions
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2024-05-23 12:59:02 -04:00
Aaron
315aaa0c26 revert: "chore(deps): bump actions/upload-artifact from 3.1.3 to 4.3.3 (#986)"
This reverts commit dda3b1da3e.
2024-05-23 12:55:11 -04:00
Aaron
bf31d3f9d6 revert: "chore(deps): bump actions/download-artifact from 3.0.2 to 4.1.7 (#990)"
This reverts commit 346ab20cca.
2024-05-23 12:55:01 -04:00
dependabot[bot]
346ab20cca chore(deps): bump actions/download-artifact from 3.0.2 to 4.1.7 (#990)
Bumps [actions/download-artifact](https://github.com/actions/download-artifact) from 3.0.2 to 4.1.7.
- [Release notes](https://github.com/actions/download-artifact/releases)
- [Commits](9bc31d5ccc...65a9edc588)

---
updated-dependencies:
- dependency-name: actions/download-artifact
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-05-23 12:51:35 -04:00
dependabot[bot]
dda3b1da3e chore(deps): bump actions/upload-artifact from 3.1.3 to 4.3.3 (#986)
Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact) from 3.1.3 to 4.3.3.
- [Release notes](https://github.com/actions/upload-artifact/releases)
- [Commits](a8a3f3ad30...65462800fd)

---
updated-dependencies:
- dependency-name: actions/upload-artifact
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-05-23 12:51:18 -04:00
Aaron Pham
5e97329bcb infra: prepare 0.5 releases (#996)
* chore: prepare for 0.5

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore: update changelogs

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore: fix to lowest python version supported

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore: update scripts

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2024-05-23 12:50:01 -04:00
paperspace
a410b9cfe8 infra: update README
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-23 16:44:03 +00:00
Aaron Pham
fa850bafeb infra: prepare for release 0.5.0-alpha.14 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.5.0-alpha.14
2024-05-23 14:43:49 +00:00
paperspace
cec0aa5487 fix(memory): correctly recommend instance types for cloud
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-23 14:42:39 +00:00
paperspace
db523e2940 chore(infra): add support for dry running versioning for releases
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-23 14:06:08 +00:00
Aaron Pham
97d76eec85 tests: add additional basic testing (#982)
* chore: update rebase tests

Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>

* chore: update partial clients before removing

Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>

* fix: update clients parsing logics to work with 0.5

Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>

* chore: ignore ci runs as to run locally

Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>

* chore: update async client tests

Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>

* chore: update pre-commit

Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-23 10:02:23 -04:00
Aaron Pham
5cb5203eea infra: prepare for release 0.5.0-alpha.13 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.5.0-alpha.13
2024-05-22 15:17:25 +00:00
paperspace
b7193511e6 fix: correct update default value for dict unpacking
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
2024-05-22 15:15:23 +00:00
Dennis Rall
362c099f97 fix(docs): update correct BentoML links (#995)
Signed-off-by: Dennis Rall <56480601+dennisrall@users.noreply.github.com>
2024-05-22 11:14:08 -04:00
dependabot[bot]
40db3dee08 chore(deps): bump aquasecurity/trivy-action from 0.19.0 to 0.20.0 (#991)
Bumps [aquasecurity/trivy-action](https://github.com/aquasecurity/trivy-action) from 0.19.0 to 0.20.0.
- [Release notes](https://github.com/aquasecurity/trivy-action/releases)
- [Commits](d710430a67...b2933f565d)

---
updated-dependencies:
- dependency-name: aquasecurity/trivy-action
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-05-22 11:00:51 -04:00
dependabot[bot]
d72b4bf82d chore(deps): bump github/codeql-action from 3.25.3 to 3.25.5 (#992)
Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.25.3 to 3.25.5.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](d39d31e687...b7cec75265)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-05-22 11:00:40 -04:00
dependabot[bot]
95d471410d chore(deps): bump actions/checkout from 4.1.5 to 4.1.6 (#993)
Bumps [actions/checkout](https://github.com/actions/checkout) from 4.1.5 to 4.1.6.
- [Release notes](https://github.com/actions/checkout/releases)
- [Changelog](https://github.com/actions/checkout/blob/main/CHANGELOG.md)
- [Commits](44c2b7a8a4...a5ac7e51b4)

---
updated-dependencies:
- dependency-name: actions/checkout
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-05-22 11:00:29 -04:00