Commit Graph

1233 Commits

Author SHA1 Message Date
Aaron Pham
aab173cd99 refactor: focus (#730)
* perf: remove based images

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore: update changelog

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore: move dockerifle to run on release only

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore: cleanup unused types

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-24 01:11:31 -05:00
Aaron Pham
52a44b1bfa chore: cleanup loader (#729)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-22 21:51:51 -05:00
Aaron Pham
5442d9cd10 fix(trust_remote_code): handle args correctly (#727)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-22 17:03:13 -05:00
Aaron Pham
79c9608735 infra: reduce wait time to around 7 mins (#726)
Seems like the release process for PyPI usually takes from 4-7 minutes

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-22 07:28:36 -05:00
Aaron Pham
831bb8c497 infra: bump to homebrew tap release to 0.4.26 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-22 11:59:36 +00:00
Aaron Pham
80842ad501 infra: bump to dev version of 0.4.27.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-22 11:58:42 +00:00
Aaron Pham
7eae50377d infra: prepare for release 0.4.26 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.26
2023-11-22 11:50:50 +00:00
Aaron Pham
b28b5269b5 feat(openai): chat templates and complete control of prompt generation (#725)
* feat(openai): chat templates and complete control of prompt generation

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

* fix: correctly use base chat templates

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

* fix: remove symlink

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-22 06:49:14 -05:00
Aaron Pham
7aa0918a6f fix(client): correct schemas parser from correct response output (#724)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-22 05:01:35 -05:00
Aaron Pham
f83f64ffd7 fix(infra): setup higher timer for building container images (#723)
* fix(infra): setup higher timer for building container images

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore: remove invalid tests

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-22 05:00:33 -05:00
Aaron Pham
6dd07580e2 infra: bump to homebrew tap release to 0.4.25 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-22 09:35:41 +00:00
Aaron Pham
1876609a67 infra: bump to dev version of 0.4.26.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-22 09:34:22 +00:00
Aaron Pham
0189342730 infra: prepare for release 0.4.25 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.25
2023-11-22 09:22:45 +00:00
Aaron Pham
63d86faa32 fix(openai): correct stop tokens and finish_reason state (#722)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-22 04:21:13 -05:00
Aaron Pham
06626e7d1e infra: bump to homebrew tap release to 0.4.24 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-22 06:51:39 +00:00
Aaron Pham
c9b23638a5 infra: bump to dev version of 0.4.25.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-22 06:50:35 +00:00
Aaron Pham
7f09f9daf2 infra: prepare for release 0.4.24 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.24
2023-11-22 06:34:30 +00:00
Aaron
d697ea3903 fix(image): setup correct installation
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-22 01:33:26 -05:00
Aaron Pham
9f84b8b945 infra: bump to homebrew tap release to 0.4.23 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-22 06:26:20 +00:00
Aaron Pham
1df549f76a infra: bump to dev version of 0.4.24.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-22 06:25:32 +00:00
Aaron Pham
85e03a4b92 infra: prepare for release 0.4.23 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.23
2023-11-22 06:16:49 +00:00
Aaron Pham
38b7c44df0 fix(base-image): update base image to include cuda for now (#720)
* fix(base-image): update base image to include cuda for now

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* fix: build core and client on release images

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore: cleanup style changes

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-22 01:15:19 -05:00
Aaron Pham
8bb2742a9a chore(types): append additional types change (#719)
* chore(types): append additional types change

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

* chore: add arguments for parsing dir

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-21 22:38:20 -05:00
Aaron Pham
04ef08a7f8 chore(strategy): compact and add stubs (#718)
generate service_vars automatically inline without reading from files

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-21 21:49:28 -05:00
Aaron Pham
909db8c3bf refactor: reduce compiled cacheline
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-22 02:27:42 +00:00
Aaron Pham
77bd6f090a chore(logger): fix warnings and streamline style (#717)
Sorry but there are too much wasted spacing in `_llm.py`, and I'm unhappy and not productive anytime I look or want to do anything with it

---------

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-11-21 18:54:51 -05:00
Aaron Pham
d53cf234bd fix(api-server): correct set generation from LLM class
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-21 10:38:36 +00:00
Aaron Pham
2821e172ef fix(examples): use non-chat models
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-21 10:12:48 +00:00
Aaron
93709f1d66 fix(infra): remove unless exclude
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-21 05:04:47 -05:00
Aaron
14242a7ab8 fix(utils): correct import
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-21 05:03:20 -05:00
Aaron Pham
c33b071ee4 refactor: delete unused code (#716)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-21 04:39:48 -05:00
Aaron Pham
a8a9f154ce fix(ci): tests (#715)
* fix: tests

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

* chore: remove broken tests

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-21 03:05:22 -05:00
Aaron Pham
e70246ca5d feat(generation): add support for eos_token_id (#714)
chore: add support for custom eos_token_id

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-21 02:01:36 -05:00
Aaron Pham
fde78a2c78 chore: cleanup unused prompt templates (#713)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-21 01:56:51 -05:00
Aaron Pham
e6b9a749a4 infra: bump to homebrew tap release to 0.4.22 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-21 01:50:09 +00:00
Aaron Pham
be7a4bf576 infra: bump to dev version of 0.4.23.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-21 01:49:02 +00:00
Aaron Pham
f3fd32d596 infra: prepare for release 0.4.22 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.22
2023-11-21 01:38:46 +00:00
Aaron Pham
ad4f388c98 refactor: update runner helpers and add max_model_len (#712)
* chore(runner): cleanup unecessary checks for runnable backend

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore: saving llm reference to runner

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore: correct inject item

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore: update support for max_seq_len

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* fix: correct max_model_len

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore: update and warning backward compatibility

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

* chore: remove unused sets

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

---------

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-20 20:37:15 -05:00
Aaron Pham
8fc5f1f70c infra: bump to homebrew tap release to 0.4.21 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-20 22:50:00 +00:00
Aaron Pham
aac92f40ac infra: bump to dev version of 0.4.22.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-20 22:48:59 +00:00
Aaron Pham
4c4bc82a47 infra: prepare for release 0.4.21 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.21
2023-11-20 22:32:44 +00:00
Aaron
00e2666e48 fix(build): contraint packages for bentoml >1.1.10
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-20 17:30:38 -05:00
dependabot[bot]
1f2cdc8021 chore(deps): bump github/codeql-action from 2.22.5 to 2.22.7 (#707)
Bumps [github/codeql-action](https://github.com/github/codeql-action) from 2.22.5 to 2.22.7.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](74483a38d3...66b90a5db1)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-11-20 17:22:49 -05:00
dependabot[bot]
62da3a7fe5 chore(deps): bump docker/build-push-action from 5.0.0 to 5.1.0 (#708)
Bumps [docker/build-push-action](https://github.com/docker/build-push-action) from 5.0.0 to 5.1.0.
- [Release notes](https://github.com/docker/build-push-action/releases)
- [Commits](0565240e2d...4a13e500e5)

---
updated-dependencies:
- dependency-name: docker/build-push-action
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-11-20 17:22:41 -05:00
dependabot[bot]
b1ee1ce2a1 chore(deps): bump taiki-e/install-action from 2.21.11 to 2.21.17 (#709)
Bumps [taiki-e/install-action](https://github.com/taiki-e/install-action) from 2.21.11 to 2.21.17.
- [Release notes](https://github.com/taiki-e/install-action/releases)
- [Changelog](https://github.com/taiki-e/install-action/blob/main/CHANGELOG.md)
- [Commits](4d8504289a...ccc14bdc8d)

---
updated-dependencies:
- dependency-name: taiki-e/install-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-11-20 17:22:30 -05:00
pre-commit-ci[bot]
d7bb0ab197 ci: pre-commit autoupdate [pre-commit.ci] (#711)
updates:
- [github.com/astral-sh/ruff-pre-commit: v0.1.5 → v0.1.6](https://github.com/astral-sh/ruff-pre-commit/compare/v0.1.5...v0.1.6)
- [github.com/pre-commit/mirrors-eslint: v8.53.0 → v8.54.0](https://github.com/pre-commit/mirrors-eslint/compare/v8.53.0...v8.54.0)

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-11-20 17:22:19 -05:00
Aaron Pham
c25f69fddc infra: bump to homebrew tap release to 0.4.20 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-20 22:19:49 +00:00
Aaron Pham
5491fe30b0 infra: bump to dev version of 0.4.21.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-11-20 22:18:14 +00:00
Aaron Pham
204cbd43d2 infra: prepare for release 0.4.20 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.20
2023-11-20 22:09:47 +00:00
Aaron
f753662ae6 fix(build): only load model when eager is True
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-11-20 17:06:25 -05:00