Aaron Pham
|
fa2038f4e2
|
fix: loading correct local models (#599)
* fix(model): loading local correctly
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* chore: update repr and correct bentomodel processor
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* ci: auto fixes from pre-commit.ci
For more information, see https://pre-commit.ci
* chore: cleanup transformers implementation
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* fix: ruff to ignore I001 on all stubs
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2023-11-10 02:36:12 -05:00 |
|
Aaron Pham
|
5e45245457
|
package: add openllm core dependencies to labels (#600)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-10 02:33:55 -05:00 |
|
Aaron Pham
|
665a41940e
|
revert: configuration not to dump flatten (#597)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-09 14:48:23 -05:00 |
|
Aaron Pham
|
d60f2fb909
|
infra: remove tsconfig (#595)
* infra: remove tsconfig
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* ci: auto fixes from pre-commit.ci
For more information, see https://pre-commit.ci
* chore: filter only ec python and jsx
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: update pnpm lock
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: run vendor
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: ignore blame
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: ignore on CI
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2023-11-09 13:06:31 -05:00 |
|
Aaron Pham
|
ac377fe490
|
infra: using ruff formatter (#594)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-09 12:44:05 -05:00 |
|
Aaron Pham
|
021fd453b9
|
infra: move out clojure to external (#593)
As we don't write this
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-09 12:24:18 -05:00 |
|
Aaron Pham
|
b8a2e8cf91
|
refactor(cli): cleanup API (#592)
* chore: remove unused imports
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* refactor(cli): update to only need model_id
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* feat: `openllm start model-id`
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: add changelog
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: update changelog notice
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: update correct config and running tools
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: update backward compat options and treat JSON outputs
corespondingly
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-09 11:40:17 -05:00 |
|
Aaron Pham
|
86f7acafa9
|
infra: bump to homebrew tap release to 0.4.1 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-08 13:36:07 +00:00 |
|
Aaron Pham
|
12858605a0
|
infra: bump to dev version of 0.4.2.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-08 13:34:43 +00:00 |
|
Aaron Pham
|
0d88370127
|
infra: prepare for release 0.4.1 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.1
|
2023-11-08 13:24:46 +00:00 |
|
Aaron Pham
|
e87830ef0a
|
container: update tracing dependencies (#591)
* chore: update build message
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: add tracing dependencies to container
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-08 08:08:40 -05:00 |
|
Aaron Pham
|
0ea025da5a
|
fix(cli): append model-id instruction to build (#590)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-08 07:44:36 -05:00 |
|
Aaron Pham
|
d47b985e5d
|
docs: update quantization notes (#589)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-08 07:40:12 -05:00 |
|
Aaron Pham
|
47107727b3
|
feat(vllm): squeezellm (#588)
* feat(vllm): squeezellm
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* fix: correct import_model with awq and gatekeep squeezellm for PyTorch
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-08 07:21:27 -05:00 |
|
Aaron Pham
|
ff8b6377c8
|
fix(awq): correct awq detection for support (#586)
* fix(awq): correct detection for awq
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* chore: update base docker to work
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* chore: disable awq on pytorch for now
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* ci: auto fixes from pre-commit.ci
For more information, see https://pre-commit.ci
---------
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2023-11-08 06:57:11 -05:00 |
|
Aaron Pham
|
655a4071de
|
fix: unbound variable for completion client (#587)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-08 05:37:39 -05:00 |
|
Aaron Pham
|
387637405d
|
fix(gptq): update config fields (#585)
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-08 05:20:30 -05:00 |
|
Aaron Pham
|
85a7243ac3
|
fix: device imports using strategies (#584)
* fix: device imports using strategies
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* chore: support trust_remote_code for vLLM runners
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-08 05:10:50 -05:00 |
|
Aaron
|
6d81fbbccd
|
fix(infra): add coverage config
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-08 04:08:08 -05:00 |
|
dependabot[bot]
|
35661d7c8c
|
chore(deps): bump sigstore/cosign-installer from 3.1.2 to 3.2.0 (#582)
Bumps [sigstore/cosign-installer](https://github.com/sigstore/cosign-installer) from 3.1.2 to 3.2.0.
- [Release notes](https://github.com/sigstore/cosign-installer/releases)
- [Commits](11086d2504...1fc5bd396d)
---
updated-dependencies:
- dependency-name: sigstore/cosign-installer
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
|
2023-11-08 02:59:49 -05:00 |
|
dependabot[bot]
|
0d76a4c935
|
chore(deps): bump taiki-e/install-action from 2.21.7 to 2.21.8 (#581)
Bumps [taiki-e/install-action](https://github.com/taiki-e/install-action) from 2.21.7 to 2.21.8.
- [Release notes](https://github.com/taiki-e/install-action/releases)
- [Changelog](https://github.com/taiki-e/install-action/blob/main/CHANGELOG.md)
- [Commits](11dea51b35...b4f94d4449)
---
updated-dependencies:
- dependency-name: taiki-e/install-action
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
|
2023-11-08 02:59:34 -05:00 |
|
dependabot[bot]
|
0e1ed391e4
|
chore(deps): bump aquasecurity/trivy-action from 0.13.1 to 0.14.0 (#583)
Bumps [aquasecurity/trivy-action](https://github.com/aquasecurity/trivy-action) from 0.13.1 to 0.14.0.
- [Release notes](https://github.com/aquasecurity/trivy-action/releases)
- [Commits](f78e9ecf42...2b6a709cf9)
---
updated-dependencies:
- dependency-name: aquasecurity/trivy-action
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
|
2023-11-08 02:59:26 -05:00 |
|
Aaron Pham
|
c90d510589
|
infra: disable npm updates and correct python packages (#580)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-08 02:58:05 -05:00 |
|
Aaron Pham
|
ea42108e45
|
chore(service): cleanup API (#579)
* chore(service): cleanup API
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: running tools
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* fix: tests import
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-08 02:53:08 -05:00 |
|
Aaron Pham
|
7398ae0486
|
refactor(strategies): move logics into openllm-python (#578)
fix(strategies): move to openllm
Strategies shouldn't be a part of openllm-core
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-08 02:23:08 -05:00 |
|
Aaron Pham
|
7f46aa3475
|
fix(stubs): update initialisation types (#577)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-08 01:30:46 -05:00 |
|
Aaron Pham
|
97d7c38fea
|
refactor: cleanup typing to expose correct API (#576)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-08 01:24:03 -05:00 |
|
Aaron Pham
|
c40d4c1016
|
fix(examples): correct dependencies in requirements.txt [skip ci] (#575)
Update requirements.txt
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-07 23:02:53 -05:00 |
|
Aaron Pham
|
30d8981916
|
chore(openai): simplify client examples (#574)
* chore(openai): simplify client examples
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* chore: fix spacing
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-07 23:01:14 -05:00 |
|
Aaron Pham
|
cfd09bfc47
|
chore(runner): yield the outputs directly (#573)
update openai client examples to >1
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2023-11-07 22:34:11 -05:00 |
|
Aaron Pham
|
b3c4e204b2
|
infra: bump to homebrew tap release to 0.4.0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-07 22:55:47 +00:00 |
|
Aaron Pham
|
1d5a8d021f
|
infra: bump to dev version of 0.4.1.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-07 22:53:55 +00:00 |
|
Aaron Pham
|
8ffab93d39
|
infra: prepare for release 0.4.0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.4.0
|
2023-11-07 22:42:22 +00:00 |
|
Aaron Pham
|
4d356f4b72
|
feat: Mistral support (#571)
* feat: Mistral support
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
* ci: auto fixes from pre-commit.ci
For more information, see https://pre-commit.ci
* chore: fix style
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
* chore: update README docs about mistral
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
---------
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2023-11-07 17:28:02 -05:00 |
|
Aaron Pham
|
d9a7b6a147
|
fix(client): one-shot generation construction (#570)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-07 17:14:39 -05:00 |
|
Aaron Pham
|
272b185ee7
|
chore(examples): update openai client (#568)
chore: update openai client examples
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-07 16:42:32 -05:00 |
|
Aaron Pham
|
dc27b0e727
|
fix: update build dependencies and format chat prompt (#569)
chore: update correct check and format prompt
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-11-07 16:42:20 -05:00 |
|
Aaron Pham
|
8fade070f3
|
infra: update docs on serving fine-tuning layers (#567)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-06 21:34:44 -05:00 |
|
Aaron
|
b158609e95
|
docs: update BentoML integration
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-06 20:57:51 -05:00 |
|
Aaron
|
4550b2548b
|
docs: update readme for HF agent
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-06 20:55:10 -05:00 |
|
Aaron
|
8aca432df5
|
chore(examples): add logprobs check on PyTorch
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-06 20:52:49 -05:00 |
|
Aaron Pham
|
2287173c47
|
fix(openai): stop is not available in config (#566)
This is now inference arguments, instead of config
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-06 20:47:50 -05:00 |
|
Aaron Pham
|
e2029c934b
|
perf: unify LLM interface (#518)
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
Signed-off-by: paperspace <29749331+aarnphm@users.noreply.github.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2023-11-06 20:39:43 -05:00 |
|
Abhishek
|
f2639879af
|
feat: support toggle TLS verification (#532)
Signed-off-by: Abhishek <59995387+ABHISHEK03312@users.noreply.github.com>
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
Co-authored-by: abhishek <abhishek_vaidyanathan@ensigninfosecurity.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-11-06 13:34:53 -05:00 |
|
dependabot[bot]
|
b4bf5a35e6
|
chore(deps-dev): bump @typescript-eslint/eslint-plugin from 6.9.0 to 6.10.0 (#564)
chore(deps-dev): bump @typescript-eslint/eslint-plugin
Bumps [@typescript-eslint/eslint-plugin](https://github.com/typescript-eslint/typescript-eslint/tree/HEAD/packages/eslint-plugin) from 6.9.0 to 6.10.0.
- [Release notes](https://github.com/typescript-eslint/typescript-eslint/releases)
- [Changelog](https://github.com/typescript-eslint/typescript-eslint/blob/main/packages/eslint-plugin/CHANGELOG.md)
- [Commits](https://github.com/typescript-eslint/typescript-eslint/commits/v6.10.0/packages/eslint-plugin)
---
updated-dependencies:
- dependency-name: "@typescript-eslint/eslint-plugin"
dependency-type: direct:development
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
|
2023-11-06 13:19:52 -05:00 |
|
dependabot[bot]
|
b84a2753c7
|
chore(deps-dev): bump @types/react from 18.2.20 to 18.2.35 (#559)
Bumps [@types/react](https://github.com/DefinitelyTyped/DefinitelyTyped/tree/HEAD/types/react) from 18.2.20 to 18.2.35.
- [Release notes](https://github.com/DefinitelyTyped/DefinitelyTyped/releases)
- [Commits](https://github.com/DefinitelyTyped/DefinitelyTyped/commits/HEAD/types/react)
---
updated-dependencies:
- dependency-name: "@types/react"
dependency-type: direct:development
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
|
2023-11-06 13:17:44 -05:00 |
|
dependabot[bot]
|
28b4166f37
|
chore(deps): bump @vercel/og from 0.5.18 to 0.5.20 (#556)
Bumps @vercel/og from 0.5.18 to 0.5.20.
---
updated-dependencies:
- dependency-name: "@vercel/og"
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
|
2023-11-06 13:17:25 -05:00 |
|
dependabot[bot]
|
0bfa4930e4
|
chore(deps-dev): bump eslint from 8.47.0 to 8.53.0 (#558)
Bumps [eslint](https://github.com/eslint/eslint) from 8.47.0 to 8.53.0.
- [Release notes](https://github.com/eslint/eslint/releases)
- [Changelog](https://github.com/eslint/eslint/blob/main/CHANGELOG.md)
- [Commits](https://github.com/eslint/eslint/compare/v8.47.0...v8.53.0)
---
updated-dependencies:
- dependency-name: eslint
dependency-type: direct:development
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
|
2023-11-06 13:17:13 -05:00 |
|
dependabot[bot]
|
dc5c3a1f16
|
chore(deps): bump taiki-e/install-action from 2.21.3 to 2.21.7 (#561)
Bumps [taiki-e/install-action](https://github.com/taiki-e/install-action) from 2.21.3 to 2.21.7.
- [Release notes](https://github.com/taiki-e/install-action/releases)
- [Changelog](https://github.com/taiki-e/install-action/blob/main/CHANGELOG.md)
- [Commits](285bd10d1c...11dea51b35)
---
updated-dependencies:
- dependency-name: taiki-e/install-action
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
|
2023-11-06 13:17:02 -05:00 |
|
dependabot[bot]
|
711f6fed8e
|
chore(deps): bump aquasecurity/trivy-action from 0.13.0 to 0.13.1 (#562)
Bumps [aquasecurity/trivy-action](https://github.com/aquasecurity/trivy-action) from 0.13.0 to 0.13.1.
- [Release notes](https://github.com/aquasecurity/trivy-action/releases)
- [Commits](b77b85c025...f78e9ecf42)
---
updated-dependencies:
- dependency-name: aquasecurity/trivy-action
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
|
2023-11-06 13:16:54 -05:00 |
|