Aaron Pham
|
4fae00b68b
|
fix(ci): correct tag for checkout (#150)
|
2023-07-25 14:11:03 -04:00 |
|
Aaron Pham
|
e0a90c8d7e
|
infra: bump to dev version of 0.2.11.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-07-25 17:03:38 +00:00 |
|
Aaron Pham
|
c97d39380c
|
infra: prepare for release 0.2.10 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.2.10
|
2023-07-25 16:49:50 +00:00 |
|
Aaron
|
e000e7d1c6
|
fix(ci): release correct version via git
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-07-25 12:48:19 -04:00 |
|
aarnphm-ec2-dev
|
6dc0bf0b12
|
fix: remove breakpoint on CLI
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-25 16:30:16 +00:00 |
|
aarnphm-ec2-dev
|
b23b59e1c9
|
fix(embeddings): correctly set JSON data via CLI client
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-25 16:26:01 +00:00 |
|
aarnphm-ec2-dev
|
56bf84a760
|
fix(ci): make sure to exclude generated _version.py
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-25 09:55:24 +00:00 |
|
Aaron Pham
|
1940086bec
|
feat(client): embeddings (#146)
|
2023-07-25 05:44:21 -04:00 |
|
Aaron Pham
|
dcd34bd381
|
fix(build): running bento insider container (#141)
Behaviour of `docker run` should be the same with `openllm start`
|
2023-07-25 04:24:28 -04:00 |
|
Aaron Pham
|
afb2d34673
|
docs: update fine tuning model support (#145)
|
2023-07-25 04:21:52 -04:00 |
|
Aaron Pham
|
a80fb4635d
|
docs: remove extraneous whitespace (#144)
|
2023-07-25 04:19:55 -04:00 |
|
Aaron Pham
|
c391717226
|
feat(ci): automatic release semver + git archival installation (#143)
|
2023-07-25 04:18:49 -04:00 |
|
Aaron Pham
|
5635ce8d87
|
infra: bump to dev version of 0.2.10.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-07-24 23:35:04 +00:00 |
|
Aaron Pham
|
fb656164e1
|
infra: prepare for release 0.2.9 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.2.9
|
2023-07-24 23:24:09 +00:00 |
|
aarnphm-ec2-dev
|
084786c898
|
fix(cli): `openllm models` for showing available
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-24 23:00:03 +00:00 |
|
Chaoyu
|
0fe9d83ff3
|
docs: Update README.md (#139)
|
2023-07-24 18:11:20 -04:00 |
|
Aaron Pham
|
60c725a21f
|
ci: release PyPI before building binary (#138)
|
2023-07-24 16:39:51 -04:00 |
|
Aaron Pham
|
e72f0d55f4
|
infra: bump to dev version of 0.2.9.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-07-24 19:58:13 +00:00 |
|
Aaron Pham
|
23a8ae44ed
|
infra: prepare for release 0.2.8 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.2.8
|
2023-07-24 19:44:11 +00:00 |
|
Aaron Pham
|
7eabcd4355
|
feat: vLLM integration for PagedAttention (#134)
|
2023-07-24 15:42:17 -04:00 |
|
dependabot[bot]
|
9afbdc5198
|
chore(deps): update bitsandbytes requirement from <0.40 to <0.42 (#137)
Updates the requirements on [bitsandbytes](https://github.com/TimDettmers/bitsandbytes) to permit the latest version.
- [Release notes](https://github.com/TimDettmers/bitsandbytes/releases)
- [Changelog](https://github.com/TimDettmers/bitsandbytes/blob/main/CHANGELOG.md)
- [Commits](https://github.com/TimDettmers/bitsandbytes/compare/0.32.0...0.41.0)
---
updated-dependencies:
- dependency-name: bitsandbytes
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
|
2023-07-24 07:59:50 +00:00 |
|
aarnphm-ec2-dev
|
4cd0784ee2
|
chore: export generation items for lazy loading
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-23 08:01:55 +00:00 |
|
aarnphm-ec2-dev
|
e2cdd767ef
|
chore(cli): simplify table for `openllm models`
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-23 06:29:58 +00:00 |
|
Aaron Pham
|
693631958a
|
feat(service): provisional API (#133)
|
2023-07-23 02:15:39 -04:00 |
|
Aaron Pham
|
d88b069160
|
infra: bump to dev version of 0.2.8.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-07-23 01:21:32 +00:00 |
|
Aaron Pham
|
b74bea36a7
|
infra: prepare for release 0.2.7 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.2.7
|
2023-07-23 01:10:45 +00:00 |
|
aarnphm-ec2-dev
|
99bb0e4446
|
fix(serialisation): using save_pretrained with import_model
Fix llm_post_init correct wrapper behaviour
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-23 01:07:39 +00:00 |
|
aarnphm-ec2-dev
|
d4f3cf8b75
|
fix(llm): ignore quantization config when --quantize int4 is passed
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-22 22:45:46 +00:00 |
|
aarnphm-ec2-dev
|
6f4c58175d
|
chore(llm): add envvar for making tag
the envvar isd OPENLLM_USE_LOCAL_LATEST
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-22 21:37:19 +00:00 |
|
Aaron Pham
|
57a0fec247
|
infra: bump to dev version of 0.2.7.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
|
2023-07-22 21:31:19 +00:00 |
|
Aaron Pham
|
71689e506d
|
infra: prepare for release 0.2.6 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.2.6
|
2023-07-22 21:19:04 +00:00 |
|
Aaron Pham
|
19f20c7dad
|
perf(serialisation): implement wrapper to reduce callstack (#132)
|
2023-07-22 17:15:03 -04:00 |
|
Aaron
|
ecf31e90b7
|
chore(configuration): remove unused call
to remove one call in the call stack
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-07-22 15:25:44 -04:00 |
|
Aaron Pham
|
31bd2fe31b
|
chore(ci): better release flow (#131)
|
2023-07-21 21:53:57 -04:00 |
|
aarnphm-ec2-dev
|
beb8c2bb08
|
fix(ft): set report_to none to avoid wandb setup
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-21 18:33:38 +00:00 |
|
Aaron
|
8a42832360
|
chore(ci): simplify bump dev workflow [skip ci]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-07-21 14:20:30 -04:00 |
|
Aaron
|
5d2dd470d0
|
infra: bump to dev version of 0.2.6.dev0 [generated] [skip ci]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-07-21 14:16:48 -04:00 |
|
Aaron Pham
|
d49ff95f7f
|
infra: prepare for release 0.2.5 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.2.5
|
2023-07-21 17:59:21 +00:00 |
|
Aaron Pham
|
81b0451685
|
feat(cli): query with per request instruction (#130)
|
2023-07-21 13:57:21 -04:00 |
|
aarnphm-ec2-dev
|
39f7725870
|
fix(ci): skip running actions on generated commit
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-21 15:48:22 +00:00 |
|
aarnphm-ec2-dev
|
aa32bfcc4d
|
infra: bump to dev version of 0.2.5.dev0 [generated]
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-21 15:45:13 +00:00 |
|
Aaron Pham
|
6b61217523
|
infra: prepare for release 0.2.4 [generated]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.2.4
|
2023-07-21 08:21:36 +00:00 |
|
aarnphm-ec2-dev
|
e4ac0ed8b7
|
fix(cuda): support loading in single GPU
add available_devices for getting # of available GPUs
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-21 08:10:01 +00:00 |
|
aarnphm-ec2-dev
|
8eb143ff60
|
fix(ci): remove unecessary semver check for release notes [skip ci]
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-21 06:31:49 +00:00 |
|
aarnphm-ec2-dev
|
033358a991
|
infra: bump to dev version of 0.2.4.dev0 [generated]
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-21 06:31:49 +00:00 |
|
Aaron Pham
|
e5cada218a
|
infra: prepare for release 0.2.3 [generated]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.2.3
|
2023-07-21 03:39:34 +00:00 |
|
Aaron
|
9ccbd60584
|
revert: include configuration to labels
This is used for starting up the bento
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-07-20 23:37:25 -04:00 |
|
aarnphm-ec2-dev
|
f91e750fcd
|
fix(build): remove configuration from labels
labels will only include model_id for it to work with bentocloud
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-21 03:30:59 +00:00 |
|
Aaron
|
347ffaadbe
|
chore(playground): generate default dir to not set
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
|
2023-07-20 21:25:03 -04:00 |
|
aarnphm-ec2-dev
|
f5b1c8ec1b
|
fix(ft): correct set epochs args for TrainingArguments
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-21 01:20:56 +00:00 |
|