Commit Graph

503 Commits

Author SHA1 Message Date
aarnphm-ec2-dev
d4f3cf8b75 fix(llm): ignore quantization config when --quantize int4 is passed
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-22 22:45:46 +00:00
aarnphm-ec2-dev
6f4c58175d chore(llm): add envvar for making tag
the envvar isd OPENLLM_USE_LOCAL_LATEST

Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-22 21:37:19 +00:00
Aaron Pham
57a0fec247 infra: bump to dev version of 0.2.7.dev0 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
2023-07-22 21:31:19 +00:00
Aaron Pham
71689e506d infra: prepare for release 0.2.6 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.2.6
2023-07-22 21:19:04 +00:00
Aaron Pham
19f20c7dad perf(serialisation): implement wrapper to reduce callstack (#132) 2023-07-22 17:15:03 -04:00
Aaron
ecf31e90b7 chore(configuration): remove unused call
to remove one call in the call stack

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-22 15:25:44 -04:00
Aaron Pham
31bd2fe31b chore(ci): better release flow (#131) 2023-07-21 21:53:57 -04:00
aarnphm-ec2-dev
beb8c2bb08 fix(ft): set report_to none to avoid wandb setup
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-21 18:33:38 +00:00
Aaron
8a42832360 chore(ci): simplify bump dev workflow [skip ci]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-21 14:20:30 -04:00
Aaron
5d2dd470d0 infra: bump to dev version of 0.2.6.dev0 [generated] [skip ci]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-21 14:16:48 -04:00
Aaron Pham
d49ff95f7f infra: prepare for release 0.2.5 [generated] [skip ci]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.2.5
2023-07-21 17:59:21 +00:00
Aaron Pham
81b0451685 feat(cli): query with per request instruction (#130) 2023-07-21 13:57:21 -04:00
aarnphm-ec2-dev
39f7725870 fix(ci): skip running actions on generated commit
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-21 15:48:22 +00:00
aarnphm-ec2-dev
aa32bfcc4d infra: bump to dev version of 0.2.5.dev0 [generated]
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-21 15:45:13 +00:00
Aaron Pham
6b61217523 infra: prepare for release 0.2.4 [generated]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.2.4
2023-07-21 08:21:36 +00:00
aarnphm-ec2-dev
e4ac0ed8b7 fix(cuda): support loading in single GPU
add available_devices for getting # of available GPUs

Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-21 08:10:01 +00:00
aarnphm-ec2-dev
8eb143ff60 fix(ci): remove unecessary semver check for release notes [skip ci]
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-21 06:31:49 +00:00
aarnphm-ec2-dev
033358a991 infra: bump to dev version of 0.2.4.dev0 [generated]
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-21 06:31:49 +00:00
Aaron Pham
e5cada218a infra: prepare for release 0.2.3 [generated]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.2.3
2023-07-21 03:39:34 +00:00
Aaron
9ccbd60584 revert: include configuration to labels
This is used for starting up the bento

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-20 23:37:25 -04:00
aarnphm-ec2-dev
f91e750fcd fix(build): remove configuration from labels
labels will only include model_id for it to work with bentocloud

Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-21 03:30:59 +00:00
Aaron
347ffaadbe chore(playground): generate default dir to not set
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-20 21:25:03 -04:00
aarnphm-ec2-dev
f5b1c8ec1b fix(ft): correct set epochs args for TrainingArguments
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-21 01:20:56 +00:00
Aaron Pham
ee7fa63a50 ci: using tokens for publishing (#129) 2023-07-20 21:14:14 -04:00
Aaron
11f88b24ca infra: bump to dev version of 0.2.3.dev0 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-20 21:03:02 -04:00
Aaron Pham
16118dd28f infra: prepare for release 0.2.2 [generated]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.2.2
2023-07-21 00:46:36 +00:00
Aaron Pham
f56f8ee782 feat: fine-tuning script for LlaMA 2 (#128) 2023-07-20 20:44:51 -04:00
Aaron
c101103d37 infra: bump to dev version of 0.2.2.dev0 [generated]
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-20 18:51:00 -04:00
Aaron Pham
804b30adc4 infra: prepare for release 0.2.1 [generated]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.2.1
2023-07-20 22:38:27 +00:00
Aaron
5189d2e721 fix(script): correct patch version for __about__.py
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-20 18:35:31 -04:00
aarnphm-ec2-dev
ea07ff6ce9 fix(llama): loose requirements for running llama in container
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-20 22:07:14 +00:00
aarnphm-ec2-dev
c88950655c fix(ci): make sure to run publish and dev prep correctly
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-20 22:05:05 +00:00
aarnphm-ec2-dev
b31cd0460b fix: correct tag inference for model-id
in the case of build, the model_id is passed as a full valid tag under
bento store

XXX: We will need to fix this later

Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-20 21:40:56 +00:00
aarnphm-ec2-dev
3e50f0a851 fix(cli): implement latest bentoml 1.0.25 features
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-20 20:51:27 +00:00
Aaron Pham
858c2007c3 feat: revision parsed via model_id (#126) 2023-07-20 14:36:53 -04:00
aarnphm-ec2-dev
a056365d48 fix(ci): always run create coverage
this is to stop evergreen to fail on main

Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-20 13:01:32 +00:00
Aaron Pham
1b3508619e feat(llama): add default prompt for LlaMA-2 (#122) 2023-07-20 07:46:33 -04:00
aarnphm-ec2-dev
ee3a00514a infra: bump to dev version of 0.2.1dev0 [generated]
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-20 00:02:45 +00:00
aarnphm-ec2-dev
5f874da4e2 fix: broken logics for upload
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 23:55:35 +00:00
Aaron Pham
f9ca164e73 infra: prepare for release 0.2.0 [generated]
Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
v0.2.0
2023-07-19 23:43:52 +00:00
aarnphm-ec2-dev
292bca68c7 fix(ci): releas script correctly parse version
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 23:41:33 +00:00
aarnphm-ec2-dev
4beb040cfd chore(ci): remove macos packaging
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 23:35:49 +00:00
aarnphm-ec2-dev
e69042a8e9 fix(script): not gpg signing tag
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 23:12:04 +00:00
aarnphm-ec2-dev
9d28e0a1e6 fix(ci): disable gpg push
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 23:09:27 +00:00
aarnphm-ec2-dev
b747e3b4b8 fix(ci): remove signing
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 23:01:53 +00:00
Aaron
d92b136780 chore(llama): remove decapoda vairants
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-19 18:58:04 -04:00
Aaron
b6679be301 fix: release script not to signed
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
2023-07-19 18:48:01 -04:00
aarnphm-ec2-dev
8b340559aa fix(tests): skip running models tests on CI
The runners don't have enough space to run all tests

Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 22:40:40 +00:00
aarnphm-ec2-dev
e319a2977f fix(ci): editable install
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
2023-07-19 22:29:29 +00:00
Aaron Pham
c1ddb9ed7c feat: GPTQ + vLLM and LlaMA (#113)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2023-07-19 18:12:12 -04:00