Aaron Pham | 8c2867d26d | style: define experimental guidelines (#168) | 2023-07-31 07:54:26 -04:00
Aaron Pham | ef94c6b98a | feat(container): vLLM build and base image strategies (#142) | 2023-07-31 02:44:52 -04:00
aarnphm-ec2-dev | e4ac0ed8b7 | fix(cuda): support loading in single GPU; add available_devices for getting # of available GPUs; Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com> | 2023-07-21 08:10:01 +00:00
Aaron Pham | fc963c42ce | fix: build isolation (#116); Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> | 2023-07-16 01:52:21 -04:00
Aaron Pham | c7f4dc7bb2 | feat(test): snapshot testing (#107); Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> | 2023-07-10 17:23:19 -04:00
Aaron Pham | db1494a6ae | feat(start): starting bento and fix load (#80) | 2023-06-27 12:45:17 -04:00
Aaron Pham | 74fdd5e259 | feat: release binary distribution (#66) | 2023-06-25 10:38:03 -04:00
Aaron Pham | 3593c764f0 | fix(test): robustness (#64) | 2023-06-24 11:10:07 -04:00
aarnphm-ec2-dev | fe8da4e8a9 | fix(tests): ensure_available on tests; Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com> | 2023-06-17 15:12:28 +00:00
Aaron Pham | 19bc7e3116 | feat: fine-tuning [part 1] (#23) | 2023-06-16 00:19:01 -04:00
Aaron Pham | f8ebb36e15 | tests: fastpath (#17); added fastpath cases for configuration and Flan-T5; fixes respecting model_id into lifecycle hooks; update CLI to cleanup models info | 2023-06-12 14:18:26 -04:00