Aaron Pham
|
8c2867d26d
|
style: define experimental guidelines (#168)
|
2023-07-31 07:54:26 -04:00 |
|
Aaron Pham
|
ef94c6b98a
|
feat(container): vLLM build and base image strategies (#142)
|
2023-07-31 02:44:52 -04:00 |
|
aarnphm-ec2-dev
|
e4ac0ed8b7
|
fix(cuda): support loading in single GPU
add available_devices for getting # of available GPUs
Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>
|
2023-07-21 08:10:01 +00:00 |
|
Aaron Pham
|
c7f4dc7bb2
|
feat(test): snapshot testing (#107)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
|
2023-07-10 17:23:19 -04:00 |
|
Aaron Pham
|
db1494a6ae
|
feat(start): starting bento and fix load (#80)
|
2023-06-27 12:45:17 -04:00 |
|
Aaron Pham
|
74fdd5e259
|
feat: release binary distribution (#66)
|
2023-06-25 10:38:03 -04:00 |
|