OpenLLM

mirror of https://github.com/bentoml/OpenLLM.git synced 2026-05-04 13:52:46 -04:00

Files

Aaron Pham 6f724416c0 perf: build quantization and better transformer behaviour (#28 )

Fixes quantization_config and low_cpu_mem_usage to be available on PyTorch implementation only

See changelog for more details on #28

2023-06-17 08:56:14 -04:00

ci.yml

2023-06-17 08:56:14 -04:00

create-releases.yml

2023-06-08 16:03:41 -04:00

release-notes.yml

2023-06-08 16:03:41 -04:00