mirror of
https://github.com/bentoml/OpenLLM.git
synced 2026-05-04 13:52:46 -04:00
Fixes quantization_config and low_cpu_mem_usage so that they are available only on the PyTorch implementation. See the changelog for more details on #28.