OpenLLM

mirror of https://github.com/bentoml/OpenLLM.git synced 2026-01-24 23:37:48 -05:00

Files

Aaron Pham 6f724416c0 perf: build quantization and better transformer behaviour (#28 )

Fixes quantization_config and low_cpu_mem_usage to be available on PyTorch implementation only

See changelog for more details on #28

2023-06-17 08:56:14 -04:00

feat: quantization (#27 )

2023-06-16 18:10:50 -04:00

chore: add license header

2023-06-04 16:22:37 -07:00

2023-06-17 08:56:14 -04:00

CODEOWNERS

2023-06-08 17:52:39 -04:00

dependabot.yml

chore: add license header

2023-06-04 16:22:37 -07:00

SECURITY.md

feat: quantization (#27 )

2023-06-16 18:10:50 -04:00