OpenLLM

mirror of https://github.com/bentoml/OpenLLM.git synced 2026-03-06 16:16:37 -05:00

Author	SHA1	Message	Date
Aaron Pham	fddd0bf95e	feat: bootstrap documentation site (#252 ) Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com> Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com> Signed-off-by: GutZuFusss <leon.ikinger@googlemail.com> Co-authored-by: GutZuFusss <leon.ikinger@googlemail.com>	2023-09-12 12:28:29 -04:00
aarnphm-ec2-dev	f917898867	chore: update langchain examples Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>	2023-09-08 03:35:10 +00:00
Aaron Pham	b7af7765d4	fix(yapf): align weird new lines break [generated] [skip ci] (#284 ) fix(yapf): align weird new lines break Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>	2023-09-01 05:34:22 -04:00
Aaron	b545ad2ad1	style: google Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>	2023-08-30 13:52:35 -04:00
Aaron Pham	c9cef1d773	fix: persistent styling between ruff and yapf (#279 )	2023-08-30 11:37:41 -04:00
aarnphm-ec2-dev	806a663e4a	chore(style): add one blank line to conform with Google style Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>	2023-08-26 11:36:57 +00:00
Aaron Pham	46c8904806	cron(style): run formatter [generated] [skip ci] (#257 )	2023-08-25 06:38:59 -04:00
Aaron	6b0ab17018	chore: remove unnecessary headers Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>	2023-08-15 18:15:54 -04:00
Aaron Pham	5329853b10	perf: compiled modules and enable lazyeval (#200 )	2023-08-11 05:53:45 -04:00
Aaron Pham	8c2867d26d	style: define experimental guidelines (#168 )	2023-07-31 07:54:26 -04:00
Aaron Pham	ef94c6b98a	feat(container): vLLM build and base image strategies (#142 )	2023-07-31 02:44:52 -04:00
Aaron Pham	c7f4dc7bb2	feat(test): snapshot testing (#107 ) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2023-07-10 17:23:19 -04:00
Aaron Pham	5a4df53490	fix(load): tokenizer and adapter within a BentoLLM (#88 )	2023-06-28 15:45:25 -04:00
Aaron Pham	98328be394	peft(models): improve implementation (#60 ) If you have a local Dolly-V2 version, please do `openllm prune`	2023-06-24 05:22:18 -04:00
Aaron Pham	03758a5487	fix(tools): adhere to style guidelines (#31 )	2023-06-18 20:03:17 -04:00
Aaron Pham	ded8a9f809	feat: quantization (#27 )	2023-06-16 18:10:50 -04:00
Aaron Pham	19bc7e3116	feat: fine-tuning [part 1] (#23 )	2023-06-16 00:19:01 -04:00
Chaoyu	dc50a2e7e5	docs: add LangChain and BentoML Examples (#25 ) Co-authored-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>	2023-06-15 06:14:37 -04:00
Aaron	ec941c95d5	chore: add license header Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>	2023-06-04 16:22:37 -07:00
Aaron	78358dbb8d	fix(type): configuration and dependencies Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>	2023-05-28 06:01:30 -07:00
Aaron	b7f3a10910	refactor: migrate __init_subclass__ to Metaclass LLMMetaclass will now responsible for generate internal attributes add llm_type and identifying_params to Runnable class subclass of openllm.LLM now can set a class attribute __openllm_internal__ to let openllm knows that this is an internal class implementation, instead of providing a _internal in the class initialization. support for preprocess_parameters and postprocess_parameters on client side for better client UX Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com> Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com> Signed-off-by: aarnphm-ec2-dev <29749331+aarnphm@users.noreply.github.com>	2023-05-27 03:09:45 +00:00
Aaron	d31d450526	feat: Adding central service definition and init openllm_client Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>	2023-05-15 00:33:05 -07:00
Chaoyu	dd8b6050b2	feat: FLAN-T5 supports - add infrastructure, to be implemented: cache, chat history - Base Runnable Implementation, that fits LangChain API - Added a Prompt descriptor and utils. feat: license headers and auto factory impl and CLI Auto construct args from pydantic config Add auto factory for ease of use only provide `/generate` to streamline UX experience CLI > envvar > input contract for configuration fix: serve from a thread fix CLI args chore: cleanup names and refactor imports Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>	2023-05-03 17:50:14 -07:00

23 Commits