OpenLLM

mirror of https://github.com/bentoml/OpenLLM.git synced 2026-01-23 15:01:32 -05:00

Commit Graph

ef40fdf5c8 fix(build): quote environment variables aarnphm-ec2-dev 2023-06-21 11:28:37 +00:00
de665def5c fix(cli): support loading model-id from local path Aaron 2023-06-21 07:25:13 -04:00
84466c2827 fix(infra): move lines to placeholder Aaron Pham 2023-06-20 21:50:21 -04:00
e69d3f9ca0 chore: update bug-report Aaron Pham 2023-06-20 21:49:35 -04:00
9c6b43b163 docs: rename camel case to official Hugging Face name (#39) Ikko Eltociear Ashimine 2023-06-21 01:02:03 +09:00
d33149d758 fix(log): repr the given LLMConfig in debug mode aarnphm-ec2-dev 2023-06-19 18:19:51 +00:00
ca802d9d1a fix: agent log (#37) Aaron Pham 2023-06-19 14:11:39 -04:00
78a537079e infra: bump to dev version of 0.1.9.dev0 [generated] Aaron Pham [bot] 2023-06-19 18:08:32 +00:00
70c7c0a9b7 fix(cli): use correct API for client aarnphm-ec2-dev 2023-06-19 18:04:27 +00:00
6bbbefd06a infra: prepare for release 0.1.8 [generated] v0.1.8 Aaron Pham [bot] 2023-06-19 18:02:08 +00:00
6d43bdbcdb fix(instruct): remove breakpoint aarnphm-ec2-dev 2023-06-19 17:59:44 +00:00
9139e6f290 docs: update README to use OPT as example Aaron 2023-06-19 13:40:10 -04:00
0e3f8d2fba infra: bump to dev version of 0.1.8.dev0 [generated] Aaron Pham [bot] 2023-06-19 17:30:50 +00:00
9a6af97356 infra: prepare for release 0.1.7 [generated] v0.1.7 Aaron Pham [bot] 2023-06-19 17:27:52 +00:00
752c2e60a5 fix: remove direct url reference Aaron 2023-06-19 13:25:29 -04:00
feb0c53146 fix(timeout): increase default timeout to avoid asyncio error aarnphm-ec2-dev 2023-06-19 17:01:54 +00:00
58758f8241 fix(dolly_v2): gc collect after import Aaron 2023-06-19 12:28:13 -04:00
4f1fee4bee fix(ci): install towncrier for changelog automation Aaron 2023-06-19 06:32:15 -04:00
1ed0ae7787 fix(log): make sure to configure OpenLLM logs correctly Aaron 2023-06-19 06:16:08 -04:00
2244cce5bd fix(config): __getitem__ to get the value instead of member of class Aaron 2023-06-19 05:34:49 -04:00
622a2fb37d fix: separate hatch config Aaron 2023-06-19 03:29:20 -04:00
e3fad40f21 fix(env): make tests with extra-dependencies Aaron 2023-06-18 23:58:03 -04:00
03758a5487 fix(tools): adhere to style guidelines (#31) Aaron Pham 2023-06-18 20:03:17 -04:00
a7a6775c68 chore: add banner for OpenLLM Aaron 2023-06-18 05:55:38 -04:00
33d3523e5b chore(readme): update docs and warning notes Aaron 2023-06-18 01:39:15 -04:00
4fcd7c8ac9 integration: HuggingFace Agent (#29) Aaron Pham 2023-06-18 00:13:53 -04:00
fe8da4e8a9 fix(tests): ensure_available on tests aarnphm-ec2-dev 2023-06-17 15:12:28 +00:00
8bd7351d3c chore: update new gif Aaron 2023-06-17 10:26:34 -04:00
5a6f42ee99 infra: fix generated release link for towncrier [skip ci] Aaron 2023-06-17 09:19:46 -04:00
9be65a813b infra: bump to dev version of 0.1.7.dev0 [generated] Aaron Pham [bot] 2023-06-17 13:12:46 +00:00
ed398c38f8 infra: prepare for release 0.1.6 [generated] v0.1.6 Aaron Pham [bot] 2023-06-17 13:02:47 +00:00
6f724416c0 perf: build quantization and better transformer behaviour (#28) Aaron Pham 2023-06-17 08:56:14 -04:00
233d4697b5 chore: update __all__ to take into _extra_objects Aaron 2023-06-16 18:13:23 -04:00
ded8a9f809 feat: quantization (#27) Aaron Pham 2023-06-16 18:10:50 -04:00
19bc7e3116 feat: fine-tuning [part 1] (#23) Aaron Pham 2023-06-16 00:19:01 -04:00
b9ff4ab92a chore: flatten examples llm-config Aaron 2023-06-15 18:39:33 -04:00
e4b7714756 chore(js): update metadata Aaron 2023-06-15 13:18:05 -04:00
850cf791ef chore: fix README.md Aaron Pham 2023-06-15 09:37:46 -04:00
dc50a2e7e5 docs: add LangChain and BentoML Examples (#25) Chaoyu 2023-06-15 03:14:37 -07:00
5e1445218b refactor: toplevel CLI (#26) Aaron Pham 2023-06-15 02:32:46 -04:00
9a6a976ce1 infra: bump to dev version of 0.1.6.dev0 [generated] Aaron Pham [bot] 2023-06-15 06:16:12 +00:00
bb425b89d9 infra: prepare for release 0.1.5 [generated] v0.1.5 Aaron Pham [bot] 2023-06-15 06:05:35 +00:00
528f76e1d0 fix(client): using httpx for running calls within async context Aaron 2023-06-15 01:58:49 -04:00
b3d924e6d6 fix(dolly): make sure to use GPU when available aarnphm-ec2-dev 2023-06-15 05:52:25 +00:00
dfe71d7867 chore(cli): redirect download models into subcontext aarnphm-ec2-dev 2023-06-14 11:44:39 +00:00
d7e92ae525 feat(cli): --device all --workers-per-resource Aaron 2023-06-14 06:36:54 -04:00
d07cc95ea0 ci: add hatch to dev envs Aaron 2023-06-14 03:46:42 -04:00
123d9c442f infra: bump to dev version of 0.1.5.dev0 [generated] Aaron Pham [bot] 2023-06-14 07:43:54 +00:00
f9c0a1093b infra: prepare for release 0.1.4 [generated] v0.1.4 Aaron Pham [bot] 2023-06-14 07:33:16 +00:00
be41c23c10 codegen: remove black as dependencies Aaron 2023-06-14 03:22:05 -04:00
50d59cdf8d types: rename interface Aaron 2023-06-14 02:45:15 -04:00
47da1916ad infra: bump to dev version of 0.1.4.dev0 [generated] Aaron Pham [bot] 2023-06-14 05:56:49 +00:00
52d786edc7 infra: prepare for release 0.1.3 [generated] v0.1.3 Aaron Pham [bot] 2023-06-14 05:46:29 +00:00
111d205f63 perf: faster LLM loading Aaron 2023-06-14 01:36:42 -04:00
ebcedc35de fix(exception): handle notfound explicitly Aaron 2023-06-13 20:15:38 -04:00
0ab7450e90 chore(types): add hints for LLMRunner Aaron 2023-06-13 20:13:33 -04:00
03c90c2a13 fix(llm): ensure we don't bleed runner options Aaron 2023-06-13 20:05:33 -04:00
e3ccf766d7 chore: expose LLMRunner for type Aaron 2023-06-13 19:47:36 -04:00
1194684658 fix(llm): cached load aarnphm-ec2-dev 2023-06-13 14:22:09 +00:00
74c8323e42 docs: update generated with href Aaron 2023-06-13 07:30:43 -04:00
ece2b377c0 infra: bump to dev version of 0.1.3.dev0 [generated] Aaron Pham [bot] 2023-06-13 11:24:14 +00:00
398ed85b9b infra: prepare for release 0.1.2 [generated] v0.1.2 Aaron Pham [bot] 2023-06-13 11:14:25 +00:00
cb76a894cf feat(metadata): add configuration to metadata endpoint Aaron 2023-06-13 07:09:13 -04:00
dd20941050 chore: metadata (#19) Aaron Pham 2023-06-13 04:09:33 -04:00
764d86289c chore(readme): update table with model_ids matrix Aaron 2023-06-12 16:57:24 -04:00
b5547bbc97 infra: bump to dev version of 0.1.2.dev0 [generated] Aaron Pham [bot] 2023-06-12 20:30:48 +00:00
f85bbec147 infra: prepare for release 0.1.1 [generated] v0.1.1 Aaron Pham [bot] 2023-06-12 20:19:34 +00:00
71070b90b4 chore(metadata): fix model_id to be respected on service.py Aaron 2023-06-12 16:04:52 -04:00
4717989384 fix(tokenizers): allow forking by default Aaron 2023-06-12 15:41:19 -04:00
aa8812cf90 fix(build): empty model_id Aaron 2023-06-12 14:29:08 -04:00
30a8c32a53 infra: bump to dev version of 0.1.1.dev0 [generated] Aaron 2023-06-12 14:31:20 -04:00
53a63dbe78 infra: prepare for release 0.1.0 v0.1.0 Aaron 2023-06-12 14:23:26 -04:00
f8ebb36e15 tests: fastpath (#17) Aaron Pham 2023-06-12 14:18:26 -04:00
187a5f834f docs: add --model-id command (#18) Chaoyu 2023-06-12 11:03:36 -07:00
d3bbb727ea doc: add gif to readme Jian Shen 2023-06-12 15:51:08 +08:00
0fc209da72 chore: bump up dependencies for BentoML Aaron 2023-06-12 01:26:25 -04:00
f8e99dd8f5 chore(configuration): clean house implementation Aaron 2023-06-11 18:45:20 -04:00
1847209489 feat(cli): --workers aarnphm-ec2-dev 2023-06-11 15:50:56 +00:00
81d46ca211 feat(type): support annotations aarnphm-ec2-dev 2023-06-11 14:58:17 +00:00
2e453fb005 refactor(configuration): __config__ and perf aarnphm-ec2-dev 2023-06-11 12:53:15 +00:00
17241292da feat(cli): show runtime implementation aarnphm-ec2-dev 2023-06-11 05:29:11 +00:00
06c90c0ba3 docs: update matrix [generated] Aaron 2023-06-11 00:47:14 -04:00
3177781e50 infra: bump to dev version of 0.0.35.dev0 [generated] Aaron Pham [bot] 2023-06-11 04:45:24 +00:00
0552b32456 infra: prepare for release 0.0.34 [generated] v0.0.34 Aaron Pham [bot] 2023-06-11 04:35:30 +00:00
a5efb7fcb1 fix(stablelm): running on GPU aarnphm-ec2-dev 2023-06-11 04:28:22 +00:00
8762a56093 revert: broken KeyboardInterrupt change aarnphm-ec2-dev 2023-06-11 04:20:07 +00:00
512cd0715c feat(service): implementing with lifecycle hooks aarnphm-ec2-dev 2023-06-11 04:14:18 +00:00
5a7942574f chore(docs): update docs for to_runner aarnphm-ec2-dev 2023-06-11 03:38:56 +00:00
6a937d8b51 feat(scheduling): custom GPU offload strategy Aaron 2023-06-10 22:57:54 -04:00
b22468e8c4 feat(cli): openllm models --show-available Aaron 2023-06-10 20:45:40 -04:00
7d71246322 fix(stablelm): load with BetterTransformers on CPU only Aaron 2023-06-10 20:45:05 -04:00
204a7ab7c9 revert(starcoder): quant 8 aarnphm-ec2-dev 2023-06-10 23:17:42 +00:00
bb37f7e238 feat(utils): lazy load modules and fix typo aarnphm-ec2-dev 2023-06-10 22:18:37 +00:00
05fa34f9e6 refactor: pretrained => model_id Aaron 2023-06-10 17:36:02 -04:00
4841051fc5 feat(stablelm): CPU inference Aaron 2023-06-10 07:53:29 -04:00
53296111d0 fix(gpu): enable device_map 'auto' to multi-gpu setup only aarnphm-ec2-dev 2023-06-10 11:38:31 +00:00
66a87ef0b7 infra: bump to dev version of 0.0.34.dev0 [generated] Aaron Pham [bot] 2023-06-10 10:19:02 +00:00
56f50deab6 infra: prepare for release 0.0.33 [generated] v0.0.33 Aaron Pham [bot] 2023-06-10 10:09:12 +00:00
2348946ada fix(starcoder): disable quant 8 aarnphm-ec2-dev 2023-06-10 10:01:43 +00:00
4db141c649 feat(gpu): support passing GPU per LLM aarnphm-ec2-dev 2023-06-10 09:47:16 +00:00
