infra: prepare for release 0.1.14 [generated]

Signed-off-by: Aaron Pham <29749331+aarnphm@users.noreply.github.com>
This commit is contained in:
Aaron Pham
2023-06-25 17:55:43 +00:00
parent 2fed9eae78
commit e733fd55dc
6 changed files with 42 additions and 29 deletions

View File

@@ -18,6 +18,46 @@ This changelog is managed by towncrier and is compiled at release time.
<!-- towncrier release notes start -->
## [0.1.14](https://github.com/bentoml/openllm/tree/v0.1.14)
### Features
- Added support for standalone binary distribution. Currently works on Linux and
Windows:
The following are supported:
- aarch64-unknown-linux-gnu
- x86_64-unknown-linux-gnu
- x86_64-unknown-linux-musl
- i686-unknown-linux-gnu
- powerpc64le-unknown-linux-gnu
- x86_64-pc-windows-msvc
- i686-pc-windows-msvc
Reverted matrices expansion for CI to all Python version. Now leveraging Hatch
env matrices
[#66](https://github.com/bentoml/openllm/issues/66)
### Bug fix
- Moved implementation of dolly-v2 and falcon serialization to save PreTrainedModel instead of pipeline.
Save dolly-v2 now save the actual model instead of the pipeline abstraction. If you have a Dolly-V2
model available locally, kindly ask you to do `openllm prune` to have the new implementation available.
Dolly-v2 and falcon nows implements some memory optimization to help with loading with lower resources system
Configuration removed field: 'use_pipeline'
[#60](https://github.com/bentoml/openllm/issues/60)
- Remove duplicated class instance of `generation_config` as it should be set via
instance attributes.
fixes tests flakiness and one broken cases for parsing env
[#64](https://github.com/bentoml/openllm/issues/64)
## [0.1.13](https://github.com/bentoml/openllm/tree/v0.1.13)
No significant changes.

View File

@@ -1,8 +0,0 @@
Moved implementation of dolly-v2 and falcon serialization to save PreTrainedModel instead of pipeline.
Save dolly-v2 now save the actual model instead of the pipeline abstraction. If you have a Dolly-V2
model available locally, kindly ask you to do `openllm prune` to have the new implementation available.
Dolly-v2 and falcon nows implements some memory optimization to help with loading with lower resources system
Configuration removed field: 'use_pipeline'

View File

@@ -1,4 +0,0 @@
Remove duplicated class instance of `generation_config` as it should be set via
instance attributes.
fixes tests flakiness and one broken cases for parsing env

View File

@@ -1,15 +0,0 @@
Added support for standalone binary distribution. Currently works on Linux and
Windows:
The following are supported:
- aarch64-unknown-linux-gnu
- x86_64-unknown-linux-gnu
- x86_64-unknown-linux-musl
- i686-unknown-linux-gnu
- powerpc64le-unknown-linux-gnu
- x86_64-pc-windows-msvc
- i686-pc-windows-msvc
Reverted matrices expansion for CI to all Python version. Now leveraging Hatch
env matrices

View File

@@ -1,6 +1,6 @@
{
"name": "openllm",
"version": "0.1.14.dev0",
"version": "0.1.14",
"description": "OpenLLM: Your one stop-and-go solution for serving Large Language Model",
"repository": "git@github.com:llmsys/OpenLLM.git",
"author": "Aaron Pham <29749331+aarnphm@users.noreply.github.com>",

View File

@@ -11,4 +11,4 @@
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
__version__ = "0.1.14.dev0"
__version__ = "0.1.14"