Skip to content

PaddleSpeech 1.5.0 Release Note #3996

@luotao1

Description

@luotao1

Full Changelog: 748a5f9...develop 18 contributors

Version Adaptation

Upgrade and adapt PaddleSpeech from Paddle 2.5.1 to Paddle 3.0.0-beta. Address incompatibility issues caused by the new version upgrade of Paddle, perform adaptation development and regression testing on the models in PaddleSpeech, and ensure the suite operates normally without loss of model functionality or accuracy.

  • Ensure the adaptation of 80+ existing models in the demo and example directories.
  • Ensure the adaptation and accuracy alignment of 10+ core models in the example directory.
  • Support the re-export of 20+ dynamic-to-static models using the PIR + predictor approach and ensure successful inference.

New Features

  • Implement the third-party library audio tools used in DAC (Descript-Audio-Codec) training.
  • Reproduce the losses required for DAC training: MultiScaleSTFTLoss, GANLoss, and SISDRLoss.

Bug Fix

Others

  • Clean up dependencies and support using PaddleSpeech in Python>3.8 environments

Acknowledgements

Special thanks to contributors including @wanx7130 @warrentdrew @DrRyanHuang @cchenhaifeng @undefined-ux @zxcd @GreatV @yinfan98 @Liyulingyue @megemini @SuiYunsy @Netrvin @enkilee @tianshuo78520a and others for their support.

New Contributors

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions