Skip to content

Conversation

@yqzhishen
Copy link
Collaborator

  • In normal procedure, SHFC only shifts the pitch on variance predictors:
    • variance predictor (shifted pitch) => acoustic model (original pitch) => vocoder (original pitch)
    • If the voicebank has no variance model, SHFC is a no-op.
  • With pitch controllable vocoders, SHFC shifts pitch for both variance predictors and acoustic models, meaning synthesizing on shifted pitch and then shift back to original pitch with styles and formants unchanged using the vocoder:
    • variance predictor (shifted pitch) => acoustic model (shifted pitch) => vocoder (original pitch)

@stakira stakira merged commit 43ea25e into stakira:master Feb 24, 2025
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants