
Single Speaker (LJSpeech Dataset)
| Utterance | groundthruth | UDPNet(fsteps:1200 rsteps 8) | UDPNet(fsteps:960 rsteps 8) | UDPNet(fsteps:720 rsteps 8) | UDPNet(fsteps:240 rsteps 8) |
|---|---|---|---|---|---|
| #1 | |||||
| #2 | |||||
| #3 | |||||
| #4 | |||||
| #5 |
Unseen Speakers (VCTK Dataset)
| Utterance | groundthruth | UDPNet(fsteps:1200 rsteps 8) | UDPNet(fsteps:960 rsteps 8) | UDPNet(fsteps:720 rsteps 8) | UDPNet(fsteps:240 rsteps 8) |
|---|---|---|---|---|---|
| #1 | |||||
| #2 | |||||
| #3 | |||||
| #4 | |||||
| #5 |