Paper

PRVAE-VC: Non-parallel many-to-many voice conversion with perturbation-resistant variational autoencoder

Kou Tanaka, Hirokazu Kameoka, Takuhiro Kaneko

Audio samples for ATRCORPORA

Re-synthesized
 Source
 Target
Converted
 CVAE-Small (1240k params)
   + beta
   + cc (cycle consistency)
   + ac (auxiliary classifier)
   + pr (perturbation resistance, ours)
Converted
 CVAE-Large (8842k params)
   + beta
   + cc (cycle consistency)
   + ac (auxiliary classifier)
   + pr (perturbation resistance, ours)