Abstract: Recently, neural ordinary differential equations (ODE) models trained with flow matching have achieved impressive performance on the zero-shot voice clone task. Nevertheless, postulating ...
Abstract: This paper introduces PFlow-VC, a conditional flow matching voice conversion model that leverages fine-grained discrete pitch tokens and target speaker prompt information for expressive ...
NVIDIA GPU with 24GB+ VRAM (e.g., RTX 4090, L40S, A100) Docker with NVIDIA Container Toolkit Or: Python 3.10+, CUDA 12.x ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results