Abstract: In this paper, we take a step towards jointly modeling automatic speech recognition (STT) and speech synthesis (TTS) in a fully non-autoregressive way. We develop a novel multimodal ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results