AdaSpeech: Adaptive Text to Speech for Custom Voice

By arXiv.org - 2021-03-05

Description

Custom voice, a specific text to speech (TTS) service in commercial speech platforms, aims to adapt a source TTS model to synthesize personal voice for a target speaker using few speech data. Custom v ...

Summary

Adaptive Text to Speech for Custom Voice Abstract: 1) to support diverse customers, the adaptation model needs to handle diverse acoustic conditions that could be very different from source speech data, and 2) to support a large number of customers, the adaptation parameters need to be small enough for each target speaker to reduce memory usage while maintaining high voice quality.
2) To better trade off the adaptation parameters and voice quality, we introduce conditional layer normalization in the mel-spectrogram decoder of AdaSpeech, and fine-tune this part in addition to speaker embedding for adaptation.
arXiv is committed to these values and only works with partners that adhere to them.

Topics

NLP (0.26)
Backend (0.15)
UX (0.11)

Similar Articles

mT5: A massively multilingual pre-trained text-to-text transformer

By arXiv.org - 2020-10-23

The recent "Text-to-Text Transfer Transformer" (T5) leveraged a unified text-to-text format and scale to attain state-of-the-art results on a wide variety of English-language NLP tasks. In this paper, ...

Generating similes effortlessly like a Pro: A Style Transfer Approach for Simile Generation

By arXiv.org - 2020-10-06

Literary tropes, from poetry to stories, are at the crux of human imagination and communication. Figurative language such as a simile go beyond plain expressions to give readers new insights and inspi ...

Utility is in the Eye of the User: A Critique of NLP Leaderboards

By arXiv.org - 2020-10-01

Benchmarks such as GLUE have helped drive advances in NLP by incentivizing the creation of more accurate models. While this leaderboard paradigm has been remarkably successful, a historical focus on p ...

How to put machine learning models into production

By Stack Overflow Blog - 2020-10-12

The goal of building a machine learning model is to solve a problem, and a machine learning model can only do so when it is in production and actively in use by consumers. As such, model deployment is ...

COMETA: A Corpus for Medical Entity Linking in the Social Media

By arXiv.org - 2020-10-08

Whilst there has been growing progress in Entity Linking (EL) for general language, existing datasets fail to address the complex nature of health terminology in layman's language. Meanwhile, there is ...

Something Every Data Scientist Should Know But Probably Doesn’t: The Bias-Variance Trade-off…

By Medium - 2021-01-04

A groundbreaking and relatively new discovery upends classical statistics with relevant implications for data science practitioners and…