facebook/wav2vec2-base-960h · Hugging Face

By huggingface - 2021-02-08

Description

We’re on a journey to solve and democratize artificial intelligence through natural language.

Summary

  • We show for the first time that learning powerful representations from speech audio alone followed by fine-tuning on transcribed speech can outperform the best semi-supervised methods while being conceptually simpler.
  • wav2vec 2.0 masks the speech input in the latent space and solves a contrastive task defined over a quantization of the latent representations which are jointly learned.
  • Experiments using all the labeled data of LibriSpeech achieve 1.8/3.3 WER on the clean/other test sets.
  • Lowering the labeled data to as little as ten minutes (with pre-training on 53k hours of unlabeled audio) still yields 4.8/8.2 WER, demonstrating the feasibility of speech recognition with limited amounts of labeled data (a usage sketch for this checkpoint follows this list).
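
The bullets above summarize the pre-training objective; the checkpoint hosted on this page is the base model fine-tuned on 960 hours of LibriSpeech for CTC transcription. The snippet below is a minimal usage sketch following the standard transformers pattern for such a checkpoint; it assumes torch, transformers, and soundfile are installed, and "sample.flac" is a hypothetical placeholder for a local 16 kHz mono recording.

```python
# Minimal sketch: transcribing a 16 kHz audio file with facebook/wav2vec2-base-960h.
# Assumes `torch`, `transformers`, and `soundfile` are installed;
# "sample.flac" is a hypothetical local file containing 16 kHz mono speech.
import soundfile as sf
import torch
from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

processor = Wav2Vec2Processor.from_pretrained("facebook/wav2vec2-base-960h")
model = Wav2Vec2ForCTC.from_pretrained("facebook/wav2vec2-base-960h")

speech, sample_rate = sf.read("sample.flac")  # model expects 16 kHz mono input

inputs = processor(speech, sampling_rate=sample_rate, return_tensors="pt", padding=True)
with torch.no_grad():
    logits = model(inputs.input_values).logits  # frame-level character logits

# Greedy CTC decoding: argmax per frame, then collapse repeats and blanks.
predicted_ids = torch.argmax(logits, dim=-1)
transcription = processor.batch_decode(predicted_ids)[0]
print(transcription)
```

Greedy argmax decoding is the simplest option; batch_decode collapses repeated tokens and blanks into text, and a language-model-backed decoder can lower WER further but is not required for basic use.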

 

Topics

  1. Backend (0.3)
  2. Database (0.15)
  3. Machine Learning (0.14)
