Fine-Tune Wav2Vec2 for English ASR in Hugging Face with 🤗 Transformers

By huggingface - 2021-03-12

Description

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

Summary

  • Fine-Tune Wav2Vec2 for English ASR with 🤗 Transformers Wav2Vec2 is a pretrained model for Automatic Speech Recognition (ASR) and was released in September 2020 by Alexei Baevski, Michael Auli, and Alex Conneau.
  • The sampling rate hereby plays an important role in that it defines how many data points of the speech signal are measured per second.
  • Without going into too many details, in contrast to the common data collators, this data collator treats the input_values and labels differently and thus applies to separate padding functions on them (again making use of Wav2Vec2's context manager).
  • obj:`int`, `optional`):

 

Topics

  1. NLP (0.28)
  2. Backend (0.18)
  3. Database (0.09)

Similar Articles

Creating the Whole Machine Learning Pipeline with PyCaret

By Medium - 2020-12-03

This tutorial covers the entire ML process, from data ingestion, pre-processing, model training, hyper-parameter fitting, predicting and storing the model for later use. We will complete all these…

K-fold Cross Validation with PyTorch

By MachineCurve - 2021-02-02

Explanations and code examples showing you how to use K-fold Cross Validation for Machine Learning model evaluation/testing with PyTorch.

30 Most Asked Machine Learning Questions Answered

By Medium - 2021-03-18

Machine Learning is the path to a better and advanced future. A Machine Learning Developer is the most demanding job in 2021 and it is going to increase by 20–30% in the upcoming 3–5 years. Machine…