Description
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
Summary
- Fine-Tune Wav2Vec2 for English ASR with 🤗 Transformers Wav2Vec2 is a pretrained model for Automatic Speech Recognition (ASR) and was released in September 2020 by Alexei Baevski, Michael Auli, and Alex Conneau.
- The sampling rate hereby plays an important role in that it defines how many data points of the speech signal are measured per second.
- Without going into too many details, in contrast to the common data collators, this data collator treats the input_values and labels differently and thus applies to separate padding functions on them (again making use of Wav2Vec2's context manager).
- obj:`int`, `optional`):