Description
Humans learn language by listening, speaking, writing, reading, and also via interaction with the multimodal real world. Existing language pre-training frameworks show the effectiveness of text-only ...
Summary
- Existing language pre-training frameworks show the effectiveness of text-only self-supervision; in this paper, we explore the idea of a visually-supervised language model.
- We find that the main reason hindering this exploration is the large divergence in magnitude and distributions between the visually-grounded language datasets and pure-language corpora.