Description
We present a language model that combines a large parametric neural network (i.e., a transformer) with a non-parametric episodic memory component in an integrated architecture. Our model uses extended ...
Summary
- We present a language model that combines a large parametric neural network (i.e., a transformer) with a non-parametric episodic memory component in an integrated architecture.
- We design a gating function to adaptively combine multiple information sources to make a prediction (see the sketch after this list).
- This mechanism allows the model to draw on local context, short-term memory, or long-term memory (or any combination of them) as needed, depending on the context.
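The summary above does not spell out how the gating function is implemented, so the following is only a minimal sketch of one way to adaptively mix three information sources per timestep. The class name `ContextGate`, the softmax-based mixing, and all tensor shapes are assumptions for illustration, not the paper's actual design.

```python
import torch
import torch.nn as nn

class ContextGate(nn.Module):
    """Adaptively mixes local, short-term, and long-term context vectors.

    A learned gate produces per-source weights at each timestep, so the
    model can lean entirely on one source or blend several of them.
    """

    def __init__(self, d_model: int, num_sources: int = 3):
        super().__init__()
        # Gate scores are computed from the concatenated source vectors.
        self.gate = nn.Linear(num_sources * d_model, num_sources)

    def forward(self, local: torch.Tensor, short_term: torch.Tensor,
                long_term: torch.Tensor) -> torch.Tensor:
        # Each input: (batch, d_model) for a single timestep.
        sources = torch.stack([local, short_term, long_term], dim=1)  # (B, 3, D)
        flat = sources.flatten(start_dim=1)                           # (B, 3*D)
        weights = torch.softmax(self.gate(flat), dim=-1)              # (B, 3)
        # Weighted sum over the three sources -> (B, D)
        return (weights.unsqueeze(-1) * sources).sum(dim=1)

# Usage: mix three hypothetical 512-dim context vectors for a batch of 8.
gate = ContextGate(d_model=512)
h_local, h_short, h_long = (torch.randn(8, 512) for _ in range(3))
mixed = gate(h_local, h_short, h_long)  # shape: (8, 512)
```

A softmax gate is used here purely for concreteness; a sigmoid gate per source, or a chain of binary gates, would realize the same adaptive-combination idea.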