High-Performance Training by Exploiting Hot-Embeddings in Recommendation Systems

By DeepAI - 2021-03-01

Description

03/01/21 - Recommendation models are commonly used learning models that suggest relevant items to a user for e-commerce and online advertisem...

Summary

  • Recommendation models are commonly used learning models that suggest relevant items to a user for e-commerce and online advertisement-based applications.
  • Current recommendation models include deep-learning-based (DLRM) and time-based sequence (TBSM) models.
  • This paper tries to leverage skewed embedded table accesses to efficiently use the GPU resources during training.
  • This framework efficiently estimates and varies the size of the hot portions of the embedding tables within GPUs and reallocates the rest of the embeddings on the CPU.

 

Topics

  1. Machine_Learning (0.15)
  2. NLP (0.13)
  3. UX (0.07)

Similar Articles

FastFormers: 233x Faster Transformers inference on CPU

By Medium - 2020-11-04

Since the birth of BERT followed by that of Transformers have dominated NLP in nearly every language-related tasks whether it is Question-Answering, Sentiment Analysis, Text classification or Text…