NMF — A visual explainer and Python Implementation

By Medium - 2021-03-19

Description

Gain an intuition for the unsupervised learning algorithm that allows data scientists to extract topics from texts, photos, and more, and build those handy recommendation systems. NMF explanation is…

Summary

  • Gain an intuition for the unsupervised learning algorithm that allows data scientists to extract topics from texts, photos, and more, and build those handy recommendation systems.
  • NMF NMF stands for Latent Semantic Analysis with the ‘Non-negative Matrix-Factorization’ method used to decompose the document-term matrix into two smaller matrices — the document-topic matrix (U) and the topic-term matrix (W) — each populated with unnormalized probabilities.
  • The Art of Topic Modeling The output of NMF changes each time you run it, and the topics are not resolved — the data scientist must infer the topic from the highest word frequencies per topic, using the H matrix.
  • This is a generative model where you can use these smaller matrices to reconstruct the original text.

 

Topics

  1. NLP (0.36)
  2. Backend (0.11)
  3. Machine_Learning (0.08)

Similar Articles

Finding the Narrative with Natural Language Processing

By Medium - 2021-01-01

When I first started studying data science, one of the areas I was most excited to learn was natural language processing. “Unsupervised machine learning” certainly has a mystical ring to it, and…

What’s in a word?

By Medium - 2020-12-28

Why tf-idf sometimes fails to accurately capture word importance, and what we can use instead