Description
When I first started studying data science, one of the areas I was most excited to learn was natural language processing. “Unsupervised machine learning” certainly has a mystical ring to it, and…
Summary
- A framework for topic modeling with tweets in Python When I first started studying data science, one of the areas I was most excited to learn was natural language processing.
- “ I also found that making a basic dictionary of word frequencies was a quick way to get a feel for the prevalence of words and hashtags in the corpus which informed my text preprocessing pipeline.
- On the left, we can see numbered bubbles of varying sizes placed on a grid.
- As an example, we can see a lot of overlap in topics 7, 8, and 9, while topic 12 sits alone in the bottom left quadrant (topic 12 represented a bunch of German tweets — makes sense!
- ).