9 Distance Measures in Data Science

By Medium - 2021-02-01

Description

Exploring the advantages and pitfalls of 9 common distance measures used in Machine Learning applications.

Summary

  • Many algorithms, whether supervised or unsupervised, make use of distance measures.
  • Hamming distance can also be used to measure the distance between categorical variables.
  • Intuition is important in distance measures as it allows for better usage of the metric without v…
  • The Jaccard index (or Intersection over Union) is a metric used to calculate the similarity and diversity of sample sets.
  • In practice, it is the number of entities shared between the sets divided by the total number of distinct entities across both sets.
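
The two measures named in the summary can be sketched in a few lines of Python. This is a minimal illustration rather than the article's own code: the function names `hamming_distance` and `jaccard_index` are hypothetical, and treating two empty sets as identical (index 1.0) is an assumption made here to avoid dividing by zero.

```python
def hamming_distance(a, b):
    """Count the positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


def jaccard_index(a, b):
    """Size of the intersection divided by the size of the union."""
    a, b = set(a), set(b)
    union = a | b
    if not union:
        # Assumption: two empty sets are treated as identical.
        return 1.0
    return len(a & b) / len(union)


# Categorical records that differ in exactly one attribute.
print(hamming_distance(["red", "small", "round"], ["red", "large", "round"]))

# Two of the four distinct entities are shared between the sets.
print(jaccard_index({"a", "b", "c"}, {"b", "c", "d"}))
```

Hamming distance compares values position by position, so it only applies to sequences of equal length. The Jaccard index ignores order and duplicates because it works on sets; the corresponding distance is one minus the index.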

 

Topics

  1. Backend (0.25)
  2. NLP (0.17)
  3. Database (0.14)

Similar Articles

Data Transformation: Standardization vs Normalization

By KDnuggets - 2021-03-14

Increasing accuracy in your models is often obtained through the first steps of data transformations. This guide explains the difference between the key feature scaling methods of standardization and ...

HDBSCAN Clustering with Neo4j

By Medium - 2021-01-15

I recently came across the article “How HDBSCAN works” by Leland McInnes, and I was struck by the informative, accessible way he explained…

Use NEO4J To Find The Shortest Path

By Khalid Abuhakmeh’s Blog - 2020-04-28

Use the graph database, NEO4J, to find the shortest path between any two nodes. Code samples included.