"Everyone wants to do the model work, not the data work": Data Cascades in High-Stakes AI

By Google Research - 2021-02-05

Description

#CTO

Summary

  • "Everyone wants to do the model work, not the data work": Data quality carries an elevated significance in high-stakes AI due to its heightened downstream impact, impacting predictions like cancer detection, wildlife poaching, and loan allocations.
  • Paradoxically, data is the most under-valued and de-glamorised aspect of AI.
  • We define, identify, and present empirical evidence on Data Cascades---compounding events causing negative, downstream effects from data issues---triggered by conventional AI/ML practices that undervalue data quality.

 

Topics

  1. Backend (0.29)
  2. Database (0.16)
  3. Security (0.11)

Similar Articles

The Growing Importance of Metadata Management Systems

By Gradient Flow - 2021-02-02

Metadata will be the foundation for data governance solutions, data catalogs, and other enterprise data systems. By Assaf Araki and Ben Lorica. Introduction As companies embrace digital technologie…