Collecting, transforming and cleaning JSTOR metadata in Python

By Medium - 2021-03-16

Description

JSTOR database is one of the leading sources of research articles in more than 50 disciplines of science. In Data for Research section, researchers can access datasets for use in research and…

Summary

  • A simple guide into parsing meta-data from JSTOR data for research database using the ElementTree XML.
  • JSTOR database is one of the leading sources of research articles in more than 50 disciplines of science.
  • To make accessing larger volumes of data for data scientists and researchers easier, in this article, I show the python code for parsing the XML outputs, explain the process of collecting the data from JSTOR data for research database, and show a nice application of this type of data.
  • It follows the hierarchical structures of XML files and ignores Book reviews and notices.

 

Topics

  1. Backend (0.4)
  2. Database (0.23)
  3. Machine_Learning (0.16)

Similar Articles

The Growing Importance of Metadata Management Systems

By Gradient Flow - 2021-02-02

Metadata will be the foundation for data governance solutions, data catalogs, and other enterprise data systems. By Assaf Araki and Ben Lorica. Introduction As companies embrace digital technologie…