paigecm / large-dataset-for-all-5-journals

Bibliographic data from 5 journals: Book History, ELH (English Literary History), Huntington Library Quarterly, Sixteenth Century Journal, and Studies in Bibliography.

Readme

A dataset created for the CLIR microgrant-sponsored project Identifying Early Modern Books (IdEMB, http://www.idemb.org).

This data was obtained from JSTOR's Data for Research (DFR) in 2015 and minimally processed by Meaghan Brown, Jessica Otis, and Paige Morgan throughout 2016, during the grant period. It was converted from XML to CSV files, and then to RDF triples using OpenRefine and the DERI RDF OpenRefine extension. We used Jeff Chiu's OpenRefine VIAF reconciliation service (http://refine.codefork.com/) to lightly enhance the data, making it easier to see common authors across all five journals.
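
For readers who want a feel for the XML to CSV to RDF path described above, here is a minimal Python sketch. It is not the project's workflow (that used OpenRefine and the DERI RDF extension); the element names, the sample record, and the http://example.org/ namespace are all hypothetical placeholders, and the real DFR export schema may differ.

```python
import csv
import io
import xml.etree.ElementTree as ET
from urllib.parse import quote

# Hypothetical record layout; the real DFR export schema may differ.
SAMPLE_XML = """<articles>
  <article id="example-1">
    <title>An Example Title</title>
    <author>Example, Author</author>
    <journaltitle>Studies in Bibliography</journaltitle>
    <pubdate>1950</pubdate>
  </article>
</articles>"""

FIELDS = ["id", "title", "author", "journaltitle", "pubdate"]
BASE = "http://example.org/"  # placeholder namespace, not the dataset's own


def xml_to_rows(xml_text):
    """Flatten each <article> element into one dict per record."""
    root = ET.fromstring(xml_text)
    rows = []
    for article in root.iter("article"):
        row = {"id": article.get("id", "")}
        for field in FIELDS[1:]:
            row[field] = (article.findtext(field) or "").strip()
        rows.append(row)
    return rows


def rows_to_csv(rows):
    """Serialize the flattened records as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def escape_literal(value):
    """Escape a string for use inside an N-Triples literal."""
    return (value.replace("\\", "\\\\").replace('"', '\\"')
                 .replace("\n", "\\n").replace("\r", "\\r"))


def rows_to_triples(rows):
    """Emit one N-Triples line per non-empty field of each record."""
    lines = []
    for row in rows:
        subject = f"<{BASE}article/{quote(row['id'], safe='')}>"
        for field in FIELDS[1:]:
            if row[field]:
                literal = escape_literal(row[field])
                lines.append(f'{subject} <{BASE}{field}> "{literal}" .')
    return lines


rows = xml_to_rows(SAMPLE_XML)
csv_text = rows_to_csv(rows)
triples = rows_to_triples(rows)
```

Each CSV row becomes one subject, and each populated column becomes one statement about it, which is roughly how a per-field count of statements accumulates across a large set of records.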

This dataset includes several pre-written queries that can be customized, even by users with little or no knowledge of SPARQL. See the comments at the top of each query (lines beginning with #) for instructions. If a query stops working after your changes, just refresh the page to reset it. Users cannot permanently alter the queries or break them for other users.
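
As a rough illustration of what customizing a commented query involves, the sketch below holds a SPARQL query as a Python string and swaps in a journal title. The query text is not one of the dataset's actual queries: the `ex:` prefix, the `ex:journaltitle` and `ex:title` predicates, and the `customize_query` helper are hypothetical names chosen for the example.

```python
# Hypothetical query template; the prefix and predicates are placeholders,
# not the vocabulary actually used in this dataset.
QUERY_TEMPLATE = """# Change the journal title in quotation marks to search a different journal.
# Lines that begin with # are comments and are ignored when the query runs.
PREFIX ex: <http://example.org/>
SELECT ?article ?title
WHERE {
  ?article ex:journaltitle "%(journal)s" ;
           ex:title ?title .
}
LIMIT 25
"""


def customize_query(journal):
    """Return the query text with the given journal title filled in."""
    # Escape backslashes and double quotes so the title stays one literal.
    safe = journal.replace("\\", "\\\\").replace('"', '\\"')
    return QUERY_TEMPLATE % {"journal": safe}


query = customize_query("Book History")
print(query)
```

Only the quoted value changes between runs; the comment lines and the rest of the query stay as written, which is why editing a single marked value is usually enough.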

If you have requests for a particular query, please contact Paige Morgan at paige.c.morgan at gmail.

For more information on this project, see http://www.idemb.org, and watch for a forthcoming article in Archives Journal.

Homepage: http://www.idemb.org

Data: 326,522 statements

Size: 95.5 MB

License: None (All Rights Reserved)
