Reimagining the Archives as Data
Presentation Type
Presentation
Start Date
10-13-2021 11:00 AM
Abstract
“Collections as data” is the idea that researchers can access our archival holdings not only at the item level, but as datasets--to me mined, analyzed, or visualized using computational methods. This presentation introduces the concept of “collections as data” through the presenters’ work on dLOC as Data, a Mellon-funded initiative of the Digital Library of the Caribbean, which had the goal to enhance access to existing Caribbean newspaper collections by making texts available for bulk download to its users. The first part of the presentation will provide an overview of text analysis fundamentals for an archives audience. In the second half of the presentation we will introduce some examples of how these newspaper collections might be utilized using text and data analysis software such as AntConc and OpenRefine.
Organization Link
Florida International University
Streaming Media
Reimagining the Archives as Data
“Collections as data” is the idea that researchers can access our archival holdings not only at the item level, but as datasets--to me mined, analyzed, or visualized using computational methods. This presentation introduces the concept of “collections as data” through the presenters’ work on dLOC as Data, a Mellon-funded initiative of the Digital Library of the Caribbean, which had the goal to enhance access to existing Caribbean newspaper collections by making texts available for bulk download to its users. The first part of the presentation will provide an overview of text analysis fundamentals for an archives audience. In the second half of the presentation we will introduce some examples of how these newspaper collections might be utilized using text and data analysis software such as AntConc and OpenRefine.