From Linked Open Data to Collections as Data: A Reproducible Framework Using Federated Queries

作者
Meltem Dişli, Giulia Osti, Gustavo Candela, and Richard Zijdeman
出版日期
2025-12-15
內容

Libraries are adopting Linked Open Data (LOD) and Collections as Data (CaD) approaches to present their collections as datasets for direct computational use. However, research focused on federated and reproducible access to these datasets is limited. This work aims to develop a federated and reproducible approach for extracting CaD from LOD repositories. In this context, data extracted from the single authors Jorge Juan y Santacilia and María de Zayas y Sotomayor, as well as from multiple authors from the Spanish Golden Age movement (1492–1659), are used as examples. Federated and reproducible queries are conducted using the Wikidata SPARQL public endpoint and three institutional LOD repositories on Jupyter Notebooks. The data are exported in a format compatible with computational tools (e.g., CSV) by focusing on works of a single author or works from a specific movement. Additionally, the work allows for the visualization of the queries. The results of this work provide a valuable framework for both digital humanities researchers working on datasets and libraries aiming to present their collections as accessible data for computational analysis.

刊名
Information Technology and Libraries
卷期
Vol. 44 No.4
頁數
1-15
關鍵字
linked open data, collections as data, libraries, digital collections, reproducible framework, federated queries, cultural heritage
網址連結
發布日期:2026年01月23日 最後更新:2026年01月29日