Citation Weber, Thomas, Mitlöhner, Johann, Neumaier, Sebastian, Polleres, Axel. 2020. ODArchive - Creating an archive for structured data from Open Data Portals. In LNCS, eds. Jeff Z. Pan, Valentina Tamma, Claudia d’Amato, Krzysztof Janowicz, Bo Fu, Axel Polleres, Oshani Seneviratne, Lalana Kagal, 311-327. Virtual Conference (Athens, Greece): Springer.



Abstract

We present ODArchive, a large corpus of structured data collected from over 260 Open Data portals worldwide, along with curated, integrated metadata. Furthermore, we enrich the harvested datasets with heuristic annotations using the type hierarchies in existing Knowledge Graphs. We both (i) present the underlying distributed architecture to scale up regular harvesting and monitoring of changes on these portals, and (ii) make the corpus available via different APIs. Moreover, we (iii) analyse the characteristics of tabular data within the corpus. Our APIs can be used to regularly run such analyses or to reproduce experiments from the literature that have worked on static, not publicly available corpora.

Publication's profile

Status of publication Published
Affiliation WU
Type of publication Contribution to conference proceedings
Language English
Title ODArchive - Creating an archive for structured data from Open Data Portals
Title of whole publication LNCS
Editor Jeff Z. Pan, Valentina Tamma, Claudia d’Amato, Krzysztof Janowicz, Bo Fu, Axel Polleres, Oshani Seneviratne, Lalana Kagal
Page from 311
Page to 327
Location Virtual Conference (Athens, Greece)
Publisher Springer
Year 2020
URL http://polleres.net/publications/webe-etal-2020ISWC.pdf
Open Access N

Associations

People
Weber, Thomas (Former researcher)
Mitlöhner, Johann
Neumaier, Sebastian
Polleres, Axel
Organization
Institute for Data, Process and Knowledge Management (AE Polleres)
Research Institute for Computational Methods FI