Hernández-Illera, Antonio, A. Martínez-Prieto, Miguel, Fernandez Garcia, Javier David. Forthcoming. Serializing RDF in Compressed Space. In Data Compression Conference (DCC), Hrsg.
BibTeX
Abstract
The amount of generated RDF data has grown impressively over the last decade, promoting compression as an essential tool for storage and exchange. RDF compression techniques leverage syntactic and semantic redundancies, but structural repetitions are not always addressed effectively. This paper first shows two schema-based sources of redundancy underlying to the schema-relaxed nature of RDF. Then, we revisit the W3C HDT binary format to further compact its graph structure encoding. Our HDT++ approach reduces the original HDT Triples requirements up to 2 times for more structured datasets, and reports significant improvements even for highly semi-structured datasets like DBpedia. In general, HDT++ competes with the current state of the art for structural RDF compression, leading the comparison for three of the four analyzed datasets.
Tags
Press 'enter' for creating the tagPublication's profile
Affiliation | WU |
---|---|
Type of publication | Contribution to conference proceedings |
Language | English |
Title | Serializing RDF in Compressed Space |
Title of whole publication | Data Compression Conference (DCC) |
Year | 2015 |
URL | http://dataweb.infor.uva.es/wp-content/uploads/2015/01/dcc15.pdf |
Associations
- Projects
- Querying Archives of Dynamic Linked Open Data
- People
- Fernandez Garcia, Javier David (Former researcher)
- External
- A. Martínez-Prieto, Miguel
- Hernández-Illera, Antonio
- Organization
- Institute for Data, Process and Knowledge Management (AE Polleres) (Details)
- Research areas (ÖSTAT Classification 'Statistik Austria')
- 1108 Informatics (Details)
- 1109 Information and data processing (Details)