Please use this identifier to cite or link to this item: http://hdl.handle.net/1893/27991
Full metadata record
DC FieldValueLanguage
dc.contributor.authorGottron, Thomasen_UK
dc.contributor.authorKnauf, Malteen_UK
dc.contributor.authorScherp, Ansgaren_UK
dc.date.accessioned2018-10-18T00:03:33Z-
dc.date.available2018-10-18T00:03:33Z-
dc.date.issued2015-12en_UK
dc.identifier.urihttp://hdl.handle.net/1893/27991-
dc.description.abstractThe Linked Open Data (LOD) graph represents a web-scale distributed knowledge graph interlinking information about entities across various domains. A core concept is the lack of pre-defined schema which actually allows for flexibly modelling data from all kinds of domains. However, Linked Data does exhibit schema information in a twofold way: by explicitly attaching RDF types to the entities and implicitly by using domain-specific properties to describe the entities. In this paper, we present and apply different techniques for investigating the schematic information encoded in the LOD graph at different levels of granularity. We investigate different information theoretic properties of so-called Unique Subject URIs (USUs) and measure the correlation between the properties and types that can be observed for USUs on a large-scale semantic graph data set. Our analysis provides insights into the information encoded in the different schema characteristics. Two major findings are that implicit schema information is far more discriminative and that applications involving schema information based on either types or properties alone will only capture between 63.5 and 88.1 % of the schema information contained in the data. As the level of discrimination depends on how data providers model and publish their data, we have conducted in a second step an investigation based on pay-level domains (PLDs) as well as the semantic level of vocabularies. Overall, we observe that most data providers combine up to 10 vocabularies to model their data and that every fifth PLD uses a highly structured schema.en_UK
dc.language.isoenen_UK
dc.publisherBMCen_UK
dc.relationGottron T, Knauf M & Scherp A (2015) Analysis of schema structures in the Linked Open Data graph based on unique subject URIs, pay-level domains, and vocabulary usage. Distributed and Parallel Databases, 33 (4), pp. 515-553. https://doi.org/10.1007/s10619-014-7143-0en_UK
dc.rightsThe publisher does not allow this work to be made publicly available in this Repository. Please use the Request a Copy feature at the foot of the Repository record to request a copy directly from the author. You can only request a copy if you wish to use this work for your own research or private study.en_UK
dc.rights.urihttp://www.rioxx.net/licenses/under-embargo-all-rights-reserveden_UK
dc.subjectLinked Open Dataen_UK
dc.subjectSchema analysisen_UK
dc.subjectInformationen_UK
dc.subjectEntropyen_UK
dc.titleAnalysis of schema structures in the Linked Open Data graph based on unique subject URIs, pay-level domains, and vocabulary usageen_UK
dc.typeJournal Articleen_UK
dc.rights.embargodate2999-12-31en_UK
dc.rights.embargoreason[Gottron et al 2015.pdf] The publisher does not allow this work to be made publicly available in this Repository therefore there is an embargo on the full text of the work.en_UK
dc.identifier.doi10.1007/s10619-014-7143-0en_UK
dc.citation.jtitleDistributed and Parallel Databasesen_UK
dc.citation.issn1573-7578en_UK
dc.citation.issn0926-8782en_UK
dc.citation.volume33en_UK
dc.citation.issue4en_UK
dc.citation.spage515en_UK
dc.citation.epage553en_UK
dc.citation.publicationstatusPublisheden_UK
dc.citation.peerreviewedRefereeden_UK
dc.type.statusVoR - Version of Recorden_UK
dc.contributor.funderEuropean Commissionen_UK
dc.author.emailansgar.scherp@stir.ac.uken_UK
dc.citation.date11/02/2014en_UK
dc.contributor.affiliationUniversity of Koblenz-Landauen_UK
dc.contributor.affiliationUniversity of Koblenz-Landauen_UK
dc.contributor.affiliationUniversity of Kielen_UK
dc.identifier.isiWOS:000360553700003en_UK
dc.identifier.scopusid2-s2.0-84940718741en_UK
dc.identifier.wtid1007444en_UK
dc.contributor.orcid0000-0002-2653-9245en_UK
dcterms.dateAccepted2014-02-11en_UK
dc.date.filedepositdate2018-10-05en_UK
rioxxterms.apcnot requireden_UK
rioxxterms.typeJournal Article/Reviewen_UK
rioxxterms.versionVoRen_UK
local.rioxx.authorGottron, Thomas|en_UK
local.rioxx.authorKnauf, Malte|en_UK
local.rioxx.authorScherp, Ansgar|0000-0002-2653-9245en_UK
local.rioxx.projectProject ID unknown|European Commission (Horizon 2020)|en_UK
local.rioxx.freetoreaddate2264-01-12en_UK
local.rioxx.licencehttp://www.rioxx.net/licenses/under-embargo-all-rights-reserved||en_UK
local.rioxx.filenameGottron et al 2015.pdfen_UK
local.rioxx.filecount1en_UK
local.rioxx.source0926-8782en_UK
Appears in Collections:Computing Science and Mathematics Journal Articles

Files in This Item:
File Description SizeFormat 
Gottron et al 2015.pdfFulltext - Published Version1.15 MBAdobe PDFUnder Permanent Embargo    Request a copy


This item is protected by original copyright



Items in the Repository are protected by copyright, with all rights reserved, unless otherwise indicated.

The metadata of the records in the Repository are available under the CC0 public domain dedication: No Rights Reserved https://creativecommons.org/publicdomain/zero/1.0/

If you believe that any material held in STORRE infringes copyright, please contact library@stir.ac.uk providing details and we will remove the Work from public display in STORRE and investigate your claim.