Linking Datasets Using Semantic Textual Similarity

Linked data has been widely recognized as an important paradigm for representing data and one of the most important aspects of supporting its use is discovery of links between datasets. For many datasets, there is a significant amount of textual information in the form of labels, descriptions and documentation about the elements of the dataset and the fundament of a precise linking is in the application of semantic textual similarity to link these datasets. However, most linking tools so far rely on only simple string similarity metrics such as Jaccard scores. We present an evaluation of some metrics that have performed well in recent semantic textual similarity evaluations and apply these to linking existing datasets.

eISSN:: 1314-4081
Language:: English

Publication timeframe:: 4 times per year
Journal Subjects:: Computer Sciences, Information Technology

Journal RSS Feed

Linking Datasets Using Semantic Textual Similarity

Published Online: Mar 30, 2018

Page range: 109 - 123

Received: Sep 29, 2017

Accepted: Nov 30, 2017

DOI: https://doi.org/10.2478/cait-2018-0010

Keywords
Linked data, link discovery, ontology alignment, semantic textual similarity, structural similarity, NLP architectures

© 2018 John P. McCrae et al., published by De Gruyter Open

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.

Linking Datasets Using Semantic Textual Similarity

Published Online: Mar 30, 2018

Page range: 109 - 123

Received: Sep 29, 2017

Accepted: Nov 30, 2017

DOI: https://doi.org/10.2478/cait-2018-0010

KeywordsLinked data, link discovery, ontology alignment, semantic textual similarity, structural similarity, NLP architectures

© 2018 John P. McCrae et al., published by De Gruyter Open

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.

Keywords
Linked data, link discovery, ontology alignment, semantic textual similarity, structural similarity, NLP architectures