Do your Resources Sound Similar? On the Impact of Using Phonetic Similarity in Link Discovery

A.F.A. Ahmed, M. Sherif, A.-C. Ngonga Ngomo, in: K-CAP 2019: Knowledge Capture Conference, 2019.

Conference Paper | English
Abstract
An increasing number of heterogeneous datasets abiding by the Linked Data paradigm is published every day. Discovering links between these datasets is thus central to achieving the vision behind the Data Web. Declarative Link Discovery (LD) frameworks rely on complex Link Specifications (LS) to express the conditions under which two resources should be linked. Complex LS combine similarity measures with thresholds to determine whether a given predicate holds between two resources. State-of-the-art LD frameworks rely mostly on string-based similarity measures such as Levenshtein and Jaccard. However, string-based similarity measures often fail to capture the similarity of resources whose property values are phonetically similar but written with different string representations (e.g., names and street labels). In this paper, we evaluate the impact of using phonetic-based similarity measures in the process of LD. Moreover, we evaluate the impact of phonetic-based similarity measures on a state-of-the-art machine learning approach used to generate LS. Our experiments suggest that the combination of string-based and phonetic-based measures can improve the F-measures achieved by LD frameworks on most datasets.
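The idea of combining a string-based and a phonetic-based measure in one link specification can be illustrated with a minimal sketch. It is not the authors' implementation: the choice of Soundex as the phonetic algorithm, the 0.8 threshold, and the names `soundex`, `levenshtein`, `string_similarity`, and `should_link` are all assumptions made for illustration.

```python
def soundex(word: str) -> str:
    """American Soundex code of a word (illustrative phonetic key, assumed here)."""
    codes = {}
    for group, digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                         ("L", "4"), ("MN", "5"), ("R", "6")):
        for ch in group:
            codes[ch] = digit
    letters = [c for c in word.upper() if c.isalpha()]
    if not letters:
        return ""
    result = letters[0]
    prev = codes.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate consonants with the same code
        code = codes.get(ch, "")
        if code and code != prev:
            result += code
        prev = code  # a vowel resets prev, so a repeated code is kept after it
    return (result + "000")[:4]


def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings (dynamic programming, row by row)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # deletion
                           cur[j - 1] + 1,             # insertion
                           prev[j - 1] + (ca != cb)))  # substitution
        prev = cur
    return prev[-1]


def string_similarity(a: str, b: str) -> float:
    """Levenshtein distance normalised to a similarity in [0, 1]."""
    a, b = a.lower(), b.lower()
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - levenshtein(a, b) / longest


def should_link(a: str, b: str, string_threshold: float = 0.8) -> bool:
    """Hypothetical specification: link if either measure passes its threshold."""
    phonetic_match = soundex(a) != "" and soundex(a) == soundex(b)
    return string_similarity(a, b) >= string_threshold or phonetic_match


if __name__ == "__main__":
    for left, right in (("Meyer", "Maier"), ("Meyer", "Schmidt")):
        print(left, right, round(string_similarity(left, right), 2),
              soundex(left), soundex(right), should_link(left, right))
```

In this sketch, "Meyer" and "Maier" fall below the string threshold but share a Soundex code, so only the phonetic branch of the disjunction links them. That is the kind of case the abstract describes for names and street labels.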
Publishing Year
2019
Proceedings Title
K-CAP 2019: Knowledge Capture Conference

Cite this

Ahmed AFA, Sherif M, Ngonga Ngomo A-C. Do your Resources Sound Similar? On the Impact of Using Phonetic Similarity in Link Discovery. In: K-CAP 2019: Knowledge Capture Conference; 2019.
Ahmed, A. F. A., Sherif, M., & Ngonga Ngomo, A.-C. (2019). Do your Resources Sound Similar? On the Impact of Using Phonetic Similarity in Link Discovery. K-CAP 2019: Knowledge Capture Conference.
@inproceedings{Ahmed_Sherif_Ngonga Ngomo_2019, title={Do your Resources Sound Similar? On the Impact of Using Phonetic Similarity in Link Discovery}, booktitle={K-CAP 2019: Knowledge Capture Conference}, author={Ahmed, Abdullah Fathi Ahmed and Sherif, Mohamed and Ngonga Ngomo, Axel-Cyrille}, year={2019} }
Ahmed, Abdullah Fathi Ahmed, Mohamed Sherif, and Axel-Cyrille Ngonga Ngomo. “Do Your Resources Sound Similar? On the Impact of Using Phonetic Similarity in Link Discovery.” In K-CAP 2019: Knowledge Capture Conference, 2019.
A. F. A. Ahmed, M. Sherif, and A.-C. Ngonga Ngomo, “Do your Resources Sound Similar? On the Impact of Using Phonetic Similarity in Link Discovery,” 2019.
Ahmed, Abdullah Fathi Ahmed, et al. “Do Your Resources Sound Similar? On the Impact of Using Phonetic Similarity in Link Discovery.” K-CAP 2019: Knowledge Capture Conference, 2019.
