Relational clustering for multi-type entity resolution

Authors: 
Bhattacharya, Indrajit; Getoor, Lise
Year: 
2005
Venue: 
Conference on Knowledge Discovery in Data
URL: 
http://doi.acm.org/10.1145/1090193.1090195
Citations: 
49

In many applications, there are a variety of ways of referring to the same underlying entity. Given a collection of references to entities, we would like to determine the set of true underlying entities and map the references to these entities. The references may be to entities of different types and more than one type of entity may need to be resolved at the same time. We propose similarity measures for clustering references taking into account the different relations that are observed among the typed references. We pose typed entity resolution in relational data as a clustering problem and present experimental results on real data showing improvements over attribute-based models when relations are leveraged.
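
To make the idea of combining attribute and relational evidence concrete, here is a minimal sketch in Python. It is not the similarity measure or algorithm from the paper. It assumes a greedy agglomerative procedure, a token-overlap attribute similarity, and a Jaccard overlap between the clusters that co-occur with each candidate. The names resolve, token_jaccard, alpha and threshold, and the toy references, are all hypothetical and chosen for illustration only.

```python
from itertools import combinations


def token_jaccard(a, b):
    """Attribute similarity (assumed): Jaccard overlap of lowercased name tokens."""
    ta = set(a.lower().replace(".", "").split())
    tb = set(b.lower().replace(".", "").split())
    return len(ta & tb) / len(ta | tb) if ta | tb else 0.0


def resolve(refs, links, alpha=0.5, threshold=0.4):
    """Greedy relational clustering sketch.

    refs:  dict ref_id -> (entity_type, name)
    links: list of sets of ref_ids observed together (e.g. one paper)
    Returns dict ref_id -> cluster label (the id of one member reference).
    """
    cluster_of = {r: r for r in refs}
    members = {r: {r} for r in refs}

    def neighbours(c):
        # Labels of the clusters that co-occur with any reference in cluster c.
        out = set()
        for link in links:
            if link & members[c]:
                out |= {cluster_of[r] for r in link}
        out.discard(c)
        return out

    def similarity(c1, c2):
        # Best attribute match between any two member references.
        attr = max(token_jaccard(refs[x][1], refs[y][1])
                   for x in members[c1] for y in members[c2])
        # Relational part: overlap of the two neighbourhoods, over all types.
        n1, n2 = neighbours(c1), neighbours(c2)
        rel = len(n1 & n2) / len(n1 | n2) if n1 | n2 else 0.0
        return (1 - alpha) * attr + alpha * rel

    while True:
        # Only clusters of the same entity type are merge candidates.
        candidates = [(similarity(c1, c2), c1, c2)
                      for c1, c2 in combinations(sorted(members), 2)
                      if refs[c1][0] == refs[c2][0]]
        if not candidates:
            break
        best, c1, c2 = max(candidates, key=lambda t: t[0])
        if best < threshold:
            break
        members[c1] |= members.pop(c2)
        for r in members[c1]:
            cluster_of[r] = c1
    return cluster_of


# Toy data (invented): three papers, each linking two authors and a venue.
refs = {
    "a1": ("author", "L. Getoor"),
    "a2": ("author", "Indrajit Bhattacharya"),
    "v1": ("venue", "KDD"),
    "a3": ("author", "Lise Getoor"),
    "a4": ("author", "Indrajit Bhattacharya"),
    "v2": ("venue", "KDD Conference"),
    "a5": ("author", "Lee Getoor"),
    "a6": ("author", "J. Smith"),
    "v3": ("venue", "VLDB"),
}
links = [{"a1", "a2", "v1"}, {"a3", "a4", "v2"}, {"a5", "a6", "v3"}]

relational = resolve(refs, links, alpha=0.5)
attribute_only = resolve(refs, links, alpha=0.0)
print(relational)
print(attribute_only)
```

On this toy input the merges feed each other across types. The exact-match author pair is merged first, that shared coauthor raises the relational score of the two venue references, and once the venues are merged the two "Getoor" references that share both a coauthor and a venue are merged as well, while "Lee Getoor" stays separate. With alpha=0.0 (attributes only) and the same threshold, "L. Getoor" and "Lise Getoor" are not merged, which is the kind of gap the abstract attributes to attribute-based models.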