bpa.arizona.edu

Entity identification for heterogeneous database integration: a multiple classifier system approach and empirical evaluation

Authors: 
Zhao, Huimin; Ram, Sudha
Year: 
2005
Venue: 
Information Systems

Entity identification, i.e., detecting semantically corresponding records from heterogeneous data sources, is a critical step in integrating the data sources. The objective of this research is to develop and evaluate a novel multiple classifier system approach that improves entity identification accuracy. We apply various classification techniques drawn from statistical pattern recognition, machine learning, and artificial neural networks to determine whether two records from different data sources represent the same real-world entity.

Syndicate content