Fellegi, I.P.; Sunter, A.B. A Theory for Record Linkage |
1969 |
1444 |
Oct06 |
Navarro, G A guided tour to approximate string matching |
2001 |
1369 |
May07 |
Elmagarmid, Ahmed; Ipeirotis, Panagiotis; Verykios, Vassilios Duplicate Record Detection: A Survey |
2007 |
785 |
Oct06 |
Hernandez, M.A.; Stolfo, S.J. The merge/purge problem for large databases |
1995 |
751 |
Sep06 |
Winkler, W.E. The state of record linkage and current research problems |
1999 |
634 |
Oct06 |
Hernandez, MA; Stolfo, S. Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem |
1998 |
604 |
Sep06 |
McCallum, A; Nigam, K; Ungar, LH Efficient clustering of high-dimensional data sets with application to reference matching |
2000 |
550 |
Apr07 |
Ristad, ES; Yianilos, PN; Inc, M.T.; Princeton, NJ Learning string-edit distance |
1998 |
498 |
Oct06 |
Monge, A.; Elkan, C. The field matching problem: Algorithms and applications |
1996 |
443 |
Oct06 |
Cohen, WW Integration of heterogeneous databases without common domains using queries based on textual similarity |
1998 |
438 |
Sep06 |
Chaudhuri, S.; Ganjam, K.; Ganti, V.; Motwani, R. Robust and efficient fuzzy match for online data cleaning |
2003 |
378 |
Sep06 |
Monge, A.E.; Elkan, C. An efficient domain-independent algorithm for detecting approximately duplicate database records |
1997 |
364 |
Oct06 |
Cohen, William; Richman, Jacob Learning to match and cluster large high-dimensional data sets for data integration |
2002 |
274 |
Oct06 |
Hjaltason, G.R.; Samet, H. Incremental distance join algorithms for spatial databases |
1998 |
250 |
Oct06 |
Bhattacharya, I.; Getoor, L.; Collective Entity Resolution in Relational Data |
2007 |
238 |
Apr07 |
Tejada, S; Knoblock, CA; Minton, S Learning object identification rules for information integration |
2001 |
219 |
May08 |
Bitton, D.; DeWitt, D.J. Duplicate record elimination in large data files |
1983 |
208 |
Oct06 |
Winkler, W.E. Advanced methods for record linkage |
1994 |
196 |
Oct06 |
Cohen, W.W. Data integration using similarity joins and a word-based information representation language |
2000 |
195 |
Oct06 |
Bhattacharya, I.; Getoor, L.; A Latent Dirichlet Model for Unsupervised Entity Resolution |
2006 |
144 |
Apr07 |
Kalashnikov, D.V.; Mehrotra, S.; Chen, Z. Exploiting relationships for domain-independent data cleaning |
2005 |
111 |
Oct06 |
Kalashnikov, DV; Mehrotra, S Domain-independent data cleaning via analysis of entity-relationship graph |
2006 |
98 |
Apr07 |
Monge, AE Matching Algorithms within a Duplicate Detection System |
2000 |
91 |
Apr07 |
Low, WL; Lee, ML; Ling, TW A knowledge-based approach for duplicate elimination in data cleaning |
2001 |
88 |
Apr07 |
Churches, T; Christen, P Some methods for blindfolded record linkage |
2004 |
79 |
Mar10 |
Xi, W; Fox, EA; Fan, W; Zhang, B; Chen, Z; Yan, J; J Yan, D SimFusion: measuring similarity using unified relationship matrix |
2005 |
75 |
Oct06 |
Bilenko, Mikhail; Kamath, Beena; Mooney, Raymond J. Adaptive Blocking: Learning to Scale Up Record Linkage |
2006 |
73 |
Feb08 |
Doan, AnHai; Lu, Ying; Lee, Yoonkyong; Han, Jiawei Object Matching for Information Integration: A Profiler-Based Approach |
2003 |
68 |
Sep06 |
Chen, Z; Kalashnikov, DV; Mehrotra, S Exploiting relationships for object consolidation |
2005 |
68 |
Sep06 |
Verykios, V. S.; Moustakides, G. V.; Elfeky, M. G. A Bayesian decision model for cost optimal record matching |
2003 |
66 |
Oct06 |
Michelson, Matthew; Knoblock, Craig A. Learning Blocking Schemes for Record Linkage |
2006 |
58 |
Mar08 |
Bilenko, Mikhail; Basu, Sugato; Sahami, Mehran Adaptive Product Normalization: Using Online Learning for Record Linkage in Comparison Shopping |
2005 |
51 |
Feb08 |
Bhattacharya, Indrajit; Getoor, Lise Relational clustering for multi-type entity resolution |
2005 |
49 |
Oct06 |
Saïs, Fatiha; Pernelle, Nathalie; Rousset, Marie-Christine Combining a Logical and a Numerical Method for Data Reconciliation |
2009 |
44 |
Jan10 |
Chaudhuri, Surajit;Chen, Bee-Chung;Ganti, Venkatesh;Kaushik, Raghav Example-driven Design of Efficient Record Matching Queries |
2007 |
44 |
Mar08 |
Lee, M.L.; Hsu, W.; Kothari, V. Cleaning the spurious links in data |
2004 |
40 |
Sep06 |
Christen, Peter Febrl - A freely available record linkage system with a graphical user interface |
2008 |
40 |
May08 |
Schallehn, E; Sattler, KU; Saake, G Efficient similarity-based operations for data integration |
2004 |
39 |
Apr07 |
Bhattacharya, I; Getoor, L; Licamele, L Query-time entity resolution |
2006 |
39 |
Sep06 |
Puhlmann, Sven; Weis, Melanie; Naumann, Felix XML Duplicate Detection Using Sorted Neighborhoods |
2006 |
36 |
Apr07 |
Aizawa, A; Oyama, K A Fast Linkage Detection Scheme for Multi-Source Information Integration |
2005 |
35 |
Nov07 |
Benjelloun, O.; Garcia-Molina, H.; Gong, H.; Kawai, H; Larson, T.E.; Menestrina, D.; Thavisomboon, S. D-Swoosh: A Family of Algorithms for Generic, Distributed Entity Resolution |
2007 |
32 |
Aug07 |
Cudre-Mauroux, P; Jost, M; Meer, H De idMesh: graph-based disambiguation of linked data |
2009 |
31 |
Mar10 |
Shen, W.; DeRose, P.; Vu, L.; Doan, A.; Ramakrishnan, R. Source-aware entity matching: A compositional approach |
2007 |
29 |
Apr07 |
Quass, D.; Starkey, P. Record linkage for genealogical databases |
2003 |
24 |
Sep06 |
Michalowski, M; Thakkar, S; Knoblock, CA Exploiting secondary sources for automatic object consolidation |
2003 |
23 |
Apr07 |
Arasu, A; Götz, M; Kaushik, R. On active learning of record matching packages |
2010 |
22 |
Apr11 |
Ganesh, M.; Srivastava, J.; Richardson, T. Mining entity-identification rules for database integration |
1996 |
21 |
Sep06 |
Kalashnikov, DV; Mehrotra, S A probabilistic model for entity disambiguation using relationships |
2005 |
16 |
Sep06 |
Su, W; Wang, J; Lochovsky, F.H. Record Matching over Query Results from Multiple Web Databases |
2010 |
13 |
Apr11 |