Ristad, E.S.; Yianilos, P.N. Learning string-edit distance |
1998 |
498 |
Oct06 |
Monge, A.; Elkan, C. The field matching problem: Algorithms and applications |
1996 |
443 |
Oct06 |
Cohen, W.W. Integration of heterogeneous databases without common domains using queries based on textual similarity |
1998 |
438 |
Sep06 |
Chaudhuri, S.; Ganjam, K.; Ganti, V.; Motwani, R. Robust and efficient fuzzy match for online data cleaning |
2003 |
378 |
Sep06 |
Monge, A.E.; Elkan, C. An efficient domain-independent algorithm for detecting approximately duplicate database records |
1997 |
364 |
Oct06 |
Cohen, W.W.; Richman, J. Learning to match and cluster large high-dimensional data sets for data integration |
2002 |
274 |
Oct06 |
Hjaltason, G.R.; Samet, H. Incremental distance join algorithms for spatial databases |
1998 |
250 |
Oct06 |
Bhattacharya, I.; Getoor, L. Collective entity resolution in relational data |
2007 |
238 |
Apr07 |
Bitton, D.; DeWitt, D.J. Duplicate record elimination in large data files |
1983 |
208 |
Oct06 |
Cohen, W.W. Data integration using similarity joins and a word-based information representation language |
2000 |
195 |
Oct06 |
Bhattacharya, I.; Getoor, L. A latent Dirichlet model for unsupervised entity resolution |
2006 |
144 |
Apr07 |
Kalashnikov, D.V.; Mehrotra, S.; Chen, Z. Exploiting relationships for domain-independent data cleaning |
2005 |
111 |
Oct06 |