Search: no dataset

Click a term to refine your current search.

Search: no dataset

Results

Title/Author	Year	Citations	added
Nelder, J.A.; Mead, R. A simplex method for function minimization	1965	16651	Sep06
Kohavi, R.; John, G.H. Wrappers for Feature Subset Selection	1997	4115	Sep06
Fellegi, I.P.; Sunter, A.B. A Theory for Record Linkage	1969	1444	Oct06
Navarro, G A guided tour to approximate string matching	2001	1369	May07
Elmagarmid, Ahmed; Ipeirotis, Panagiotis; Verykios, Vassilios Duplicate Record Detection: A Survey	2007	785	Oct06
Rahm, Erhard; Do, Hong Hai Data Cleaning: Problems and Current Approaches	2000	778	Aug06
Hernandez, M.A.; Stolfo, S.J. The merge/purge problem for large databases	1995	751	Sep06
Winkler, W.E. The state of record linkage and current research problems	1999	634	Oct06
Hernandez, MA; Stolfo, S. Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem	1998	604	Sep06
McCallum, A; Nigam, K; Ungar, LH Efficient clustering of high-dimensional data sets with application to reference matching	2000	550	Apr07
Ristad, ES; Yianilos, PN; Inc, M.T.; Princeton, NJ Learning string-edit distance	1998	498	Oct06
Monge, A.; Elkan, C. The field matching problem: Algorithms and applications	1996	443	Oct06
Cohen, WW Integration of heterogeneous databases without common domains using queries based on textual similarity	1998	438	Sep06
Chaudhuri, S.; Ganjam, K.; Ganti, V.; Motwani, R. Robust and efficient fuzzy match for online data cleaning	2003	378	Sep06
Monge, A.E.; Elkan, C. An efficient domain-independent algorithm for detecting approximately duplicate database records	1997	364	Oct06
Bilenko, M; Mooney, R; Cohen, W; P Ravikumar, S Adaptive name matching in information integration	2003	339	Nov07
Galhardas, H; Florescu, D; Shasha, D; Simon, E; Saita, C. Declarative data cleaning: Language, model, and algorithms	2001	323	Sep06
Cohen, William; Richman, Jacob Learning to match and cluster large high-dimensional data sets for data integration	2002	274	Oct06
Hjaltason, G.R.; Samet, H. Incremental distance join algorithms for spatial databases	1998	250	Oct06
Bhattacharya, I.; Getoor, L.; Collective Entity Resolution in Relational Data	2007	238	Apr07
Tejada, S Learning Object Identification Rules for Information Integration	2002	219	Oct06
Bitton, D.; DeWitt, D.J. Duplicate record elimination in large data files	1983	208	Oct06
Winkler, W.E. Advanced methods for record linkage	1994	196	Oct06
Cohen, W.W. Data integration using similarity joins and a word-based information representation language	2000	195	Oct06
Galhardas, H; Florescu, D; Shasha, D; Simon, E AJAX: an extensible data cleaning tool	2000	175	Sep06
Bhattacharya, I.; Getoor, L.; A Latent Dirichlet Model for Unsupervised Entity Resolution	2006	144	Apr07
Kalashnikov, D.V.; Mehrotra, S.; Chen, Z. Exploiting relationships for domain-independent data cleaning	2005	111	Oct06
Kalashnikov, DV; Mehrotra, S Domain-independent data cleaning via analysis of entity-relationship graph	2006	98	Apr07
Monge, AE Matching Algorithms within a Duplicate Detection System	2000	91	Apr07
Low, WL; Lee, ML; Ling, TW A knowledge-based approach for duplicate elimination in data cleaning	2001	88	Apr07
Song, Y; Huang, J; Councill, IG; Li, J; Giles, CL Efficient topic-based unsupervised name disambiguation	2007	76	Nov07
Xi, W; Fox, EA; Fan, W; Zhang, B; Chen, Z; Yan, J; J Yan, D SimFusion: measuring similarity using unified relationship matrix	2005	75	Oct06
Karger, DR; Jones, W Data unification in personal information management	2006	71	Apr07
Borgman, CL; Siegfried, SL Getty's Synoname and its cousins: A survey of applications of personal name-matching algorithms	1992	69	Apr07
Doan, AnHai; Lu, Ying; Lee, Yoonkyong; Han, Jiawei Object Matching for Information Integration: A Profiler-Based Approach	2003	68	Sep06
Chen, Z; Kalashnikov, DV; Mehrotra, S Exploiting relationships for object consolidation	2005	68	Sep06
Verykios, V. S.; Moustakides, G. V.; Elfeky, M. G. A Bayesian decision model for cost optimal record matching	2003	66	Oct06
Tan, YF; Kan, MY; Lee, D Search engine driven author disambiguation	2006	63	Apr07
Lee, Dongwon; On, Byung-Won; Kang, Jaewoo; Park, Sanghyun Effective and scalable solutions for mixed and split citation problems in digital libraries	2005	61	Oct06
Scannapieco, M; Missier, P; Batini, C Data Quality at a Glance	2005	58	Apr07
Bhattacharya, Indrajit; Getoor, Lise Relational clustering for multi-type entity resolution	2005	49	Oct06
Lee, M.L.; Hsu, W.; Kothari, V. Cleaning the spurious links in data	2004	40	Sep06
Schallehn, E; Sattler, KU; Saake, G Efficient similarity-based operations for data integration	2004	39	Apr07
Chua, CEH; Chiang, RHL; Lim, EP Instance-based attribute identification in database integration	2003	36	Sep06
Aizawa, A; Oyama, K A Fast Linkage Detection Scheme for Multi-Source Information Integration	2005	35	Nov07
Benjelloun, O.; Garcia-Molina, H.; Gong, H.; Kawai, H; Larson, T.E.; Menestrina, D.; Thavisomboon, S. D-Swoosh: A Family of Algorithms for Generic, Distributed Entity Resolution	2007	32	Aug07
Herbert, KG; Gehani, NH; Piel, WH; Wang, JTL; Wu, CH BIO-AJAX: an extensible framework for biological data cleaning	2004	30	Mar07
Galhardas, H; Florescu, D; Shasha, D; Simon, E; E Simon, CA Improving data cleaning quality using a data lineage facility	2001	30	Oct06
Shen, W.; DeRose, P.; Vu, L.; Doan, A.; Ramakrishnan, R. Source-aware entity matching: A compositional approach	2007	29	Apr07
Christen, Peter; Churches, Tim Febrl - Freely extensible biomedical record linkage	2002	29	Oct06

Title/Author

Year

Citations