Search: no datatype

Author/Title	Year	Citations	Added
Nelder, J.A.; Mead, R. A simplex method for function minimization	1965	16651	Sep06
Kohavi, R.; John, G.H. Wrappers for Feature Subset Selection	1997	4115	Sep06
Fellegi, I.P.; Sunter, A.B. A Theory for Record Linkage	1969	1444	Oct06
Navarro, G A guided tour to approximate string matching	2001	1369	May07
Elmagarmid, Ahmed; Ipeirotis, Panagiotis; Verykios, Vassilios Duplicate Record Detection: A Survey	2007	785	Oct06
Rahm, Erhard; Do, Hong Hai Data Cleaning: Problems and Current Approaches	2000	778	Aug06
Winkler, W.E. The state of record linkage and current research problems	1999	634	Oct06
Hernandez, MA; Stolfo, S. Real-world Data is Dirty: Data Cleansing and The Merge/Purge Problem	1998	604	Sep06
McCallum, A; Nigam, K; Ungar, LH Efficient clustering of high-dimensional data sets with application to reference matching	2000	550	Apr07
Ristad, ES; Yianilos, PN Learning string-edit distance	1998	498	Oct06
Chaudhuri, S.; Ganjam, K.; Ganti, V.; Motwani, R. Robust and efficient fuzzy match for online data cleaning	2003	378	Sep06
Monge, A.E.; Elkan, C. An efficient domain-independent algorithm for detecting approximately duplicate database records	1997	364	Oct06
Bilenko, M; Mooney, R; Cohen, W; Ravikumar, P; Fienberg, S Adaptive name matching in information integration	2003	339	Nov07
Galhardas, H; Florescu, D; Shasha, D; Simon, E; Saita, C. Declarative data cleaning: Language, model, and algorithms	2001	323	Sep06
Cohen, William; Richman, Jacob Learning to match and cluster large high-dimensional data sets for data integration	2002	274	Oct06
Hjaltason, G.R.; Samet, H. Incremental distance join algorithms for spatial databases	1998	250	Oct06
Bhattacharya, I.; Getoor, L. Collective Entity Resolution in Relational Data	2007	238	Apr07
Tejada, S Learning Object Identification Rules for Information Integration	2002	219	Oct06
Winkler, W.E. Advanced methods for record linkage	1994	196	Oct06
Cohen, W.W. Data integration using similarity joins and a word-based information representation language	2000	195	Oct06
Galhardas, H; Florescu, D; Shasha, D; Simon, E AJAX: an extensible data cleaning tool	2000	175	Sep06
Bhattacharya, I.; Getoor, L. A Latent Dirichlet Model for Unsupervised Entity Resolution	2006	144	Apr07
Kalashnikov, D.V.; Mehrotra, S.; Chen, Z. Exploiting relationships for domain-independent data cleaning	2005	111	Oct06
Kalashnikov, DV; Mehrotra, S Domain-independent data cleaning via analysis of entity-relationship graph	2006	98	Apr07
Monge, AE Matching Algorithms within a Duplicate Detection System	2000	91	Apr07
Low, WL; Lee, ML; Ling, TW A knowledge-based approach for duplicate elimination in data cleaning	2001	88	Apr07
Song, Y; Huang, J; Councill, IG; Li, J; Giles, CL Efficient topic-based unsupervised name disambiguation	2007	76	Nov07
Xi, W; Fox, EA; Fan, W; Zhang, B; Chen, Z; Yan, J; Zhuang, D SimFusion: measuring similarity using unified relationship matrix	2005	75	Oct06
Karger, DR; Jones, W Data unification in personal information management	2006	71	Apr07
Borgman, CL; Siegfried, SL Getty's Synoname and its cousins: A survey of applications of personal name-matching algorithms	1992	69	Apr07
Doan, AnHai; Lu, Ying; Lee, Yoonkyong; Han, Jiawei Object Matching for Information Integration: A Profiler-Based Approach	2003	68	Sep06
Chen, Z; Kalashnikov, DV; Mehrotra, S Exploiting relationships for object consolidation	2005	68	Sep06
Verykios, V. S.; Moustakides, G. V.; Elfeky, M. G. A Bayesian decision model for cost optimal record matching	2003	66	Oct06
Tan, YF; Kan, MY; Lee, D Search engine driven author disambiguation	2006	63	Apr07
Lee, Dongwon; On, Byung-Won; Kang, Jaewoo; Park, Sanghyun Effective and scalable solutions for mixed and split citation problems in digital libraries	2005	61	Oct06
Scannapieco, M; Missier, P; Batini, C Data Quality at a Glance	2005	58	Apr07
Lee, M.L.; Hsu, W.; Kothari, V. Cleaning the spurious links in data	2004	40	Sep06
Schallehn, E; Sattler, KU; Saake, G Efficient similarity-based operations for data integration	2004	39	Apr07
Bhattacharya, I; Getoor, L; Licamele, L Query-time entity resolution	2006	39	Sep06
Chua, CEH; Chiang, RHL; Lim, EP Instance-based attribute identification in database integration	2003	36	Sep06
Aizawa, A; Oyama, K A Fast Linkage Detection Scheme for Multi-Source Information Integration	2005	35	Nov07
Benjelloun, O.; Garcia-Molina, H.; Gong, H.; Kawai, H; Larson, T.E.; Menestrina, D.; Thavisomboon, S. D-Swoosh: A Family of Algorithms for Generic, Distributed Entity Resolution	2007	32	Aug07
Herbert, KG; Gehani, NH; Piel, WH; Wang, JTL; Wu, CH BIO-AJAX: an extensible framework for biological data cleaning	2004	30	Mar07
Galhardas, H; Florescu, D; Shasha, D; Simon, E; Saita, CA Improving data cleaning quality using a data lineage facility	2001	30	Oct06
Shen, W.; DeRose, P.; Vu, L.; Doan, A.; Ramakrishnan, R. Source-aware entity matching: A compositional approach	2007	29	Apr07
Christen, Peter; Churches, Tim Febrl - Freely extensible biomedical record linkage	2002	29	Oct06
Barateiro, José; Galhardas, Helena A Survey of Data Quality Tools	2005	28	Apr07
Quass, D.; Starkey, P. Record linkage for genealogical databases	2003	24	Sep06
Michalowski, M; Thakkar, S; Knoblock, CA Exploiting secondary sources for automatic object consolidation	2003	23	Apr07
Ganesh, M.; Srivastava, J.; Richardson, T. Mining entity-identification rules for database integration	1996	21	Sep06

Data Cleaning publication categorizer
