Chaudhuri, S.; Ganti, V.; Kaushik, R. A Primitive Operator for Similarity Joins in Data Cleaning |
2006 |
201 |
Oct06 |
Bhattacharya, I.; Getoor, L.; A Latent Dirichlet Model for Unsupervised Entity Resolution |
2006 |
144 |
Apr07 |
Bai, Y.; Wang, F.; Liu, P. Efficiently Filtering RFID Data Streams |
2006 |
114 |
Sep06 |
Kalashnikov, DV; Mehrotra, S Domain-independent data cleaning via analysis of entity-relationship graph |
2006 |
98 |
Apr07 |
Hassell, J.; Aleman-Meza, B.; Arpinar, I.B. Ontology-Driven Automatic Entity Disambiguation in Unstructured Text |
2006 |
86 |
Apr07 |
Bilenko, Mikhail; Kamath, Beena; Mooney, Raymond J. Adaptive Blocking: Learning to Scale Up Record Linkage |
2006 |
73 |
Feb08 |
Karger, DR; Jones, W Data unification in personal information management |
2006 |
71 |
Apr07 |
Tan, YF; Kan, MY; Lee, D Search engine driven author disambiguation |
2006 |
63 |
Apr07 |
Michelson, Matthew; Knoblock, Craig A. Learning Blocking Schemes for Record Linkage |
2006 |
58 |
Mar08 |
Menestrina, D.; Benjelloun, O.; Garcia-Molina, H. Generic Entity Resolution with Data Confidences |
2006 |
44 |
Sep06 |
Bhattacharya, I; Getoor, L; Licamele, L Query-time entity resolution |
2006 |
39 |
Sep06 |
Zhuang, Y.; Chen, L. In-network Outlier Cleaning for Data Collection in Sensor Networks |
2006 |
38 |
Sep06 |
Puhlmann, Sven; Weis, Melanie; Naumann, Felix XML Duplicate Detection Using Sorted Neighborhoods |
2006 |
36 |
Apr07 |
Li, Huajing; Councill, Isaac; Lee, Wang-Chien; Giles, C. Lee CiteSeerX: an Architecture and Web Service Design for an Academic Document Search Engine |
2006 |
29 |
Feb07 |
Milano, D.; Scannapieco, M.; Catarci, T. Structure Aware XML Object Identification |
2006 |
27 |
Sep06 |
Kirsten, T.; Rahm, E. BioFuice: Mapping-based data integration in bioinformatics |
2006 |
18 |
Sep06 |
Bolelli, Levent; Ertekin, Seyda; Giles, C. Lee Clustering Scientific Literature Using Sparse Citation Graph Analysis |
2006 |
15 |
Feb07 |
Weis, M.; Naumann, F.; Brosy, F. A Duplicate Detection Benchmark for XML (and Relational) Data |
2006 |
13 |
Oct06 |
Councill, Isaac G.; Giles, C. Lee; Iorio, Ernesto Di; Gori, Marco; Maggini, Marco; Pucci, Augusto Towards Next Generation CiteSeer: A Flexible Architecture for Digital Library Deployment |
2006 |
13 |
Feb07 |
Phua, C; Lee, V; Smith, K The Personal Name Problem And a Recommended Data Mining Solution |
2006 |
12 |
Apr07 |
Councill, Isaac G.; Li, Huajing; Zhuang, Ziming; Debnath, Sandip; Bolelli, Levent; Lee, Wang-Chien; Sivasubramaniam, Anand; Giles, C. Lee Learning metadata from the evidence in an on-line citation matching scheme |
2006 |
10 |
Feb07 |
Kotidis, Y.; Marian, A.; Srivastava, D. Circumventing Data Quality Problems Using Multiple Join Paths |
2006 |
10 |
Sep06 |
Jakoniene, V; Rundqvist, D;Lambrix, P A method for similarity-based grouping of biological data |
2006 |
8 |
Mar07 |
Dai, B. T.; Koudas, N.; Ooi, B. C.; Srivastava, D.; Venkatasubramanian, S. Column Heterogeneity as a Measure of Data Quality |
2006 |
8 |
Sep06 |
Qi, Y.; Candan, K. S.; Sapino, M. L.; Kintigh, K. W. QUEST: QUery-driven Exploration of Semistructured Data with ConflicTs and Partial Knowledge |
2006 |
7 |
Sep06 |
Benedikt, M.; Bohannon, P.; Bruns, G. Data Cleaning for Decision Support |
2006 |
6 |
Sep06 |
Bhattacharya, I.; Licamele, L.; Getoor, L.; Relational Clustering for Entity Resolution Queries |
2006 |
2 |
Apr07 |
Reuther, P; Walter, B; Ley, M; Weber, A; Klink, S Managing the Quality of Person Names in DBLP |
2006 |
|
Oct07 |
Batini, C.; Scannapieca, M. Data Quality: Concepts, Methodologies and Techniques |
2006 |
|
Oct09 |
Reuther, P Personal Name Matching: New Test Collections and a Social Network based Approach. |
2006 |
|
Oct07 |
Naumann, Felix; Bilke, Alexander; Bleiholder, Jens; Weis, Melanie Data Fusion in Three Steps: Resolving Schema, Tuple, and Value Inconsistencies |
2006 |
|
Apr07 |
Malin, B Re-identification of Familial Database Records. |
2006 |
|
Apr07 |
Mazeika, A.; Bohlen, M.H. Cleansing Databases of Misspelled Proper Nouns |
2006 |
|
Sep06 |
On, BW; Elmacioglu, E; Lee, D; Kang, J; Pei, J Improving Grouped-Entity Resolution using Quasi-Cliques |
2006 |
|
Feb07 |
Lee, D; Kang, J; Mitra, P; Giles, CL; On, BW Are Your Citations Clean? New Scenarios and Challenges in Maintaining Digital Libraries |
2006 |
|
Feb07 |
Lee, D.; Kang, J.; Mitra, P.; Giles, C. Lee; On, B.-W. Large-Scale Citation Matching of Scientific Digital Libraries |
2006 |
|
Mar07 |