Cohen, WW; Ravikumar, P; Fienberg, SE A comparison of string distance metrics for name-matching tasks | 2003 | 1091 | Sep06 |
Bilenko, M; Mooney, RJ Adaptive duplicate detection using learnable string similarity measures | 2003 | 573 | Sep06 |
Gravano, L.; Ipeirotis, P.G.; Jagadish, H.V.; Koudas, N.; Muthukrishnan, S.; Srivastava, D. Approximate string joins in a database (almost) for free | 2001 | 411 | Oct06 |
Dong, X.; Halevy, A.; Madhavan, J. Reference reconciliation in complex information spaces | 2005 | 380 | Sep06 |
Ananthakrishna, R; Chaudhuri, S; Ganti, V Eliminating fuzzy duplicates in data warehouses | 2002 | 334 | Sep06 |
Mann, GS; Yarowsky, D Unsupervised Personal Name Disambiguation | 2003 | 283 | Apr07 |
Pasula, H; Marthi, B; Milch, B; Russell, S; Shpitser, I Identity uncertainty and citation matching | 2003 | 267 | Apr07 |
Tejada, S; Knoblock, CA; Minton, S Learning domain-independent string transformation weights for high accuracy object identification | 2002 | 202 | Sep06 |
Cohen, W.W.; Hirsh, H. Joins that generalize: text classification using Whirl | 1998 | 161 | Sep06 |
Jin, L.; Li, C.; Mehrotra, S. Efficient record linkage in large data sets | 2003 | 154 | Oct06 |
Chaudhuri, Surajit; Ganti, Venkatesh; Motwani, Rajeev Robust Identification of Fuzzy Duplicates | 2005 | 140 | Aug06 |
Maletic, J.I.; Marcus, A. Data Cleansing: Beyond Integrity Analysis | 2000 | 138 | Sep06 |
Gravano, L.; Ipeirotis, P.G.; Koudas, N.; Srivastava, D. Text joins in an RDBMS for web data integration | 2003 | 129 | Oct06 |
Bai, Y.; Wang, F.; Liu, P. Efficiently Filtering RFID Data Streams | 2006 | 114 | Sep06 |
Cohen, WW; Kautz, H; McAllester, D Hardening soft information sources | 2000 | 91 | Sep06 |
Hassell, J.; Aleman-Meza, B.; Arpinar, I.B. Ontology-Driven Automatic Entity Disambiguation in Unstructured Text | 2006 | 86 | Apr07 |
Guha, S.; Koudas, N.; Marathe, A.; Srivastava, D. Merging the Results of Approximate Match Operations | 2004 | 73 | Oct06 |
Whang, Steven Euijong; Menestrina, David; Koutrika, Georgia; Theobald, Martin; Garcia-Molina, Hector Entity resolution with iterative blocking | 2009 | 68 | Sep09 |
Shen, W; Li, X; Doan, AH Constraint-Based Entity Matching | 2005 | 58 | Sep06 |
Hassanzadeh, O; Consens, M Linked movie data base | 2009 | 57 | May10 |
Arasu, Arvind; Ré, Christopher; Suciu, Dan Large-Scale Deduplication with Constraints Using Dedupalog | 2009 | 56 | Sep09 |
Singla, P; Domingos, P Object identification with attribute-mediated dependences | 2005 | 56 | Apr07 |
Menestrina, D.; Benjelloun, O.; Garcia-Molina, H. Generic Entity Resolution with Data Confidences | 2006 | 44 | Sep06 |
Zhao, Huimin; Ram, Sudha Entity identification for heterogeneous database integration: a multiple classifier system approach and empirical evaluation | 2005 | 42 | Oct06 |
On, Byung-Won; Koudas, Nick; Lee, Dongwon; Srivastava, Divesh Group Linkage | 2007 | 40 | Feb07 |
Yan, S; Lee, D; Kan, MY; Giles, CL Adaptive sorted neighborhood methods for efficient record linkage | 2007 | 32 | Nov07 |
Li, Huajing; Councill, Isaac; Lee, Wang-Chien; Giles, C. Lee CiteSeerX: an Architecture and Web Service Design for an Academic Document Search Engine | 2006 | 29 | Feb07 |
Hassanzadeh, Oktie; Chiang, Fei; Miller, Renée; Lee, Hyun Chul Framework for Evaluating Clustering Algorithms in Duplicate Detection | 2009 | 29 | Sep09 |
Zhao, H; Ram, S Combining schema and instance information for integrating heterogeneous data sources | 2007 | 28 | Nov09 |
Raman, V; Hellerstein, J Potters Wheel: An Interactive Framework for Data Cleaning and Transformation | 2001 | 26 | Sep06 |
Chen, Zhaoqi; Kalashnikov, Dmitri V.; Mehrotra, Sharad Exploiting context analysis for combining multiple entity resolution systems | 2009 | 25 | Sep09 |
Yakout, Mohamed; Atallah, Mikhail J.; Elmagarmid, Ahmed K. Efficient Private Record Linkage | 2009 | 22 | Sep09 |
Bolelli, Levent; Ertekin, Seyda; Giles, C. Lee Clustering Scientific Literature Using Sparse Citation Graph Analysis | 2006 | 15 | Feb07 |
Miller, Renee; Kementsietsidis, Anastasios; Lim, Lipyeow; Wang, Min Linkage Query Writer | 2009 | 13 | Sep09 |
Councill, Isaac G.; Giles, C. Lee; Iorio, Ernesto Di; Gori, Marco; Maggini, Marco; Pucci, Augusto Towards Next Generation CiteSeer: A Flexible Architecture for Digital Library Deployment | 2006 | 13 | Feb07 |
Phua, C; Lee, V; Smith, K The Personal Name Problem And a Recommended Data Mining Solution | 2006 | 12 | Apr07 |
Councill, Isaac G.; Li, Huajing; Zhuang, Ziming; Debnath, Sandip; Bolelli, Levent; Lee, Wang-Chien; Sivasubramaniam, Anand; Giles, C. Lee Learning metadata from the evidence in an on-line citation matching scheme | 2006 | 10 | Feb07 |
Kotidis, Y.; Marian, A.; Srivastava, D. Circumventing Data Quality Problems Using Multiple Join Paths | 2006 | 10 | Sep06 |
Wellner, B; Castano, J; Pustejovsky, J Adaptive string similarity metrics for biomedical reference resolution | 2005 | 9 | Feb09 |
Kang, J.; Han, T.S.; Lee, D.; Mitra, P. Establishing value mappings using statistical models and user feedback | 2005 | 9 | Sep06 |
Dai, B. T.; Koudas, N.; Ooi, B. C.; Srivastava, D.; Venkatasubramanian, S. Column Heterogeneity as a Measure of Data Quality | 2006 | 8 | Sep06 |
Qi, Y.; Candan, K. S.; Sapino, M. L.; Kintigh, K. W. QUEST: QUery-driven Exploration of Semistructured Data with ConflicTs and Partial Knowledge | 2006 | 7 | Sep06 |
Lu, Y; Nie, Z; Cheng, T; Gao, Y; Wen, JR Name Disambiguation Using Web Connection | 2007 | 4 | Feb09 |
Chaudhuri, S; Sarma, AD; Ganti, V; Kaushik, R Leveraging aggregate constraints for deduplication | 2007 | | Sep09 |
Silva, Yasin N.; Aref, Walid G.; Ali, Mohamed H. Similarity Group-By | 2009 | | Sep09 |
Chen, Z; Kalashnikov, DV; Mehrotra, S Adaptive graphical approach to entity resolution | 2007 | | Nov07 |
Frigui, Hichem MembershipMap: Data Transformation Based on Membership Aggregation | 2004 | | Oct06 |
On, BW; Elmacioglu, E; Lee, D; Kang, J; Pei, J Improving Grouped-Entity Resolution using Quasi-Cliques | 2006 | | Feb07 |
Bilenko, M; Mooney, RJ On evaluation and training-set construction for duplicate detection | 2003 | | Oct06 |
Singla, P.; Domingos, P. Multi-relational record linkage | 2004 | | Sep06 |