Cohen, WW; Ravikumar, P; Fienberg, SE A comparison of string distance metrics for name-matching tasks |
2003 |
1091 |
Sep06 |
Gravano, L.; Ipeirotis, P.G.; Jagadish, H.V.; Koudas, N.; Muthukrishnan, S.; Srivastava, D. Approximate string joins in a database (almost) for free |
2001 |
411 |
Oct06 |
Ananthakrishna, R; Chaudhuri, S; Ganti, V Eliminating fuzzy duplicates in data warehouses |
2002 |
334 |
Sep06 |
Tejada, S; Knoblock, CA; Minton, S Learning domain-independent string transformation weights for high accuracy object identification |
2002 |
202 |
Sep06 |
Chaudhuri, S.; Ganti, V.; Kaushik, R. A Primitive Operator for Similarity Joins in Data Cleaning |
2006 |
201 |
Oct06 |
Cohen, W.W.; Hirsh, H. Joins that generalize: text classification using Whirl |
1998 |
161 |
Sep06 |
Chaudhuri, Surajit; Ganti, Venkatesh; Motwani, Rajeev Robust Identification of Fuzzy Duplicates |
2005 |
140 |
Aug06 |
Gravano, L.; Ipeirotis, P.G.; Koudas, N.; Srivastava, D. Text joins in an RDBMS for web data integration |
2003 |
129 |
Oct06 |
Bai, Y.; Wang, F.; Liu, P. Efficiently Filtering RFID Data Streams |
2006 |
114 |
Sep06 |
Cong, Gao; Fan, Wenfei; Geerts, Floris; Jia, Xibei; Ma, Shuai; Improving Data Quality: Consistency and Accuracy |
2007 |
107 |
Jan08 |
Cohen, WW; Kautz, H; McAllester, D Hardening soft information sources |
2000 |
91 |
Sep06 |
Guha, S.; Koudas, N.; Marathe, A.; Srivastava, D. Merging the Results of Approximate Match Operations |
2004 |
73 |
Oct06 |
Koudas, N.; Marathe, A.; Srivastava, D. Flexible string matching against large databases in practice |
2004 |
71 |
Oct06 |
Arasu, Arvind; Ré, Christopher; Suciu, Dan Large-Scale Deduplication with Constraints Using Dedupalog |
2009 |
56 |
Sep09 |
On, Byung-Won; Koudas, Nick; Lee, Dongwon; Srivastava, Divesh Group Linkage |
2007 |
40 |
Feb07 |
Pinheiro, J.C.; Sun, D.X. Methods for linking and mining massive heterogeneous databases |
1998 |
33 |
Sep06 |
Hassanzadeh, Oktie; Chiang, Fei; Miller, Renée; Lee, Hyun Chul Framework for Evaluating Clustering Algorithms in Duplicate Detection |
2009 |
29 |
Sep09 |
Chaudhuri, S.; Ganjam, K.; Ganti, V.; Kapoor, R.; Narasayya, V.; Vassilakis, T. Data cleaning in microsoft SQL server 2005 |
2005 |
26 |
Sep06 |
Chen, Zhaoqi; Kalashnikov, Dmitri V.; Mehrotra, Sharad Exploiting context analysis for combining multiple entity resolution systems |
2009 |
25 |
Sep09 |
Baumgartner, Robert; Gottlob, Georg; Herzog, Marcus Scalable Web Data Extraction for Online Market Intelligence |
2009 |
21 |
Sep09 |
Neiling, M; Jurk, S; Lenz, HJ; Naumann, F Object Identification Quality |
2003 |
19 |
Oct06 |
Branting, LK A comparative evaluation of name-matching algorithms |
2003 |
16 |
Apr07 |
Chaudhuri, Surajit; Ganti, Venkatesh; Xin, Dong Mining Document Collections to Facilitate Accurate Approximate Entity Matching |
2009 |
15 |
Sep09 |
Miller, Renee; Kementsietsidis, Anastasios; Lim, Lipyeow; Wang, Min Linkage Query Writer |
2009 |
13 |
Sep09 |
Arasu, Arvind; Kaushik, Raghav A grammar-based entity representation framework for data cleaning |
2009 |
12 |
Sep09 |
Kotidis, Y.; Marian, A.; Srivastava, D. Circumventing Data Quality Problems Using Multiple Join Paths |
2006 |
10 |
Sep06 |
Borthwick, A; Buechi, M; Goldberg, A Key Concepts in the ChoiceMaker 2 Record Matching System |
2003 |
8 |
Apr07 |
Dai, B. T.; Koudas, N.; Ooi, B. C.; Srivastava, D.; Venkatasubramanian, S. Column Heterogeneity as a Measure of Data Quality |
2006 |
8 |
Sep06 |
Arasu, A; Chaudhuri, S; Ganjam, K; Kaushik, R Incorporating string transformations in record matching |
2008 |
7 |
Sep09 |
Lengu, R; Missier, P; Fernandes, AAA; G Guerrini, M .. Time-completeness trade-offs in record linkage using Adaptive Query Processing |
2009 |
5 |
Feb09 |
Lu, Y; Nie, Z; Cheng, T; Gao, Y; Wen, JR Name Disambiguation Using Web Connection |
2007 |
4 |
Feb09 |
Chaudhuri, S; Sarma, AD; Ganti, V; Kaushik, R Leveraging aggregate constraints for deduplication |
2007 |
|
Sep09 |
Silva, Yasin N.; Aref, Walid G.; Ali, Mohamed H. Similarity Group-By |
2009 |
|
Sep09 |
Chaudhuri, S; Dayal, U An overview of data warehousing and OLAP technology |
1997 |
|
Sep06 |