Search: com

34 results

Results

Title/Author Year Citations Added
Cohen, WW; Ravikumar, P; Fienberg, SE
A comparison of string distance metrics for name-matching tasks
2003 1091 Sep06
Gravano, L.; Ipeirotis, P.G.; Jagadish, H.V.; Koudas, N.; Muthukrishnan, S.; Srivastava, D.
Approximate string joins in a database (almost) for free
2001 411 Oct06
Ananthakrishna, R; Chaudhuri, S; Ganti, V
Eliminating fuzzy duplicates in data warehouses
2002 334 Sep06
Tejada, S; Knoblock, CA; Minton, S
Learning domain-independent string transformation weights for high accuracy object identification
2002 202 Sep06
Chaudhuri, S.; Ganti, V.; Kaushik, R.
A Primitive Operator for Similarity Joins in Data Cleaning
2006 201 Oct06
Cohen, W.W.; Hirsh, H.
Joins that generalize: text classification using WHIRL
1998 161 Sep06
Chaudhuri, Surajit; Ganti, Venkatesh; Motwani, Rajeev
Robust Identification of Fuzzy Duplicates
2005 140 Aug06
Gravano, L.; Ipeirotis, P.G.; Koudas, N.; Srivastava, D.
Text joins in an RDBMS for web data integration
2003 129 Oct06
Bai, Y.; Wang, F.; Liu, P.
Efficiently Filtering RFID Data Streams
2006 114 Sep06
Cong, Gao; Fan, Wenfei; Geerts, Floris; Jia, Xibei; Ma, Shuai
Improving Data Quality: Consistency and Accuracy
2007 107 Jan08
Cohen, WW; Kautz, H; McAllester, D
Hardening soft information sources
2000 91 Sep06
Guha, S.; Koudas, N.; Marathe, A.; Srivastava, D.
Merging the Results of Approximate Match Operations
2004 73 Oct06
Koudas, N.; Marathe, A.; Srivastava, D.
Flexible string matching against large databases in practice
2004 71 Oct06
Arasu, Arvind; Ré, Christopher; Suciu, Dan
Large-Scale Deduplication with Constraints Using Dedupalog
2009 56 Sep09
On, Byung-Won; Koudas, Nick; Lee, Dongwon; Srivastava, Divesh
Group Linkage
2007 40 Feb07
Pinheiro, J.C.; Sun, D.X.
Methods for linking and mining massive heterogeneous databases
1998 33 Sep06
Hassanzadeh, Oktie; Chiang, Fei; Miller, Renée; Lee, Hyun Chul
Framework for Evaluating Clustering Algorithms in Duplicate Detection
2009 29 Sep09
Chaudhuri, S.; Ganjam, K.; Ganti, V.; Kapoor, R.; Narasayya, V.; Vassilakis, T.
Data cleaning in Microsoft SQL Server 2005
2005 26 Sep06
Chen, Zhaoqi; Kalashnikov, Dmitri V.; Mehrotra, Sharad
Exploiting context analysis for combining multiple entity resolution systems
2009 25 Sep09
Baumgartner, Robert; Gottlob, Georg; Herzog, Marcus
Scalable Web Data Extraction for Online Market Intelligence
2009 21 Sep09
Neiling, M; Jurk, S; Lenz, HJ; Naumann, F
Object Identification Quality
2003 19 Oct06
Branting, LK
A comparative evaluation of name-matching algorithms
2003 16 Apr07
Chaudhuri, Surajit; Ganti, Venkatesh; Xin, Dong
Mining Document Collections to Facilitate Accurate Approximate Entity Matching
2009 15 Sep09
Miller, Renee; Kementsietsidis, Anastasios; Lim, Lipyeow; Wang, Min
Linkage Query Writer
2009 13 Sep09
Arasu, Arvind; Kaushik, Raghav
A grammar-based entity representation framework for data cleaning
2009 12 Sep09
Kotidis, Y.; Marian, A.; Srivastava, D.
Circumventing Data Quality Problems Using Multiple Join Paths
2006 10 Sep06
Borthwick, A; Buechi, M; Goldberg, A
Key Concepts in the ChoiceMaker 2 Record Matching System
2003 8 Apr07
Dai, B. T.; Koudas, N.; Ooi, B. C.; Srivastava, D.; Venkatasubramanian, S.
Column Heterogeneity as a Measure of Data Quality
2006 8 Sep06
Arasu, A; Chaudhuri, S; Ganjam, K; Kaushik, R
Incorporating string transformations in record matching
2008 7 Sep09
Lengu, R; Missier, P; Fernandes, AAA; G Guerrini, M ..
Time-completeness trade-offs in record linkage using Adaptive Query Processing
2009 5 Feb09
Lu, Y; Nie, Z; Cheng, T; Gao, Y; Wen, JR
Name Disambiguation Using Web Connection
2007 4 Feb09
Chaudhuri, S; Sarma, AD; Ganti, V; Kaushik, R
Leveraging aggregate constraints for deduplication
2007 Sep09
Silva, Yasin N.; Aref, Walid G.; Ali, Mohamed H.
Similarity Group-By
2009 Sep09
Chaudhuri, S; Dayal, U
An overview of data warehousing and OLAP technology
1997 Sep06