Search: Duplicate/matching, no dataset, XML, Sigmod / VLDB, Naumann, F, 2005