Web taxonomy integration through co-bootstrapping

Authors: 
Zhang, D; Lee, WS
Author: 
Zhang, D
Lee, WS
Year: 
2004
Venue: 
Proc. ACM SIGIR
URL: 
http://portal.acm.org/citation.cfm?id=1008992.1009062
Citations: 
24
Citations range: 
10 - 49
AttachmentSize
Zhang2004Webtaxonomyintegrationthroughcobootstrapping.pdf186 KB

We address the problem of integrating objects from a source taxonomy into a master taxonomy. This problem is not only currently pervasive on the web, but also important to the emerging semantic web. A straightforward approach to automating this process would be to learn a classifier that can classify objects from the source taxonomy into categories of the master taxonomy. The key insight is that the availability of the source taxonomy data could be helpful to build better classifiers for the master taxonomy if their categorizations have some semantic overlap. In this paper, we propose a new approach, co-bootstrapping, to enhance the classification by exploiting such implicit knowledge. Our experiments with real-world web data show substantial improvements in the performance of taxonomy integration.