Managing source schema evolution in web warehouses

Authors: 
Marotta, A; Motz, R; Ruggia, R
Author: 
Marotta, A
Motz, R
Ruggia, R
Year: 
2002
Venue: 
Journal of the Brazilian Computer Society
URL: 
http://www.scielo.br/scielo.php?pid=S0104-65002002000200003&script=sci_arttext&tlng=en
Citations: 
14
Citations range: 
10 - 49

Web Data Warehouses have been introduced to enable the analysis of integrated Web data. One of the main challenges in these systems is to deal with the volatile and dynamic nature of Web sources. In this work we address the effects of adding/removing/changing Web sources and data items to the Data Warehouse (DW) schema. By managing source evolution we mean the automatic propagation of these changes to the DW. The proposed approach is based on a wrapper/mediator architecture, which reduces the impact of Web source changes on the DW schema. This paper presents this architecture and analyses some selected evolution cases in the context of Web DW.