BIO-AJAX: an extensible framework for biological data cleaning

Authors: 
Herbert, KG; Gehani, NH; Piel, WH; Wang, JTL; Wu, CH
Author: 
Herbert, K
Gehani, N
Piel, W
Wang, J
Wu, C
Year: 
2004
Venue: 
ACM SIGMOD Record
URL: 
http://portal.acm.org/citation.cfm?id=1024703
Citations: 
30
Citations range: 
10 - 49
AttachmentSize
Herbert2004BIOAJAXanextensible.pdf2.2 MB

As databases become more pervasive through the biological sciences, various data quality issues regarding data legacy, data uniformity and data duplication arise. Due to the nature of this data, each of these problems is non-trivial. For biological data to be corrected and standardized, new methods and frameworks must be developed. This paper proposes one such framework, called BIO-AJAX, which uses principles from data cleaning to improve data quality in biological information systems, specifically in TreeBASE.