Integrative warehousing of biomolecular information to support complex multi-topic queries for biomedical knowledge discovery
Canakoglu, Arif; Masseroli, Marco; Ceri, Stefano; Tettamanti, Luca; Ghisalberti, Giorgio; Campi, Alessandro
2013-01-01
Abstract
Biomedical questions are often complex and address multiple topics simultaneously. Answering them requires the comprehensive evaluation of several different types of data. Such data are often available, but in distributed and heterogeneous sources, which hampers their global evaluation. We developed a software architecture to create and keep up to date a Genomic and Proteomic Data Warehouse (GPDW), which integrates several of the main such dispersed data. It uses a modular, multi-level global data schema based on abstraction and generalization of the integrated data features. Such a schema eases the integration of data sources that evolve in content, structure and number, and ensures provenance tracking of all the integrated data. Thanks to the developed software architecture and the adopted data schema, the GPDW has been easily kept up to date and progressively extended with additional data types and sources; it is publicly usable at http://www.bioinformatics.dei.polimi.it/GPKB/.