Targeting biological questions requires comprehensive evaluation of multiple types of annotations describing current biological knowledge; they are increasingly available, but their fast evolution, heterogeneity and dispersion in many different sources hamper their effective use. Leveraging on innovative flexible data schema and automatic software procedures that support the integration of data sources evolving in number, data content and structure, while assuring quality and provenance tracking of the integrated data, we created a multi-organism Genomic and Proteomic Knowledge Base (GPKB) and easily maintained it updated. From several well-known databases it imports and integrates very numerous gene and protein data, external references and annotations, expressed through multiple biomedical terminologies. To easily query such integrated data, we developed intuitive web interfaces and services for programmatic access to the GPKB; they are publicly available respectively at http://www.bioinformatics. deib.polimi.it/GPKB/ and http://www.bioinformatics.deib.polimi.it/ GPKB-REST/. The created GPKB is a very valuable resource used in several projects by many users; the developed interfaces enhance its relevance to the community by allowing the seamlessly composition of queries, although complex, on all data integrated in the GPKB, which can help unveiling new biomedical knowledge.

Biomolecular annotation integration and querying to help unveiling new biomedical knowledge

CANAKOGLU, ARIF;CERI, STEFANO;MASSEROLI, MARCO
2016-01-01

Abstract

Targeting biological questions requires comprehensive evaluation of multiple types of annotations describing current biological knowledge; they are increasingly available, but their fast evolution, heterogeneity and dispersion in many different sources hamper their effective use. Leveraging on innovative flexible data schema and automatic software procedures that support the integration of data sources evolving in number, data content and structure, while assuring quality and provenance tracking of the integrated data, we created a multi-organism Genomic and Proteomic Knowledge Base (GPKB) and easily maintained it updated. From several well-known databases it imports and integrates very numerous gene and protein data, external references and annotations, expressed through multiple biomedical terminologies. To easily query such integrated data, we developed intuitive web interfaces and services for programmatic access to the GPKB; they are publicly available respectively at http://www.bioinformatics. deib.polimi.it/GPKB/ and http://www.bioinformatics.deib.polimi.it/ GPKB-REST/. The created GPKB is a very valuable resource used in several projects by many users; the developed interfaces enhance its relevance to the community by allowing the seamlessly composition of queries, although complex, on all data integrated in the GPKB, which can help unveiling new biomedical knowledge.
2016
Bioinformatics and Biomedical Engineering
9783319317434
Biomedical ontologies; Biomolecular annotations querying; Heterogeneous and distributed biological data management and integration; Theoretical Computer Science; Computer Science (all)
INF; bioinformatics
File in questo prodotto:
File Dimensione Formato  
IWBBIO_2016_802-816.pdf

Accesso riservato

: Publisher’s version
Dimensione 1.32 MB
Formato Adobe PDF
1.32 MB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11311/1013755
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 0
social impact