![]() |
Search Options | Help | Site Map | Cultivate Web Site | |||||
|
||||||
| Home | Current Issue | Index of Back Issues |
| Issue 8 Home | Editorial | Features | Regular Columns | News & Events | Misc. | ||
By Les Grivell- November 2002
Les Grivell reports on the EU-funded ORIEL Project, a project focused on digital information management in the biosciences.
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Starting on 1st January 2002 and running for three years, the European Commission is funding ORIEL, a project focused on digital information management in the biosciences (Contract number IST-2001-32688). ORIEL (Online Research Information Environment for the Life Sciences) was formulated as a response to the need of biologists to deal with a growing stream of information generated by genomics, imaging and other data-intensive technologies. ORIEL aims to provide biological research communities with tools to manage large, complex, multimedia datasets and to navigate through an increasingly intricate and potentially confusing information landscape.
Writing in 1985 in a committee report for the US National Academy of Sciences, Morowitz argued that biological research had reached a point where "new generalizations and higher order biological laws are being approached, but may be obscured by the simple mass of data" [1]. His warning was perhaps at that time a little premature, but it has turned out to be truly prophetic. Biology is currently undergoing a revolution.
The development that probably has had most impact on research methodology and perspectives is Genomics. This young and rapidly expanding discipline focuses primarily on the determination and interpretation of the DNA sequences of the entire genetic blueprint (genome) of an organism. Genomics research is characterised by the production of vast amounts of raw and derived data obtained from high throughput technologies that include whole genome gene and protein analysis, systematic protein 3D-structure determination, and real time molecular and cellular imaging. Each of these areas shares a number of features:
To give some impression of the scale of the problem, as of 10th October 2002, the number of DNA sequences deposited in the combined international nucleotide sequence databanks formed by DDBJ (DNA Data Bank of Japan; CIB, Mishima, Japan), EMBL (EBI, Hinxton, UK) and GenBank (NCBI, Bethesda, USA) amounted to a staggering 29.9 gigabases in around 19.7 million records [2]. In 1985, the year of the Morowitz report, the total number of DNA sequence records in the EBI-EMBL nucleotide database was around 5000, while in 2001, the number of entries added to the database per day was around five times this number (see Figure 1 below).
![]() |
| Figure 1 : Exponential growth of DNA sequence data
held in the joint EMBL- GenBank-DDBJ databanks, with a number of landmarks in genomic sequencing |
The integration of the exponentially growing amounts of these data and associated information in the scientific literature is presenting one of the most demanding current challenges to both biologists and information technologists. It is as a response to this challenge to develop improved tools for information management that the ORIEL project [3] was conceived. ORIEL is an acronym for Online Research Information Environment in the Life Sciences. In architecture, an Oriel window is one that projects from an upper floor of a building. Such windows were popular in buildings of the late Gothic and Tudor periods and offered a particularly good view of their surroundings.
| Participant name | Role within the project |
|---|---|
| European Molecular Biology Organisation (EMBO) http://www.embo.org/ |
Scientific and technical coordination, scientific quality control, integration of ORIEL research activities with the functionality of the E-BioSci platform |
| European Molecular Biology Laboratory - the European Bio-informatics Institute
(EMBL-EBI) http://www.ebi.ac.uk/ |
Expertise in bio-informatics and involvement in biological database and ontology development |
| University of Oxford (UOXF-AT) http://www.bioimage.org/ |
Development of The BioImage Database, its population with useful images and its integration with other biological information sources |
| Ingenta UK Ltd (Ingenta) http://www.ingenta.com/ |
Development of integration technologies and web-front ends for complex sites that integrate multiple distributed databases (including the BioImage database). Enhancement of the integration of diverse factual resources with traditional literature and abstracts databases |
| Centre Informatique National de l'Enseignement Supérieur (CINES) http://www.cines.fr/ |
Collaboration with LIRMM in the realisation of adaptive user interfaces; testing of software components and integrative activities preceding wider dissemination |
| Consiglio Nazionale delle Ricerche - Istituto di Tecnologie Biomediche Avanzate and Istituto di Biologia Cellulare (CNR-ITB and CNR-IBC) http://www.itba.mi.cnr.it/ http://www.emmanet.org/ |
Development of tools for gene analysis; integration of gene-mining applications with information retrieved from the scientific literature; implementation in live bioinformatics services, including the Resource database maintained by the CNR-IBC on behalf of the European Mutant Mouse Archive (EMMA) |
| International Centre for Genetic Engineering and Technology (ICGEB) http://www.icgeb.trieste.it/ |
Bioinformatics expertise; development of gene analysis applications and their integration with other tools developed within the project; delivery as service to the research community |
| Centro Nacional de Biotecnologia (CNB-CSIC) http://www.pdg.cnb.uam.es/ |
Bioinformatics expertise; application of information extraction technology to the mining of biological information in text repositories |
| Laboratoire d'Informatique, de Robotique et de Microélectronique de Montpellier (LIRMM) http://www.lirmm.fr/ |
Development of tools for creation and editing of ontologies and collaborative browsing of information resources |
| Institut National de Recherche en Informatique et Automatique (INRIA) http://www.inria.fr/ |
Construction of a core of well-defined XML specifications for the dissemination of biological data |
| Institut de Génétique Humaine UPR CNRS1142 (IGH) http://www.igh.cnrs.fr/ |
Data evolution and transformation using XSLT; Application of XQUERY to the querying of XML biological data |
With EMBO, the European Molecular Biology Organisation, as coordinator, the project combines the skills of a number of partner institutions and organisations in Europe (Table 1), with focus on development of:
Individual components of the project consist of:
A unique feature of the ORIEL project is its degree of integration with the E-BioSci initiative [9] that is funded by the EC as a research infrastructure under its Quality of Life Programme (contract no. QLRI-CT-2001-30266). The E-BioSci platform aims to provide services that will help biologists to find and bring together many different kinds of digital information. It will focus primarily on interlinkage between genomics-related, image and molecular datasets and the relevant research literature, implementing novel technologies for retrieval as these become available. Given the complexity of the biological knowledge base, solutions to many of the issues involved will require significant research effort. ORIEL will play an important role in focusing and inputting such effort during this developmental process. Conversely, the E-BioSci platform will offer ORIEL partners opportunity for rigorous testing of prototype toolkits and other project components in a controlled, live environment, with critical feedback from selected groups of test users.
As mentioned in the introduction to this brief overview, biology is undergoing a revolution. Driven by advances in technology and huge amounts of new information, this revolution is dramatically changing the way in which researchers formulate and test concepts and make their experimental data known to the research community. The tools developed within ORIEL will help biologists meet the challenge of turning these data into useful knowledge.
Les Grivell
ORIEL Project Coordinator
European Molecular Biology Organisation
Meyerhofstrasse 1
69117 Heidelberg
Germany
URL: <http://www.oriel.org/>
URL: <http://www.e-biosci.org/>
Email: les.grivell@embo.org
Phone: +49 6221 889 1501
Fax: +49 6221 889 1210
Les Grivell is manager of EMBO's Electronic Information Programme and coordinates both ORIEL and E-BioSci projects.
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
For citation purposes:
Grivell, L. "The ORIEL Project - biological information management from a new perspective", Cultivate Interactive, issue
8, 15 November 2002
URL: <http://www.cultivate-int.org/issue8/oriel/>
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Related articles:
If you would like to view similar articles to this one click on a key word below:
< - ORIEL - life sciences - biology - information management - genomics - bioinformatics - multi-dimensional images - sequence databases - text retrieval - ontologies - >
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
|
Copyright ©2000 - 2001 Cultivate. | Published by UKOLN | Design by ILRT | Contact Us |