Logo
 Home | Sitemap | Contact us | Search | Language
Left Right
Home >> Biotechnology and Genomics >> Bioinformatics and Data Mining - In Silico Biology >> Bioinformatics ans Data Mining - In Silico biology Introduction

Bioinformatics and data mining Introduction

Bioinformatics and data mining are two new areas of research, which involve computer­assisted management of data that is generated for biotechnology applications. Although bioinforrnatics and data mining are currently treated as two independent areas, these will merge and will eventually become indistinguishable.

These two areas of research are being practised worldwide by academic groups, biotechnology companies, and by national and international consortia to make the best use of data that are fast accumulating in massive databases. For instance, data on molecular markers and molecular maps of the genomes of a large number of animals and plants were generated during 1980-2000. and are now available in the databases.

In February, 2001, rough drafts of human genome sequences were also released both by the Human Genome Project and an individual genomics company (Celera Genomics). The genomes of four eukaryotes including budding yeast (Saccharomyces cerevisiae),a nematode (Coenorhabditis elegans), the fruitfly (Drosophila melanogaster)and a higher plant (Arabidopsis thaliana) were also fully sequenced by the turp of the 20th century.

The genome of fission yeast (Schizo saccharomyces pombe) was also published in early 2002. Rough drafts of rice genome sequences have also been produced by two private companies. Monsanto and Syngenta and also by a group in China (see Science, April 5, 2002) and the fully refined sequence of the genome of this crop will become available by the year 2003. These sequences are available in the databases and are being used for assigning meaning to each of these sequences.

Data on proteins their amino acid sequences and three dimensional structures are also being generated at an accelerated pace and are being utilized for discovery of genes and for assigning functions to these genes.The ability to collect  store analyze utilize and distribute biological information derived from DNA/protein sequencing analyses is fundamental to modem biotechnology.

In view of this, funding agencies biotechnology companies and academic institutions are joining hands to help each other. The real long-term value of bioinformatics and data mining capabilities does not lie so much in developing tools, as it does in converting knowledge or information into products, like therapeutics, animal and plant bioreactors in the form of genetically modified organisms (GMOs), newly discovered genes, etc.

In view of the significance of the use of computers in biosciences, a new journal CABIOS (Computer Applications in Biosciences) was also launched in 1984, which was later named Bioinformatics, published by Oxford University Press. In 1990, the Bioinformatics and Genomics Research conference series was also initiated, which evolved into a forum for discussions and exchange of ideas among scientists and commercial entrepreneurs about recent developments and future needs in the field of bioinformatics and genomics.

The seventh such international conference was held in 1999. A new area of biology described as 'in silico biology' is also emerging due to the solutions, which can be found with the increasing use of computers in biology.

The tidal wave of data accumulation will continue for many years in the 21st century. It will include not only updated sequence information about genes and proteins, but will also contain information on functional annotations of genomic sequences, disease correlations, population variation and other information relevant to biotechnology.

The use of these data will depend upon the available software and the development of new software for data storage and analysis as well as for data mining. Therefore, every student studying biotechnology will need to know about the techniques and tools of bioinformatics and data mining. Different aspects of these two new and related areas of biotechnology.

 

Left Right