Logo
 Home | Sitemap | Contact us | Search | Language
Left Right
Home >> Biotechnology and Genomics >> Bioinformatics and Data Mining - In Silico Biology >>GenBank Flybase, DDBJ, RefSeq

GenBank, Flybase, DDBJ, RefSeq.

GenBank is a database, that contains every published DNA sequence. It can be used to search for a given DNA sequence, with the objective to assign meaning to this given sequence. Database like GenBank have revolutionized biology, providing researchers with powerful tool to hunt for new genes, compare the way the genes have evolved in many different organisms and to figure out the functions of newly discovered genes. However, according to some surveys conducted, genomics experts estimated that some 2% of GenBank entries may contain sequences of DNA that are introduced due to experimental errors. In many cases, the annotations in the GenBank are either not verified or not up to data and the error propagation is explosive. Therefore, according to some bioinformaticians and biologists, GenBank is full of mistakes, and so are the other databases like Flybase (fruitfully database), DDBJ (Database of Japan), PDB (protein Data Bank), Swiss-Prot (protein data bank at EBI, and SIB = Swiss institute of Bioinformatics).

As a consequence of increasing number of mistakes, the utility of GenBank declined gradually during 1995-2000. However, efforts are being made to tackle these problems of errors in the databases (e.g. RefSeq of NCBI) are being set up.

A division of GenBank is also specifically devoted to the management of EST (expressed sequence tag) sequences and is known as EST database (dbEST). NCBI has also developed alogorithms to assign ESTs with sequence similarity to assign ESTs with sequence similarity to clusters, forming the basis of the UniGene database (http://www.ncbi.nlm.nih.gov/UniGene/). Recently, more than 30,000 of the Unigenes belonging to human genome have been mapped, facilitating positional cloning projects

 

Left Right