EST Contigs and unigene sets
Although extensive EST data for a variety of plants are available in the databases, many gene transcripts are represented redundantly. Available ESTs have therefore been assembled into clusters of contiguous, overlapping sequences, which are described as contigs. ESTs that occur singly and cannot be assembled into contigs are described as singletons. The combined set of contigs and singletons is described as a unigene set, which represents the minimum number of genes; in rare cases, however, the same gene may be split between two contigs when an EST that would overlap both is missing from the database. Unigene sets have been constructed for many species, including Arabidopsis, tomato, rice, sorghum and bread wheat.
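The grouping of ESTs into contigs and singletons can be illustrated with a small sketch. It is a toy model and not the procedure used to build any published unigene set: it assumes error-free reads on the same strand, treats two ESTs as overlapping only when a suffix of one exactly matches a prefix of the other, and uses a made-up 50-base "gene" and hypothetical names (has_overlap, build_unigene_set, est1 to est6) chosen for illustration.

```python
def has_overlap(a, b, min_len):
    """True if a suffix of one read exactly matches a prefix of the other
    over at least min_len bases (toy criterion: same strand, no errors)."""
    def suffix_prefix(x, y):
        for k in range(min(len(x), len(y)), min_len - 1, -1):
            if x[-k:] == y[:k]:
                return True
        return False
    return suffix_prefix(a, b) or suffix_prefix(b, a)


def build_unigene_set(ests, min_len=6):
    """Group ESTs (dict: name -> sequence) into contigs and singletons."""
    names = sorted(ests)
    parent = {n: n for n in names}  # union-find forest over EST names

    def find(n):
        while parent[n] != n:
            parent[n] = parent[parent[n]]  # path halving
            n = parent[n]
        return n

    # Link every pair of ESTs that overlap; linked ESTs end up in one cluster.
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if has_overlap(ests[a], ests[b], min_len):
                parent[find(a)] = find(b)

    clusters = {}
    for n in names:
        clusters.setdefault(find(n), []).append(n)

    contigs = [m for m in clusters.values() if len(m) > 1]
    singletons = [m[0] for m in clusters.values() if len(m) == 1]
    return contigs, singletons


# Hypothetical transcript; est1..est5 are overlapping fragments of it.
gene = "ACGTTGCAAGCTTAGGCATCCGATTCAGGTACCTGAAGTCTGGACTATCG"
ests = {
    "est1": gene[0:14],
    "est2": gene[8:22],
    "est3": gene[16:30],          # the only EST linking est2 and est4
    "est4": gene[24:38],
    "est5": gene[32:50],
    "est6": "GGGCCCAAATTTGGCC",   # unrelated transcript, stays a singleton
}

contigs, singletons = build_unigene_set(ests)
print("unigenes:", len(contigs) + len(singletons))

# Remove the bridging EST: the same gene now falls into two contigs.
without_bridge = {k: v for k, v in ests.items() if k != "est3"}
contigs2, singletons2 = build_unigene_set(without_bridge)
print("unigenes without est3:", len(contigs2) + len(singletons2))
```

With all six ESTs the sketch yields one contig and one singleton, a unigene set of two. Once est3 is removed, the five remaining ESTs form two contigs plus the singleton, so the single hypothetical gene is counted twice, which is the rare situation described above.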


