Genomic Annotations
In collaboration with colleagues from Princeton University we have been studying the genome of Herpesvirus 5. By visiting this IBM DB2-based repository you can browse through and download the results from our analyses.
Additionally, we make available automatically-generated annotations for the proteomes of more than 120 complete genomes. This genomic repository can be accessed by visiting this web site. Soon, and from the same location, we will be making available a DB2-based system that will allow users to interact with these data and to retrieve information by constructing complex queries.
Bio-Dictionaries for Individual Genomes
In a number of publications, we have presented and discussed the idea of the Bio-Dictionary: the latter is a collection of recurrent amino acid combinations (='seqlets') which completely cover the sequence space defined by the biggest possible collection of amino acid sequences. Normally, we recompute the contents of the Bio-Dictionary on a regular basis, typically once a year. For several research activities, it is also very useful to compute such amino acid combinations by processing smaller input datasets such as the proteome of an individual genome. The following list provides access to Bio-Dictionaries computed from the proteomes of the corresponding genomes:
ARCHAEAL GENOMES
BACTERIAL GENOMES
