A comprehensive compilation of 1001 nucleotide sequences coding for proteins from the yeast Saccharomyces cerevisiae (=ListA2) |
| |
Authors: | Marie-Odile Mossé Patrick Linder Jaga Lazowska Piotr P. Slonimski |
| |
Affiliation: | (1) Centre de Génétique Moléculaire, Laboratoire propre du CNRS associé à l'Université Pierre et Marie Curie, F-91190 Gif-sur Yvette, France;(2) Department of Microbiology, Biozentrum, 70 Klingelbergstrasse, CH-4056 Basel, Switzerland |
| |
Abstract: | Summary The amount of nucleotide sequence data is increasing exponentially. We therefore continued our effort to make a comprehensive database for the yeast Saccharomyces cerevisiae. In this database (ListA2) we have compiled 1001 protein coding sequences from this organism. Each sequence has been attributed a single genetic name and in the case of allelic duplicated sequences, synonyms are given, if necessary. For the nomenclature we have introduced a standard principle for naming gene sequences based on priority rules. We have also applied a simple method to distinguish duplicated sequences of one and the same gene from non-allelic sequences of duplicated genes. By using these principles we have sorted out a lot of confusion in the literature and databanks. Along with the genetic name, the mnemonic from the EMBL databank, the codon bias, reference of the publication of the sequence and the EMBL accession numbers are included for each entry. The database is available on request. |
| |
Keywords: | Yeast Open reading frames Database Genetic nomenclature Codon bias Duplicated genes |
本文献已被 SpringerLink 等数据库收录! |
|