首页 | 本学科首页   官方微博 | 高级检索  
     


A comprehensive compilation of 1001 nucleotide sequences coding for proteins from the yeast Saccharomyces cerevisiae (=ListA2)
Authors:Marie-Odile Mossé  Patrick Linder  Jaga Lazowska  Piotr P. Slonimski
Affiliation:(1) Centre de Génétique Moléculaire, Laboratoire propre du CNRS associé à l'Université Pierre et Marie Curie, F-91190 Gif-sur Yvette, France;(2) Department of Microbiology, Biozentrum, 70 Klingelbergstrasse, CH-4056 Basel, Switzerland
Abstract:Summary The amount of nucleotide sequence data is increasing exponentially. We therefore continued our effort to make a comprehensive database for the yeast Saccharomyces cerevisiae. In this database (ListA2) we have compiled 1001 protein coding sequences from this organism. Each sequence has been attributed a single genetic name and in the case of allelic duplicated sequences, synonyms are given, if necessary. For the nomenclature we have introduced a standard principle for naming gene sequences based on priority rules. We have also applied a simple method to distinguish duplicated sequences of one and the same gene from non-allelic sequences of duplicated genes. By using these principles we have sorted out a lot of confusion in the literature and databanks. Along with the genetic name, the mnemonic from the EMBL databank, the codon bias, reference of the publication of the sequence and the EMBL accession numbers are included for each entry. The database is available on request.
Keywords:Yeast  Open reading frames  Database  Genetic nomenclature  Codon bias  Duplicated genes
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号