Complete Sequence of a 93.4-kb Contig from Chromosome 3 of Trypanosoma cruzi Containing a Strand-Switch Region |
| |
Authors: | Bj rn Andersson, Lena slund, Martti Tammi, Anh-Nhi Tran, J rg D. Hoheisel, Ulf Pettersson |
| |
Affiliation: | Björn Andersson, Lena Åslund, Martti Tammi, Anh-Nhi Tran, Jörg D. Hoheisel, and Ulf Pettersson |
| |
Abstract: | We have initiated large-scale sequencing of the third smallest chromosome of the CL Brener strain of Trypanosoma cruzi and we report here the complete sequence of a contig consisting of three cosmids. This contig covers 93.4 kb and has been found to contain 20–30 novel genes and several repeat elements, including a novel chromosome 3-specific 400-bp repeat sequence. The intergenic sequences were found to be rich in di- and trinucleotide repeats of varying lengths and also contained several known T. cruzi repeat elements. The sequence contains 29 open reading frames (ORFs) longer than 700 bp, the longest being 5157 bp, and a large number of shorter ORFs. Of the long ORFs, seven show homology to known genes in parasites and other organisms, whereas four ORFs were confirmed by sequencing of cDNA clones. Two shorter ORFs were confirmed by a database homology and a cDNA clone, respectively, and one RNA gene was identified. The identified genes include two copies of the gene for alanine-aminotransferase as well as genes for glucose-6-phosphate isomerase, protein kinases and phosphatases, and an ATP synthase subunit. An interesting feature of the sequence was that the genes appear to be organized in two long clusters containing multiple genes on the same strand. The two clusters are transcribed in opposite directions and they are separated by an ~20-kb long, relatively GC-rich sequence, that contains two large repetitive elements as well as a pseudogene for cruzipain and a gene for U2snRNA. It is likely that this strand switch region contains one or more regulatory and promoter regions. The reported sequence provides the first insight into the genome organization of T. cruzi and shows the potential of this approach for rapid identification of novel genes. |
| |
Keywords: | |
|
|