Published online by Cambridge University Press: 01 October 1999
The three trypanosomatid genome projects have employed common strategies which include: analysis of pulsed-field gel electrophoretic chromosomal karyotypes; physical mapping using big DNA (cosmid, pacmid P1, bacterial artificial chromosome, yeast artificial chromosome) libraries; partial cDNA sequence analysis to develop sets of expressed sequence tags (ESTs) for gene discovery and use as markers in physical mapping; genomic sequencing; dissemination of information through development of web-sites and ACeDB-based fully integrated databases; and establishment of functional genomics programmes to maximize useful application of genome data. Highlights of the projects to date have been the demonstration that, despite extensive chromosomal size polymorphisms for diploid homologues within Africa trypanosomes, T. cruzi or Leishmania, the physical linkage groups for markers on each chromosome are retained across all isolates/species studied within each group. For African trypanosomes, detailed analysis of chromosome 1 has demonstrated that repetitive sequences and the two retroposon-like elements RIME and INGI are localized to a defined region at one end of the chromosome, with the bulk of the central region of the chromosome containing genes coding for expressed proteins. Comparative mapping shows that, although subtelomeric changes account for a large proportion of the polymorphism in chromosome size in African trypanosomes, there are significant expansions and contractions in regions across the entire chromosome. The highlight of the genomic sequencing projects has been the demonstration of just 2 putative transcriptional units of chromosome 1 of Leishmania major, extending on opposite strands from a point in the central region of the chromosome. A similar observation made on 93·4 kb of contiguous sequence for T. cruzi chromosome 3 suggests the presence of promoter and regulatory elements at the junctions of large polycistronic transcriptional units. All data obtained from the genome projects are made available through the public domain, which has prompted changing philosophies in how we approach analysis of the biology of these organisms, and strategies that we can employ now in the search for new therapies and vaccines.