REFERENCES

1.         Kowalczuk, M., Mackiewicz, P., Gierlik, A., Dudek, M. R. & Cebrat, S. Total number of coding open reading frames in the yeast genome. Yeast 15, 1031-4 (1999).

2.         Harrison, P. M., Kumar, A., Lang, N., Snyder, M. & Gerstein, M. A question of size: the eukaryotic proteome and the problems in defining it. Nucleic Acids Res 30, 1083-90 (2002).

3.         Velculescu, V. E. et al. Characterization of the yeast transcriptome. Cell 88, 243-51 (1997).

4.         Blandin, G. et al. Genomic exploration of the hemiascomycetous yeasts: 4. The genome of Saccharomyces cerevisiae revisited. FEBS Lett 487, 31-6 (2000).

5.         Wood, V., Rutherford, K. M., Ivens, A., Rajandream, M.-A. & Barrell, B. A Re-annotation of the Saccharomyces cerevisiae Genome. Comparative and Functional Genomics 2, 143-54 (2001).

6.         Toda, T. et al. Deletion analysis of the enolase gene (enoA) promoter from the filamentous fungus Aspergillus oryzae. Curr Genet 40, 260-7 (2001).

7.         Bailey, T. L. & Elkan, C. Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc Int Conf Intell Syst Mol Biol 2, 28-36 (1994).

8.         Tavazoie, S., Hughes, J. D., Campbell, M. J., Cho, R. J. & Church, G. M. Systematic determination of genetic network architecture. Nat Genet 22, 281-5 (1999).

9.         Stormo, G. D. DNA binding sites: representation and discovery. Bioinformatics 16, 16-23 (2000).

10.       McGuire, A. M., Hughes, J. D. & Church, G. M. Conservation of DNA regulatory motifs and discovery of new motifs in microbial genomes. Genome Res 10, 744-57 (2000).

11.       Loots, G. G. et al. Identification of a coordinate regulator of interleukins 4, 13, and 5 by cross-species sequence comparisons. Science 288, 136-40 (2000).

12.       Pennacchio, L. A. & Rubin, E. M. Genomic strategies to identify mammalian regulatory sequences. Nat Rev Genet 2, 100-9 (2001).

13.       Oeltjen, J. C. et al. Large-scale comparative sequence analysis of the human and murine Bruton's tyrosine kinase loci reveals conserved regulatory domains. Genome Res 7, 315-29 (1997).

14.       Cliften, P. F. et al. Surveying Saccharomyces genomes to identify functional elements by comparative DNA sequence analysis. Genome Res 11, 1175-86 (2001).

15.       Alm, R. A. et al. Genomic-sequence comparison of two unrelated isolates of the human gastric pathogen Helicobacter pylori. Nature 397, 176-80 (1999).

16.       Carlton, J. M. et al. Genome sequence and comparative analysis of the model rodent malaria parasite Plasmodium yoelii yoelii. Nature 419, 512-9 (2002).

17.       Perrin, A. et al. Comparative genomics identifies the genetic islands that distinguish Neisseria meningitidis, the agent of cerebrospinal meningitis, from other Neisseria species. Infect Immun 70, 7063-72 (2002).

18.       McClelland, M. et al. Comparison of the Escherichia coli K-12 genome with sampled genomes of a Klebsiella pneumoniae and three Salmonella enterica serovars, Typhimurium, Typhi and Paratyphi. Nucleic Acids Res 28, 4974-86 (2000).

19.       Intl_Mouse_Genome_Consortium. Initial sequencing and comparative analysis of the mouse genome. Nature 420, 520-62 (2002).

20.       Galabru, J., Rey-Cuille, M. A. & Hovanessian, A. G. Nucleotide sequence of the HIV-2 EHO genome, a divergent HIV-2 isolate. AIDS Res Hum Retroviruses 11, 873-4 (1995).

21.       Read, T. D. et al. The genome sequence of Bacillus anthracis Ames and comparison to closely related bacteria. Nature 423, 81-6 (2003).

22.       Genome_Sciences_Centre. The complete genome of the SARS associated Coronavirus. Unpublished (2003).

23.       Goffeau, A. et al. Life with 6000 genes. Science 274, 546, 563-7 (1996).

24.       Galagan, J. E. et al. The genome sequence of the filamentous fungus Neurospora crassa. Nature 422, 859-68 (2003).

25.       The_C._elegans_Sequencing_Consortium. Genome sequence of the nematode C. elegans: a platform for investigating biology. Science 282, 2012-8 (1998).

26.       Adams, M. D. et al. The genome sequence of Drosophila melanogaster. Science 287, 2185-95 (2000).

27.       Intl_Human_Genome_Sequencing_Consortium. Initial sequencing and analysis of the human genome. Nature 409, 860-921 (2001).

28.       Arabidopsis_Genome_Initiative. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408, 796-815 (2000).

29.       Kellis, M., Patterson, N., Endrizzi, M., Birren, B. & Lander, E. S. Sequencing and comparison of yeast species to identify genes and regulatory elements. Nature 423, 241-54 (2003).

30.       Fitch, W. M. Uses for evolutionary trees. Philos Trans R Soc Lond B Biol Sci 349, 93-102 (1995).

31.       Fitch, W. M. Distinguishing homologous from analogous proteins. Syst Zool 19, 99-113 (1970).

32.       Tatusov, R. L., Koonin, E. V. & Lipman, D. J. A genomic perspective on protein families. Science 278, 631-7 (1997).

33.       Tatusov, R. L. et al. The COG database: new developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res 29, 22-8 (2001).

34.       Keogh, R. S., Seoighe, C. & Wolfe, K. H. Evolution of gene order and chromosome number in Saccharomyces, Kluyveromyces and related fungi. Yeast 14, 443-57 (1998).

35.       Batzoglou, S. et al. ARACHNE: a whole-genome shotgun assembler. Genome Res 12, 177-89 (2002).

36.       Jaffe, D. B. et al. Whole-genome sequence assembly for mammalian genomes: Arachne 2. Genome Res 13, 91-6 (2003).

37.       Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J Mol Biol 215, 403-10 (1990).

38.       Thompson, J. D., Higgins, D. G. & Gibson, T. J. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22, 4673-80 (1994).

39.       Dujon, B. et al. Complete DNA sequence of yeast chromosome XI. Nature 369, 371-8 (1994).

40.       Sharp, P. M. & Li, W. H. The codon adaptation index--a measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res 15, 1281-95 (1987).

41.       Clark, T. A., Sugnet, C. W. & Ares, M., Jr. Genomewide analysis of mRNA processing in yeast using splicing-specific microarrays. Science 296, 907-10 (2002).

42.       Batzoglou, S., Pachter, L., Mesirov, J. P., Berger, B. & Lander, E. S. Human and mouse gene structure: comparative analysis and application to exon prediction. Genome Res 10, 950-8 (2000).

43.       Hampson, S., Kibler, D. & Baldi, P. Distribution patterns of over-represented k-mers in non-coding yeast DNA. Bioinformatics 18, 513-28 (2002).

44.       Blanchette, M. & Tompa, M. Discovery of regulatory elements by a computational method for phylogenetic footprinting. Genome Res 12, 739-48 (2002).

45.       McCue, L. et al. Phylogenetic footprinting of transcription factor binding sites in proteobacterial genomes. Nucleic Acids Res 29, 774-82 (2001).

46.       Gelfand, M. S., Koonin, E. V. & Mironov, A. A. Prediction of transcription regulatory sites in Archaea by a comparative genomic approach. Nucleic Acids Res 28, 695-705 (2000).

47.       Lawrence, C. E. et al. Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment. Science 262, 208-14 (1993).

48.       Grundy, W. N., Bailey, T. L., Elkan, C. P. & Baker, M. E. Meta-MEME: motif-based hidden Markov models of protein families. Comput Appl Biosci 13, 397-406 (1997).

49.       Hughes, J. D., Estep, P. W., Tavazoie, S. & Church, G. M. Computational identification of cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae. J Mol Biol 296, 1205-14 (2000).

50.       Roth, F. P., Hughes, J. D., Estep, P. W. & Church, G. M. Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitation. Nat Biotechnol 16, 939-45 (1998).

51.       Liu, X., Brutlag, D. L. & Liu, J. S. BioProspector: discovering conserved DNA motifs in upstream regulatory regions of co-expressed genes. Pac Symp Biocomput, 127-38 (2001).

52.       Jiao, K. et al. Phylogenetic footprinting reveals multiple regulatory elements involved in control of the meiotic recombination gene, REC102. Yeast 19, 99-114 (2002).

53.       Tompa, M. Identifying functional elements by comparative DNA sequence analysis. Genome Res 11, 1143-4 (2001).

54.       Blanchette, M., Schwikowski, B. & Tompa, M. Algorithms for phylogenetic footprinting. J Comput Biol 9, 211-23 (2002).

55.       Keegan, L., Gill, G. & Ptashne, M. Separation of DNA binding from the transcription-activating function of a eukaryotic regulatory protein. Science 231, 699-704 (1986).

56.       Zhu, J. & Zhang, M. Q. SCPD: a promoter database of the yeast Saccharomyces cerevisiae. Bioinformatics 15, 607-11 (1999).

57.       Zhang, M. Q. Promoter analysis of co-regulated genes in the yeast genome. Comput Chem 23, 233-50 (1999).

58.       Mewes, H. W. et al. MIPS: a database for genomes and protein sequences. Nucleic Acids Res 27, 44-8 (1999).

59.       Gasch, A. P. & Eisen, M. B. Exploring the conditional coregulation of yeast gene expression through fuzzy k-means clustering. Genome Biol 3, RESEARCH0059 (2002).

60.       Simon, I. et al. Serial regulation of transcriptional regulators in the yeast cell cycle. Cell 106, 697-708 (2001).

61.       Lee, T. I. et al. Transcriptional regulatory networks in Saccharomyces cerevisiae. Science 298, 799-804 (2002).

62.       Mewes, H. W. et al. MIPS: a database for genomes and protein sequences. Nucleic Acids Res 30, 31-4 (2002).

63.       Gavin, A. C. et al. Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature 415, 141-7 (2002).

64.       Dwight, S. S. et al. Saccharomyces Genome Database (SGD) provides secondary gene annotation using the Gene Ontology (GO). Nucleic Acids Res 30, 69-72 (2002).

65.       Mosley, A. L., Lakshmanan, J., Aryal, B. K. & Ozcan, S. Glucose-mediated phosphorylation converts the transcription factor Rgt1 from a repressor to an activator. J Biol Chem (2003).

66.       Lindgren, A. et al. The pachytene checkpoint in Saccharomyces cerevisiae requires the Sum1 transcriptional repressor. EMBO J 19, 6489-97 (2000).

67.       Jacobs Anderson, J. S. & Parker, R. Computational identification of cis-acting elements affecting post-transcriptional control of gene expression in Saccharomyces cerevisiae. Nucleic Acids Res 28, 1604-17 (2000).

68.       Zeitlinger, J. et al. Program-specific distribution of a transcription factor dependent on partner transcription factor and MAPK signaling. Cell 113, 395-404 (2003).

69.       Gardner, M. J. et al. Genome sequence of the human malaria parasite Plasmodium falciparum. Nature 419, 498-511 (2002).

70.       Wolfe, K. H. & Shields, D. C. Molecular evidence for an ancient duplication of the entire yeast genome. Nature 387, 708-13 (1997).

71.       Fischer, G., Neuveglise, C., Durrens, P., Gaillardin, C. & Dujon, B. Evolution of gene order in the genomes of two related yeast species. Genome Res 11, 2009-19 (2001).

72.       Fischer, G., James, S. A., Roberts, I. N., Oliver, S. G. & Louis, E. J. Chromosomal evolution in Saccharomyces. Nature 405, 451-4 (2000).

73.       Dunham, M. J. et al. Characteristic genome rearrangements in experimental evolution of Saccharomyces cerevisiae. Proc Natl Acad Sci U S A 99, 16144-9 (2002).

74.       Bon, E. et al. Genomic exploration of the hemiascomycetous yeasts: 5. Saccharomyces bayanus var. uvarum. FEBS Lett 487, 37-41 (2000).

75.       Haber, J. E. Mating-type gene switching in Saccharomyces cerevisiae. Annu Rev Genet 32, 561-99 (1998).

76.       Hurst, L. D. The Ka/Ks ratio: diagnosing the form of sequence evolution. Trends Genet 18, 486 (2002).

77.       Chu, S. et al. The transcriptional program of sporulation in budding yeast. Science 282, 699-705 (1998).

78.       True, H. L. & Lindquist, S. L. A yeast prion provides a mechanism for genetic variation and phenotypic diversity. Nature 407, 477-83 (2000).

79.       Koufopanou, V., Goddard, M. R. & Burt, A. Adaptation for horizontal transfer in a homing endonuclease. Mol Biol Evol 19, 239-46 (2002).