Abstract: | The ecology and genetic diversity of the model yeast Saccharomyces cerevisiae before human domestication remain poorly understood. Taiwan is regarded as part of this yeast''s geographic birthplace, where the most divergent natural lineage was discovered. Here, we extensively sampled the broadleaf forests across this continental island to probe the ancestral species’ diversity. We found that S. cerevisiae is distributed ubiquitously at low abundance in the forests. Whole-genome sequencing of 121 isolates revealed nine distinct lineages that diverged from Asian lineages during the Pleistocene, when a transient continental shelf land bridge connected Taiwan to other major landmasses. Three lineages are endemic to Taiwan and six are widespread in Asia, making this region a focal biodiversity hotspot. Both ancient and recent admixture events were detected between the natural lineages, and a genetic ancestry component associated with isolates from fruits was detected in most admixed isolates. Collectively, Taiwanese isolates harbor genetic diversity comparable to that of the whole Asia continent, and different lineages have coexisted at a fine spatial scale even on the same tree. Patterns of variations within each lineage revealed that S. cerevisiae is highly clonal and predominantly reproduces asexually in nature. We identified different selection patterns shaping the coding sequences of natural lineages and found fewer gene family expansion and contractions that contrast with domesticated lineages. This study establishes that S. cerevisiae has rich natural diversity sheltered from human influences, making it a powerful model system in microbial ecology.The yeast genus Saccharomyces, which includes S. cerevisiae, is a powerful model system for revealing patterns of genomic variation underlying reproductive isolation and adaptation in eukaryotic microorganisms. Surveys of population genetic data have been used in S. cerevisiae to date the origin of key domestication events (Gallone et al. 2016; Duan et al. 2018; Peter et al. 2018), to determine life cycle frequencies in nature (Tsai et al. 2008), to determine the genomic basis of adaptation at continental scale (Duan et al. 2018; Peter et al. 2018), and, more recently, to establish its geographical origin and dispersal history (Xia et al. 2017). Phylogenomic analyses of the Saccharomyces sensu stricto complex and extensive sequencing of collections across the world suggest that S. cerevisiae originated in East Asia (Duan et al. 2018; Peter et al. 2018). The 1011 Genome Project—the most broad large-scale yeast population genomic study—discovered that three wild isolates from Taiwan showed an unprecedented high genetic diversity compared with populations from the rest of the world (Peter et al. 2018). Population genomics of 266 domestic and wild isolates in China revealed six wild lineages from primeval forests. The newly identified CHN-IX group represents the most diverged lineage (Duan et al. 2018). Isolates from this group and the three Taiwanese isolates were grouped into a single lineage that showed a disjunct geographic distribution (Bendixsen et al. 2021). Although considerable knowledge is available on the biogeography and population genetics of plants and animals across continents (Whittaker et al. 2017), little is known about how eukaryotic microorganisms such as S. cerevisiae disperse, establish, reproduce, and persist in nature (Liti 2015).Most S. cerevisiae biology has been based on experiments on a handful of laboratory domesticated strains, but comprehensive analyses of the ecology and evolutionary biology of S. cerevisiae in the wild are still unavailable. In nature, S. cerevisiae have been isolated from the bark, fruits, surrounding soil, and leaves of plants belonging to several different families (Naumov et al. 2013), with early reports suggesting that the yeast is most successfully isolated from the oak family Fagaceae (Sniegowski et al. 2002; Sampaio and Gonçalves 2008; Wang et al. 2012). S. cerevisiae contains high genetic diversity in certain populations, including lineage-specific variants that display clear population structures (Barnett 1992; Wang et al. 2012; Cromie et al. 2013; Strope et al. 2015; Gallone et al. 2016; Gonçalves et al. 2016; Zhu et al. 2016; Duan et al. 2018; Legras et al. 2018; Peter et al. 2018) and explain phenotypic variance similar to common variants (Fournier et al. 2019). Samples from natural habitats tend to be homozygous diploids forming unique populations with minimal genetic admixture, whereas lineages associated with human activities were likely heterozygous, containing higher ploidy and greater genetic admixture leading to a mosaic genome makeup (Diezmann and Dietrich 2009; Liti et al. 2009; Wang et al. 2012; Almeida et al. 2015). The diverse natural lineages of S. cerevisiae present in East Asia provide an excellent opportunity to study the natural diversity of this species, which was previously believed to be fully domesticated (Fay and Benavides 2005).Taiwan is a continental shelf island with the fifth highest tree density in the world (Crowther et al. 2015). Among the 13 climate-related forests types in Taiwan, five are Fagaceae-dominated natural forests on low- and mid-elevation mountains (Li et al. 2013), thus a potentially ideal natural habitat for S. cerevisiae. Taiwan also harbors a high phylogenetic diversity of flowering plants (53 out of 64 angiosperm orders present under the APG IV classification system) (Lin and Chung 2017) and endemism compared with other oceanic islands (Hsieh 2002), raising the possibility that the associated microbial populations are genetically different from their continental counterparts. Here, we set out to characterize the intra-genetic diversity, relative abundance, and distribution of S. cerevisiae in Taiwanese forests over 4 yr of broad sampling. Our study provides novel insights of the predomestication phase of S. cerevisiae and broadens our understanding of the ecological and biogeographic implications before anthropogenic impacts. |