posted on 2025-07-31, 23:50authored byAC Gregory, AA Zayed, N Conceição-Neto, B Temperton, B Bolduc, A Alberti, M Ardyna, K Ardyna, M Arkhipova, C Cruaud, C Dimier, J Ferland, S Kandels-Lewis, Y Liu, C Marec, S Pesant, M Picheral, S Pisarev, J Poulain, J-E Tremblay, D Vik, Tara Oceans coordinators, M Babin, C Bowler, C de Vargas, BE Dutilh, D Iudicone, L Karp-Boss, S Roux, S Sunagawa, P Wincker, MB Sullivan
Microbes drive most ecosystems and are modulated by viruses that impact their lifespan, gene flow and metabolic outputs. However, the influence of viral community diversity at the ecosystem level remains difficult to assess due to classification issues and few reference genomes. Here we establish a ~12-fold expanded global ocean virome dataset of 195,728 viral populations, now including the Arctic Ocean, and validate that these populations form discrete genotypic clusters. Meta-community analyses revealed just five ecological zones throughout the global ocean, and established local and global patterns and drivers in viral community diversity at levels of both macrodiversity (inter-population diversity) and microdiversity (intra-population genetic variation). These patterns sometimes, but not always, paralleled those from macro- organisms and revealed temperate and tropical surface waters and the Arctic as biodiversity hotspots and mechanistic hypotheses to explain them. With this further understanding of viral populations and ecology in the ocean, viruses can be more broadly included in ecosystem models.
Funding
3790
864.14.004
AI112542
ANR-10-INBS-09
ANR-11-BTBR0008
France Genomique
Gordon and Betty Moore Foundation
Helmut Horten Foundation
National Institutes of Health (NIH)
National Science Foundation (NSF)
Netherlands Organization for Scientific Research (NOWO)
This is the author accepted manuscript. The final version is available from Elsevier (Cell Press) via the DOI in this record.
Scripts used in this manuscript are available on the Sullivan laboratory bitbucket under GOV 2.0.
All raw reads are available through ENA (Tara Oceans and TOPC) or IMG (Malapsina) using
the identifiers listed in Table S3. Processed data are available through iVirus, including all
assembled contigs, viral populations and genes.