posted on 2025-08-01, 17:15authored byK-W Kim, R De-Kayne, IJ Gordon, KS Omufwoko, DJ Martins, R Ffrench-Constant, SH Martin
Supergenes maintain adaptive clusters of alleles in the face of genetic mixing. Although usually attributed to inversions, supergenes can be complex, and reconstructing the precise processes that led to recombination suppression and their timing is challenging. We investigated the origin of the BC supergene, which controls variation in warning coloration in the African monarch butterfly, Danaus chrysippus. By generating chromosome-scale assemblies for all three alleles, we identified multiple structural differences. Most strikingly, we find that a region of more than 1 million bp underwent several segmental duplications at least 7.5 Ma. The resulting duplicated fragments appear to have triggered four inversions in surrounding parts of the chromosome, resulting in stepwise growth of the region of suppressed recombination. Phylogenies for the inversions are incongruent with the species tree and suggest that structural polymorphisms have persisted for at least 4.1 Myr. In addition to the role of duplications in triggering inversions, our results suggest a previously undescribed mechanism of recombination suppression through independent losses of divergent duplicated tracts. Overall, our findings add support for a stepwise model of supergene evolution involving a variety of structural changes. This article is part of the theme issue 'Genomic architecture of supergenes: causes and evolutionary consequences'.
This is the final version. Available from the Royal Society via the DOI in this record.
Data accessibility
Sequencing reads and assemblies are available at the European Nucleotide Archive project (see the electronic supplementary material, table S1 for accession numbers [48]). Assemblies are available in the European Nucleotide Archive project accession PRJEB52180. Additional data files are available from the Dryad Digital Repository: https://doi.org/10.5061/dryad.xwdbrv1g0 [49], including genome assemblies and annotations, repeat library and windowed repeat content, whole-genome alignments, VCF and genotype files, window-based diversity and divergence measures and read depth, sequence alignments for genes and phylogenetic trees. Scripts for assembly polishing, the analysis of repeat content, genome annotation and phylogenetic tree construction are available at https://github.com/RishiDeKayne/Danaus_supergene_structure. Scripts for genome alignment and synteny block inference, ancestry painting and divergence analyses, and read depth and copy number analyses are available at https://github.com/simonhmartin/Danaus_supergene_structure.