A Chinese indicine pangenome reveals a wealth of novel structural variants introgressed from other Bos species

Research output: Contribution to journal, Journal article, Research, peer-review

Documents

  • Fulltext

    Final published version, 8 MB, PDF document

  • Xuelei Dai
  • Peipei Bian
  • Dexiang Hu
  • Funong Luo
  • Yongzhen Huang
  • Shaohua Jiao
  • Xihong Wang
  • Mian Gong
  • Ran Li
  • Yudong Cai
  • Jiayue Wen
  • Qimeng Yang
  • Weidong Deng
  • Hojjat Asadollahpour Nanaei
  • Yu Wang
  • Fei Wang
  • Zijing Zhang
  • Benjamin D. Rosen
  • Rasmus Heller
  • Yu Jiang

Chinese indicine cattle harbor a much higher genetic diversity compared with other domestic cattle, but their genome architecture remains uninvestigated. Using PacBio HiFi sequencing data from 10 Chinese indicine cattle across southern China, we assembled 20 high-quality partially phased genomes and integrated them into a multiassembly graph containing 148.5 Mb (5.6%) of novel sequence. We identified 156,009 high-confidence nonredundant structural variants (SVs) and 206 SV hotspots spanning ~195 Mb of gene-rich sequence. We detected 34,249 archaic introgressed fragments in Chinese indicine cattle covering 1.93 Gb (73.3%) of the genome. We inferred an average of 3.8%, 3.2%, 1.4%, and 0.5% of introgressed sequence originating, respectively, from banteng-like, kouprey-like, gayal-like, and gaur-like Bos species, as well as 0.6% of unknown origin. Introgression from multiple donors might have contributed to the genetic diversity of Chinese indicine cattle. Altogether, this study highlights the contribution of interspecies introgression to the genomic architecture of an important livestock population and shows how exotic genomic elements can contribute to the genetic variation available for selection.

Original language: English
Journal: Genome Research
Volume: 33
Issue number: 8
Pages (from-to): 1284-1298
Number of pages: 15
ISSN: 1088-9051
Publication status: Published - Aug 2023

Bibliographical note

Publisher Copyright:
© 2023 Dai et al.

ID: 370586270