[go: nahoru, domu]

Full resolution HLA and KIR gene annotations for human genome assemblies

  1. Heng Li3,4
  1. 1 Dana-Farber Cancer institute;
  2. 2 Dartmouth College;
  3. 3 Harvard University, Dana-Farber Cancer Institute
  • * Corresponding author; email: hli{at}jimmy.harvard.edu
  • Abstract

    The Human leukocyte antigens (HLA) genes and the Killer cell immunoglobulin like receptors (KIR) genes are critical to immune responses and are associated with many immune-related diseases. Located in highly polymorphic regions, they are hard to study with traditional short-read alignment-based methods. Although modern long-read assemblers can often assemble these genes, using existing tools to annotate HLA and KIR genes in these assemblies remains a nontrivial task. Here, we describe Immuannot, a new computation tool to annotate the gene structures of HLA and KIR genes and to type the allele of each gene. Applying Immuannot to 56 regional and 212 whole-genome assemblies from previous studies, we annotated 9,931 HLA and KIR genes and found that almost half of these genes, 4,068, had novel sequences compared to the current Immuno Polymorphism Database (IPD). These novel gene sequences were represented by 2,664 distinct alleles, some of which contained nonsynonymous variations resulting in 92 novel protein sequences. We demonstrated the complex haplotype structures at the two loci and reported the linkage between HLA/KIR haplotypes and gene alleles. We anticipate that Immuannot will speed up the discovery of new HLA/KIR alleles and enable the association of HLA/KIR haplotype structures with clinical outcomes in the future.

    • Received January 12, 2024.
    • Accepted May 22, 2024.

    This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see https://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.

    ACCEPTED MANUSCRIPT

    Preprint Server