TY - JOUR AU - Vyse, Timothy J AB - Abstract The omnigenic model of complex disease stipulates that the majority of the heritability will be explained by the effects of common variation on genes in the periphery of core disease pathways. Rare variant associations, expected to explain far less of the heritability, may be enriched in core disease genes and thus will be instrumental in the understanding of complex disease pathogenesis and their potential therapeutic targets. Here, using complementary whole-exome sequencing, high-density imputation, and in vitro cellular assays, we identify candidate core genes in the pathogenesis of systemic lupus erythematosus (SLE). Using extreme-phenotype sampling, we sequenced the exomes of 30 SLE parent-affected-offspring trios and identified 14 genes with missense de novo mutations (DNM), none of which are within the >80 SLE susceptibility loci implicated through genome-wide association studies. In a follow-up cohort of 10, 995 individuals of matched European ancestry, we imputed genotype data to the density of the combined UK10K-1000 genomes Phase III reference panel across the 14 candidate genes. Gene-level analyses indicate three functional candidates: DNMT3A, PRKCD, and C1QTNF4. We identify a burden of rare variants across PRKCD associated with SLE risk (P = 0.0028), and across DNMT3A associated with two severe disease prognosis sub-phenotypes (P = 0.0005 and P = 0.0033). We further characterise the TNF-dependent functions of the third candidate gene C1QTNF4 on NF-κB activation and apoptosis, which are inhibited by the p.His198Gln DNM. Our results identify three novel genes in SLE susceptibility and support extreme-phenotype sampling and DNM gene discovery to aid the search for core disease genes implicated through rare variation. Introduction Considerable progress has been made in elucidating the genetic basis of complex diseases. The vast majority of identified disease-associated genetic polymorphisms are common in the population and the risk alleles impart a modest individual increment to the likelihood of developing disease. Although large-scale genome-wide association studies (GWAS) have so far explained less of the heritability than originally predicted (1), much of the ‘missing heritability’ is expected to be accounted for by common variants with effect sizes below the genome-wide significance threshold (2). However, under the newly proposed omnigenic model of complex traits, the majority of associated common variants—both identified and unidentified—will primarily be found in periphery genes expressed in relevant cell types but not necessarily biologically relevant to disease (3). In contrast, the role of rare variants in complex disease is largely unknown and often dismissed. A recent study, however, with an extremely large sample size, identified rare and low frequency variants contributing to the genetic variance of adult human height (4)—a polygenic trait with a genetic architecture similar to that of complex diseases (5)—suggesting previous complex disease studies with seemingly large sample sizes were perhaps still insufficiently powered to detect rare variant associations (6). Furthermore, studies of rare variants typically find gene sets enriched in biologically relevant functions/pathways (3,7,8). Therefore, although estimated to explain less of the heritable disease risk at a population level than common variants, identifying rare and low frequency variants is of paramount importance to understanding disease pathogenesis as they are likely to implicate biologically relevant core genes (3). The underrepresentation of rare variant associations within GWAS loci supports the theory that a discrete set of genes will be implicated through rare variants (9). Exome-wide searches, which provides a highly enriched source of potential disease-causing mutations (10), have revealed limited numbers of rare variation associated with complex diseases. Even though greater statistical power is achieved by gene-level analyses whereby aggregated variants are tested for an allelic burden of collective rare variation, widely used gene-based association tests have been shown to lack power at the exome-wide level (11). Coupled with the insufficient sample sizes currently available in the study of most complex diseases, hypothesis-free searches for core genes with rare variant associations are unlikely to be fruitful. Our strategy to address this problem in autoimmune disease SLE (SLE; MIM 152700), is outlined here and summarised in Figure 1. Using a discovery cohort of 30 unrelated SLE cases with a severe disease (young age of onset and clinical features associated with poorer outcome), we hypothesized that these individuals would exhibit unique mutation events in their protein-coding DNA that may predispose to disease risk. We undertook whole-exome sequencing (WES) in 30 family trios (both parents and affected offspring) and scrutinized the data for non-inherited de novo mutations (DNM) in the individual with SLE to identify a group of candidate genes for an independent follow-up rare variant analysis. This method allowed the identification of novel loci harbouring disease risk through collective rare variation, and emphasises the value of phenotypic extremes in the search for core genes in multifactorial disorders (12). Figure 1. Open in new tabDownload slide Overview of study. De novo mutations (DNM) in a discovery cohort revealed candidate genes for imputation-based rare variant burden testing using a follow-up cohort. Independent functional analyses demonstrate the functional effects of one DNM in a candidate gene. Results Identification of DNM in extreme-phenotype SLE cases We screened for DNM by WES of 30 family trios with an affected offspring with more severe SLE (Supplementary Material, Fig. S1). A total of 584 798 variants (≥20X), including single nucleotide variants and indels, were identified in the 30 affected probands. Using three bioinformatic tools and employing conservative parameters, 17 putative missense DNM were identified across 17 genes (Supplementary Material, Table S1, Fig. S2). We also analysed the SLE proband WES data alone, without the unaffected parents. This revealed 1194 non-silent, heterozygous, rare variants in 1, 067 genes distributed across the genome, which would make prioritization for downstream analysis a difficult task, highlighting the benefit of parent-offspring trio sequencing (Supplementary Material, Fig. S3). Sanger sequencing confirmed 14 true positive non-silent DNM (Table 1; Supplementary Material, Table S2), present in the SLE proband but absent in both parents and any unaffected siblings, in 11 of the 30 probands (36.7%) for further analysis. No DNM was found in any of the >80 known SLE-associated genes. Of the three false positive DNM (11.7%; Supplementary Material, Table S1) one, within LAMC2, is likely a result of germline mosaicism because, although not observed in either parent, it is observed in an unaffected sibling in addition to the SLE proband (13), and the other two variants are within KRTAP10–2 and KLRC1—both members of highly homologous gene families. Such sequence identity may have caused false positive identification of DNM in the WES analysis and suggests our NGS error-prone genes (NEPG) filter, which removes loci known to be problematic for genome mapping during NGS analyses, should have been more conservative. Indeed the KRLC1 p.Ile225Met missense variant appears to be a polymorphic Paralogous Sequence Variant (PSV)—the paralogous variant being p.Met223Ile in KLRC2. Table 1. De novo mutations in SLE probands with extreme phenotypes Family . Mutation (chr: position ref: alt) . Gene . Gene description . Exon . Amino acid . MAF in ExACa . CADD Phred . Mutation typeb . SLE0751 22: 38336799 C: T MICALL1 MICAL-like 1 16 Arg852Cys 1.5 × 10−4 35 Ti CpG SLE0496 3: 53223122 G: A PRKCD protein kinase C, delta 16 Gly535Arg – 34 Ti CpG SLE0679 12: 57588368 C: T LRP1 Low-density lipoprotein receptor-related protein 1 50 Arg2693Cys 8.3 × 10−4 34 Ti CpG SLE0592 6: 36260896 G: A PNPLA1 patatin-like phospholipase domain containing 1 3 Arg166His 5.8 × 10−5 33 Ti CpG SLE0296 2: 25457236 G: A DNMT3A DNA (cytosine-5-)-methyltransferase 3 alpha 19 Ala695Val – 32 Ti CpG SLE0571 4: 79512728 G: T ANXA3 annexin A3 7 Ser145Ile – 25.2 Tv SLE0679 3: 171431716 G: A PLD1 phospholipase D1, phosphatidylcholine-specific 9 Thr293Met 5.8 × 10−5 25.1 Ti CpG SLE0411 5: 179743769 C: T GFPT2 glutamine-fructose-6-phosphate transaminase 2 12 Val383Met 2.6 × 10−5 23.4 Ti CpG SLE0679 7: 138968784 C: A UBN2 ubinuclein 2 15 Pro1045Thr – 18.46 Tv SLE0080 16: 2812426 C: T SRRM2 serine/arginine repetitive matrix 2 11 Arg633Cys – 14.32 Ti CpG SLE0852 11: 47611769 G: C C1QTNF4 C1q and tumor necrosis factor related protein 4 2 His198Gln – 12.29 Tv SLE0321 18: 61621642 G: A HMSD histocompatibility (minor) serpin domain containing 3 Ala25Thr – 9.732 Ti SLE0390 12: 32369376 G: C BICD1 bicaudal D homolog 1 (Drosophila) 2 Val137Leu – 8.673 Tv SLE0321 1: 35251125 C: G GJB3 gap junction protein, beta 3 2 Asp254Glu – 0.002 Tv Family . Mutation (chr: position ref: alt) . Gene . Gene description . Exon . Amino acid . MAF in ExACa . CADD Phred . Mutation typeb . SLE0751 22: 38336799 C: T MICALL1 MICAL-like 1 16 Arg852Cys 1.5 × 10−4 35 Ti CpG SLE0496 3: 53223122 G: A PRKCD protein kinase C, delta 16 Gly535Arg – 34 Ti CpG SLE0679 12: 57588368 C: T LRP1 Low-density lipoprotein receptor-related protein 1 50 Arg2693Cys 8.3 × 10−4 34 Ti CpG SLE0592 6: 36260896 G: A PNPLA1 patatin-like phospholipase domain containing 1 3 Arg166His 5.8 × 10−5 33 Ti CpG SLE0296 2: 25457236 G: A DNMT3A DNA (cytosine-5-)-methyltransferase 3 alpha 19 Ala695Val – 32 Ti CpG SLE0571 4: 79512728 G: T ANXA3 annexin A3 7 Ser145Ile – 25.2 Tv SLE0679 3: 171431716 G: A PLD1 phospholipase D1, phosphatidylcholine-specific 9 Thr293Met 5.8 × 10−5 25.1 Ti CpG SLE0411 5: 179743769 C: T GFPT2 glutamine-fructose-6-phosphate transaminase 2 12 Val383Met 2.6 × 10−5 23.4 Ti CpG SLE0679 7: 138968784 C: A UBN2 ubinuclein 2 15 Pro1045Thr – 18.46 Tv SLE0080 16: 2812426 C: T SRRM2 serine/arginine repetitive matrix 2 11 Arg633Cys – 14.32 Ti CpG SLE0852 11: 47611769 G: C C1QTNF4 C1q and tumor necrosis factor related protein 4 2 His198Gln – 12.29 Tv SLE0321 18: 61621642 G: A HMSD histocompatibility (minor) serpin domain containing 3 Ala25Thr – 9.732 Ti SLE0390 12: 32369376 G: C BICD1 bicaudal D homolog 1 (Drosophila) 2 Val137Leu – 8.673 Tv SLE0321 1: 35251125 C: G GJB3 gap junction protein, beta 3 2 Asp254Glu – 0.002 Tv The mutations are ordered by level of severity, from most to least, predicted by CADD score. a Frequencies are presented from all 61 468 multiethnic individuals in ExAC because the de novo mutations observed in ExAC are likely to be identity-by-state not identity-by-descent. b Tv = Transversion; Ti = Transition; Ti CpG = Transition within a CpG dinucleotide. Open in new tab Table 1. De novo mutations in SLE probands with extreme phenotypes Family . Mutation (chr: position ref: alt) . Gene . Gene description . Exon . Amino acid . MAF in ExACa . CADD Phred . Mutation typeb . SLE0751 22: 38336799 C: T MICALL1 MICAL-like 1 16 Arg852Cys 1.5 × 10−4 35 Ti CpG SLE0496 3: 53223122 G: A PRKCD protein kinase C, delta 16 Gly535Arg – 34 Ti CpG SLE0679 12: 57588368 C: T LRP1 Low-density lipoprotein receptor-related protein 1 50 Arg2693Cys 8.3 × 10−4 34 Ti CpG SLE0592 6: 36260896 G: A PNPLA1 patatin-like phospholipase domain containing 1 3 Arg166His 5.8 × 10−5 33 Ti CpG SLE0296 2: 25457236 G: A DNMT3A DNA (cytosine-5-)-methyltransferase 3 alpha 19 Ala695Val – 32 Ti CpG SLE0571 4: 79512728 G: T ANXA3 annexin A3 7 Ser145Ile – 25.2 Tv SLE0679 3: 171431716 G: A PLD1 phospholipase D1, phosphatidylcholine-specific 9 Thr293Met 5.8 × 10−5 25.1 Ti CpG SLE0411 5: 179743769 C: T GFPT2 glutamine-fructose-6-phosphate transaminase 2 12 Val383Met 2.6 × 10−5 23.4 Ti CpG SLE0679 7: 138968784 C: A UBN2 ubinuclein 2 15 Pro1045Thr – 18.46 Tv SLE0080 16: 2812426 C: T SRRM2 serine/arginine repetitive matrix 2 11 Arg633Cys – 14.32 Ti CpG SLE0852 11: 47611769 G: C C1QTNF4 C1q and tumor necrosis factor related protein 4 2 His198Gln – 12.29 Tv SLE0321 18: 61621642 G: A HMSD histocompatibility (minor) serpin domain containing 3 Ala25Thr – 9.732 Ti SLE0390 12: 32369376 G: C BICD1 bicaudal D homolog 1 (Drosophila) 2 Val137Leu – 8.673 Tv SLE0321 1: 35251125 C: G GJB3 gap junction protein, beta 3 2 Asp254Glu – 0.002 Tv Family . Mutation (chr: position ref: alt) . Gene . Gene description . Exon . Amino acid . MAF in ExACa . CADD Phred . Mutation typeb . SLE0751 22: 38336799 C: T MICALL1 MICAL-like 1 16 Arg852Cys 1.5 × 10−4 35 Ti CpG SLE0496 3: 53223122 G: A PRKCD protein kinase C, delta 16 Gly535Arg – 34 Ti CpG SLE0679 12: 57588368 C: T LRP1 Low-density lipoprotein receptor-related protein 1 50 Arg2693Cys 8.3 × 10−4 34 Ti CpG SLE0592 6: 36260896 G: A PNPLA1 patatin-like phospholipase domain containing 1 3 Arg166His 5.8 × 10−5 33 Ti CpG SLE0296 2: 25457236 G: A DNMT3A DNA (cytosine-5-)-methyltransferase 3 alpha 19 Ala695Val – 32 Ti CpG SLE0571 4: 79512728 G: T ANXA3 annexin A3 7 Ser145Ile – 25.2 Tv SLE0679 3: 171431716 G: A PLD1 phospholipase D1, phosphatidylcholine-specific 9 Thr293Met 5.8 × 10−5 25.1 Ti CpG SLE0411 5: 179743769 C: T GFPT2 glutamine-fructose-6-phosphate transaminase 2 12 Val383Met 2.6 × 10−5 23.4 Ti CpG SLE0679 7: 138968784 C: A UBN2 ubinuclein 2 15 Pro1045Thr – 18.46 Tv SLE0080 16: 2812426 C: T SRRM2 serine/arginine repetitive matrix 2 11 Arg633Cys – 14.32 Ti CpG SLE0852 11: 47611769 G: C C1QTNF4 C1q and tumor necrosis factor related protein 4 2 His198Gln – 12.29 Tv SLE0321 18: 61621642 G: A HMSD histocompatibility (minor) serpin domain containing 3 Ala25Thr – 9.732 Ti SLE0390 12: 32369376 G: C BICD1 bicaudal D homolog 1 (Drosophila) 2 Val137Leu – 8.673 Tv SLE0321 1: 35251125 C: G GJB3 gap junction protein, beta 3 2 Asp254Glu – 0.002 Tv The mutations are ordered by level of severity, from most to least, predicted by CADD score. a Frequencies are presented from all 61 468 multiethnic individuals in ExAC because the de novo mutations observed in ExAC are likely to be identity-by-state not identity-by-descent. b Tv = Transversion; Ti = Transition; Ti CpG = Transition within a CpG dinucleotide. Open in new tab Variant- and gene-level functional characterization of DNM In order to best predict the phenotypic effect of the 14 DNM, we used both variant-level and gene-level metrics (14). We used the ExAC database (15) and Combined Annotation Dependent Depletion (CADD) scores (16) to characterise the frequency and predicted functional effects, respectively, of the variants. Five of the 14 DNM—found in MICALL1, LRP1, PNPLA1, PLD1, and GFTP2—have been observed, at very rare frequencies, in the ∼60 000 exomes documented in ExAC (Table 1). All five mutations are CpG transitions and therefore likely to be identity-by-state, reflecting the higher mutability rate of these sites. Within the mutation set, five (35.7%)—found in DNMT3A, PRKCD, MICALL1, LRP1, and PNPLA1—have CADD Phred scores >30, placing them in the top 0.1% of possible damaging mutations in the human genome (Table 1). We further explored the function, expression (BioGPS), existing autoimmunity associations (ImmunoBase), and gene-level constraint against missense mutations (ExAC), of the DNM genes to build a profile of a priori evidence of a role in SLE pathogenesis. None of the candidate genes have been previously associated with SLE through GWAS in any population (17). We also identify candidate genes through known/predicted function and expression profiles (C1QTNF4, SRRM2, HMSD), and four genes (PRKCD, DNMT3A, C1QTNF4 and LRP1) with a significant (Z > 3.09) constraint against missense variants (Table 2). However, across the entire gene set, there was no difference in the median Z-score (0.50) compared with the median Z-score across all genes in ExAC (0.51). Table 2. Evidence for role of de novo mutation gene in autoimmunity Gene . Functional candidatea . Association with SLEb . Associations with other AIDb . Immune cell type with highest expressionc . Missense constraintd . PRKCD B cell signaling and self-antigen induced B cell tolerance induction Monogenic forms30 IBD, UC, CD28 Dendritic 3.75* DNMT3A DNA methyltransferase Candidate gene study35 CD29 – 4.31* C1QTNF4 Pro-inflammatory cytokine – – CD34+ 3.17* SRRM2 Spliceosome-associated pre-mRNA splicing – – CD8+ No data LRP1 Endo/Phagocytosis of apoptotic cells – – – 10.60* HMSD Minor histocompatibility antigen – – n/a 0.25 UBN2 DNA binding – – – 0.01 ANXA3 – – RA17 – −0.37 PLD1 – – – Lymphoblasts −0.73 PNPLA1 – – – – 0.27 GFPT2 – – – – 1.59 BICD1 – – – – 2.12 GJB3 – – – – −0.81 MICALL1 – – – – 0.50 Gene . Functional candidatea . Association with SLEb . Associations with other AIDb . Immune cell type with highest expressionc . Missense constraintd . PRKCD B cell signaling and self-antigen induced B cell tolerance induction Monogenic forms30 IBD, UC, CD28 Dendritic 3.75* DNMT3A DNA methyltransferase Candidate gene study35 CD29 – 4.31* C1QTNF4 Pro-inflammatory cytokine – – CD34+ 3.17* SRRM2 Spliceosome-associated pre-mRNA splicing – – CD8+ No data LRP1 Endo/Phagocytosis of apoptotic cells – – – 10.60* HMSD Minor histocompatibility antigen – – n/a 0.25 UBN2 DNA binding – – – 0.01 ANXA3 – – RA17 – −0.37 PLD1 – – – Lymphoblasts −0.73 PNPLA1 – – – – 0.27 GFPT2 – – – – 1.59 BICD1 – – – – 2.12 GJB3 – – – – −0.81 MICALL1 – – – – 0.50 Genes appear in descending order of supporting evidence. UC = ulcerative colitis, CD = Crohn’s Disease, IBD = inflammatory bowel disease, RA = Rheumatoid Arthritis. a See Supplementary Material, Table S5. b See Supplementary Material, Table S6. c See Supplementary Material, Figure S4. Data from BioGPS. If gene expression is highest in immune cells compared with all other cells, the immune cell type with highest expression is listed. d Gene-wise ExAC Constraint Z-scores. Genes with significant restraint against missense variants are highlighted with an asterisk. Open in new tab Table 2. Evidence for role of de novo mutation gene in autoimmunity Gene . Functional candidatea . Association with SLEb . Associations with other AIDb . Immune cell type with highest expressionc . Missense constraintd . PRKCD B cell signaling and self-antigen induced B cell tolerance induction Monogenic forms30 IBD, UC, CD28 Dendritic 3.75* DNMT3A DNA methyltransferase Candidate gene study35 CD29 – 4.31* C1QTNF4 Pro-inflammatory cytokine – – CD34+ 3.17* SRRM2 Spliceosome-associated pre-mRNA splicing – – CD8+ No data LRP1 Endo/Phagocytosis of apoptotic cells – – – 10.60* HMSD Minor histocompatibility antigen – – n/a 0.25 UBN2 DNA binding – – – 0.01 ANXA3 – – RA17 – −0.37 PLD1 – – – Lymphoblasts −0.73 PNPLA1 – – – – 0.27 GFPT2 – – – – 1.59 BICD1 – – – – 2.12 GJB3 – – – – −0.81 MICALL1 – – – – 0.50 Gene . Functional candidatea . Association with SLEb . Associations with other AIDb . Immune cell type with highest expressionc . Missense constraintd . PRKCD B cell signaling and self-antigen induced B cell tolerance induction Monogenic forms30 IBD, UC, CD28 Dendritic 3.75* DNMT3A DNA methyltransferase Candidate gene study35 CD29 – 4.31* C1QTNF4 Pro-inflammatory cytokine – – CD34+ 3.17* SRRM2 Spliceosome-associated pre-mRNA splicing – – CD8+ No data LRP1 Endo/Phagocytosis of apoptotic cells – – – 10.60* HMSD Minor histocompatibility antigen – – n/a 0.25 UBN2 DNA binding – – – 0.01 ANXA3 – – RA17 – −0.37 PLD1 – – – Lymphoblasts −0.73 PNPLA1 – – – – 0.27 GFPT2 – – – – 1.59 BICD1 – – – – 2.12 GJB3 – – – – −0.81 MICALL1 – – – – 0.50 Genes appear in descending order of supporting evidence. UC = ulcerative colitis, CD = Crohn’s Disease, IBD = inflammatory bowel disease, RA = Rheumatoid Arthritis. a See Supplementary Material, Table S5. b See Supplementary Material, Table S6. c See Supplementary Material, Figure S4. Data from BioGPS. If gene expression is highest in immune cells compared with all other cells, the immune cell type with highest expression is listed. d Gene-wise ExAC Constraint Z-scores. Genes with significant restraint against missense variants are highlighted with an asterisk. Open in new tab PRKCD and DNMT3A are associated with SLE through collective rare variation Although the variant- and gene-level metric analyses suggested intriguing functional candidates, we took a comprehensive approach and tested each locus for an allelic burden of rare variation. We hypothesized that, while some observed DNM were random background variation as present in the exome of every individual regardless of disease status (18), others may be reflecting a hitherto unknown gene contributing to SLE risk, and this may be shown through rare variant burden. Therefore, genotype data were imputed (Supplementary Material, Figs S6 and S7) to the density of the combined UK10K and 1000 genomes Phase III reference panel (UK10K-1000GP3) across all 14 DNM genes in a follow-up cohort of 10 995 individuals of matched European ancestry previously genotyped on the Illumina HumanOmni1 BeadChip (19). Under the hypothesis that rare variants at these loci would be causal and not protective, we employed a one-tailed collapsing burden test (20) to survey each of the 14 genes for an excess of aggregated rare (MAF < 1%) exonic variants in SLE cases compared with healthy controls. We identify an association of PRKCD rare variants with SLE (Supplementary Material, Table S3; P = 0.0028; ncases=4036). In sub-phenotype analyses, we identify collective rare exonic variants in DNMT3A associated with both anti-dsDNA (Supplementary Material, Table S3; P = 0.0005; ncases=1261) and renal involvement with hypocomplementemia (Supplementary Material, Table S3; P = 0.0033; ncases=186), both of which are markers of more severe disease. We also collapsed all exons from the 14 genes together to test for an overall burden of rare variants across these loci. These analyses revealed no excess of rare exonic variants across the grouped genes, reflecting the hypothesis that some/most genes will not be relevant to disease status because the observed DNM are random background variation only. These data reflect the results of our gene-level constraint metric, in which the aggregated gene set do not have a significant mutation constraint. Together, these results suggest further prioritization based on gene-level metrics would not have resulted in true positive associations being excluded from analyses. Implication of C1QTNF4 in SLE through functional effect of DNM p.His198Gln Although no rare variant association was found at the novel candidate gene C1QTNF4, its potential role in disease is supported by gene-level metrics—it is a compelling functional candidate and one of four genes constrained against missense variants (ExAC gene-level constraints Z = 3.17, Table 2). Although gene coding length does not correlate with missense constraint scores (15), the small (<1Kb) coding sequence of this candidate gene may have contributed to insufficient power to detect a rare variant association in the burden testing. On the variant-level, the DNM in C1QTNF4 generates a p.His198Gln sequence change with a modest CADD score of 12.3 (Table 1). Although useful in the absence of suitable functional assays, the sensitivity of bioinformatic prediction tools is known to be suboptimal. Where functional assays are available, previous studies have also demonstrated functional effects of variants predicted to be tolerated/benign (21). We therefore pursued a functional analysis of the p.His198Gln DNM detected in the C1QTNF4 gene as an alternative method to add support for its potential role in disease. Although its function is rather poorly understood, the protein product, C1QTNF4 (CTRP4) is secreted and may act as a cytokine, as it has homology with TNF and the complement component C1q (Fig. 2). C1QTNF4 has been shown to influence NF-κB activation (22), a pathway known to be implicated in SLE pathogenesis, therefore we looked for an effect of the p.His198Gln mutation on NF-κB production. Using a HEK293-NF-κB reporter cell line, we showed that C1QTNF4 p.His198Gln mutant protein was expressed and that it inhibited the NF-κB activation generated by exposure to TNF (Fig. 2). Furthermore, we showed that the fibroblast L929 cell line, which is sensitive to TNF-induced cell death, was rescued by exposure to C1QTNF4 p.His198Gln, but not by wild type C1QTNF4. Thus, the mutant form of C1QTNF4 appears to inhibit some of the actions of TNF (23–25). Figure 2. Open in new tabDownload slide Structural and functional characterization of C1QTNF4 p.His198Gln substitution. (A) Domain organization of human C1QTNF4, showing signal peptide (yellow), first C1q domain (green), second C1q domain (blue) and linker peptides (grey). Arrow highlights substitution site. (B) 3D structure prediction of C1QTNF4 and C1QTNF4 p.His198Gln using Phyre2 (47). Ribbons show the interaction between the positively charged Histidine 198 and Proline 196 lost in C1QTNF4 p.His198Gln due to the substitution of Histidine with Glutamine. (C) Immunoblot demonstrating that p.His198Gln does not affect secretion of C1QTNF4 in HEK293 supernatants. (D) Size exclusion chromatography profile showing no difference in oligomerization between supernatant containing C1QTNF4 (blue) and C1QTNF4 p.His198Gln (red). (E) Luciferase assay in HEK293-NF-κB reporter cell line showing that C1QTNF4 p.His198Gln inhibits NF-κB activation in response to 4 h stimulation with 5 ng/ml TNFα. Error bars represent standard error of the mean. (F) Inhibition of L929 induced cell death by C1QTNF4 p.His198Gln after 24h of stimulation with 0.45 ng/ml TNFα in presence of Actinomycin 1 μg/ml. EV = empty vector. DNM genes do not harbour common variant associations We next tested for additional common variant associations at these 14 loci using the high-density UK10K-1000GP3 imputed data. No significant association at any locus was observed with overall risk in a case-control comparison (ncases=4036), nor with anti-dsDNA (ncases=1261) or renal involvement with hypocomplementemia (ncases=186) sub-phenotypes (Supplementary Material, Table S4). The lack of an associated common variant within PRKCD and DNMT3A supports the hypothesis that discrete gene sets will be identified through rare and common variant associations, with the former expecting to be enriched for core disease genes (3). Discussion To fully understand the pathogenesis of complex diseases we must analyse the full frequency spectrum of genetic variants (4). The study of rare variants associated with disease is of paramount importance to the discovery of core genes that have the potential to be therapeutic targets (12). Our data support the omnigenic hypothesis that rare genetic risk may be found in a discrete set of non-canonical susceptibility genes, as we report an association of collective rare variation across PRKCD and DNMT3A, and found no evidence of an association with common variants across these loci. This, to the best of our knowledge, is the first WES study in polygenic cases of autoimmune disease to use DNM discovery to identify candidate genes for rare variant analyses. Furthermore, our study supports the importance of phenotypic extremes in elucidating the genetic basis of multifactorial disorders (26). Searching GWAS-identified canonical disease susceptibility genes for additional rare variant risk has not been fruitful. Although there are examples—and perhaps more to discover—of canonical disease genes harbouring both common and rare risk alleles (27), the vast majority of such loci do not. Indeed the common variant associated loci which have also been shown to harbour rare coding variant risk are often those distinct minority of loci where the common polymorphisms are non-silent coding variants [e.g. NCF2 (9)]. It is important to note, however, that the separation of periphery and core genes may not necessarily be binary (3). DNMT3A and PRKCD, although hitherto not associated with polygenic SLE, are known autoimmunity susceptibility loci; DNMT3A is associated with Crohn’s disease (CD) (28) and PRKCD is associated with both CD and ulcerative colitis (UC) (29). The notion that a locus could harbour common variants contributing to one autoimmune disease and rare variants contributing to another is intriguing, and could provide further hypothesis-driven searches in the hunt for disease-specific core genes. A functional missense variant p.G510S (c.G1528A) in PRKCD has previously been reported in a consanguineous family with monogenic SLE (30). It was demonstrated that the PRKCD-encoded protein, PRCδ, was essential in the regulation of B cell tolerance and affected family members with the homozygous mutation had increased numbers of immature B cells. Our study implicates the role of rare variants in PRKCD in the broader context of SLE susceptibility, beyond a monogenic recessive disease model. Indeed the analysis of rare and low frequency variants contributing to human height found significant overlap with genes mutated in monogenic growth disorders (4). Furthermore, PRKCB, another member of the protein kinase C gene family, has been implicated in SLE risk (31). DNMT3A, a DNA methyltransferase, is a very intriguing candidate gene for SLE as altered patterns of DNA methylation are reported in autoimmune diseases (32), and hypomethylation of apoptotic DNA has been reported to induce autoantibody production in SLE (33). DNA methylation changes are also associated with monozygotic twin discordance in SLE (34). A candidate gene study previously reported a trend of association between the common DNMT3A intronic SNP rs1550117 (MAF∼7%) and SLE in a European cohort (35). Our analysis did not replicate this finding (P = 0.23) and found no evidence of a common variant association at this locus. Instead, we find an association of collective rare variants and SLE sub-phenotypes and emphasises the importance of deep phenotyping and the potential role of rare variants in specific sub-phenotype, or indeed autoimmune, manifestations. Despite progress with diagnosis and treatment, particular SLE sub-phenotypes—including those used in this study—are still associated with reduced life expectancy. Therefore, elucidating the specific underlying genetic risk is of paramount importance. Through two in vitro assays, we demonstrated the functional effect of a DNM, p.His198Gln in candidate gene C1QTNF4, despite this mutation being predicted to be of little functional importance across variant-level prediction tools. We showed the mutated protein product of C1QTNF4, C1QTNF4, inhibits some TNF-mediated cellular responses, including activation of NF-κB and TNF-induced apoptosis. The role of TNF in SLE is complex and incompletely understood, although, in this context, it is noteworthy that TNF inhibition may promote antinuclear autoimmunity (24). Gene-level metrics for C1QTNF4 were supportive of a role in disease and our result support the importance of combined gene- and variant-level metrics, and the dangers of relying heavily on variant-level metrics alone, when interpreting the potential role of mutations (14). C1QTNF6 is a known susceptibility locus for Type 1 Diabetes and is implicated in Rheumatoid Arthritis (36,37), and a suggestive association with SLE has recently been described in a transancestral Immunochip analysis (38). Together, these data suggest a potential role of the hitherto understudied C1QTNF superfamily of genes in autoimmunity. Although our study allowed a comprehensive approach to test all DNM genes for allelic burden of rare variants, our results show that filtering based on gene- or variant-level metrics would not have resulted in true associations of DNMT3A and PRKCD being missed. When larger datasets require further prioritization of genes, we suggest both variant- and gene-level metrics are used. Each human—regardless of the disease status—is estimated to have one DNM in their exome (18). The simple presence of a provisionally functional DNM in a proband is therefore not sufficient evidence that it contributes to disease risk. A major challenge of WES studies, therefore, is how to differentiate between variants truly important to disease and background variation (39). In light of recent studies which have demonstrated the limitations of large-scale exome-wide case-control studies in detecting rare variant associations (6,40), despite such associations being found when no limitation on sample size exists (4), our results support extreme-phenotype sampling and DNM discovery to aid a hypothesis-driven search for rare variant associations with complex diseases, in the hunt to determine core disease genes. Materials and Methods Selection of trios for sequencing SLE patients of European ancestry—as determined by genome-wide genotyping as part of a GWAS (19)—were selected from the UK SLE genetic repository assembled in the Vyse laboratory on the following criteria: age of onset of SLE < 25 years (median age 21 years); more marked disease phenotype as shown by either evidence for renal involvement as per standard classification criteria and/or the presence of hypocomplementemia and anti-dsDNA autoantibodies; and DNA available from both unaffected parents. The 30 trios (90 individuals) were exome sequenced, as described in SI Methods. Ethical approval for the research was granted by the NRES Committee London (12/LO/1273 and 06/MRE02/9). DNM calling Three bioinformatics tools with conservative parameters were used for DNM screening: BCFtools (41), DeNovoGear (42) and DeNovoCheck (43). A detailed description of the methods applied can be found in SI Methods. Briefly, 454 variants were identified with BCFtools and DeNovoGear and eight additional variants were identified by DeNovoCheck and validated by IGV, resulting in a total of 462 variants, which map to 257 genes. The variants were next filtered sequentially filtered (Supplementary Material, Fig. S2): (A) Removal of NEPG; (B) Fulfil a Het: Ref: Ref for Child: Father: Mother de novo pattern of inheritance and further selected variants that did not contain any trace of alternate allele in any of the parents; (C) Non-silent variant annotation. This process resulted in a total of 17 variants in 17 genes (Supplementary Material, Table S1). Analysis of WES in cases only 584, 798 variants with ≥20X coverage depth and within Gencode capture regions were identified in the analysis of 30 SLE probands only. Stringent filters were applied for variant refinement, described in full in SI Methods, resulting in 1194 variants in 1067 genes (Supplementary Material, Fig. S3). Sanger sequencing confirmation Primers were designed using Primer 3. 10ng of DNA from SLE probands, any unaffected siblings and both parents was amplified with Hot Start Taq polymerase. PCR products were first purified with EXO-SAP before BigDye labelling in a linear PCR and sequenced on an ABI 3300XL. Primers and PCR conditions available on request. The reads were analysed using Chromas Lite (v.2.1.1). Imputation Illumina HumanOmni1 BeadChip genotype data from 6995 controls and 4036 SLE patients of matched European ancestry were used, which had undergone quality control as previously described including Principal Component Analysis (PCA) to account for population structure (19). The UK10K (REL-2012–06-02) plus 1000 Genomes Project Phase3 data (release 20131101.v5) merged reference panel (UK10K-1000GP3) was accessed through the European Genome-phenome Archive (EGAD00001000776). The genotype data were imputed using the UK10K-1000GP3 reference panel across the coding regions of the 14 DNM genes plus a 2Mb flanking region. To increase the accuracy of imputed genotype calls, a full imputation without pre-phasing was conducted using IMPUTE2 (44,45). Imputed genotypes were filtered for confidence using an info score (IMPUTE2) threshold of 0.3 (Supplementary Material, Figs S6 and S7). The most likely genotype from IMPUTE2 was taken if its probability was > 0.5. If the probability fell below this threshold, it was set as missing. Variants with >10% missing genotype calls were removed for further analysis. All individuals had <8% missing genotype data. Rare variant burden tests Imputed data were filtered, using Plink v1.9, to include only variants mapping to coding exons of hg19 RefSeq transcripts. Plink/SEQv1.0 (20) was used to run gene-wise one-tailed burden testing with a MAF < 1% threshold. A 5% false discovery rate was used for multiple testing correction for 14 genes. Common variant association tests SNPTEST 2.5.2 (46) was used to test for associated variants with MAF > 1% across the region spanning the encoded gene. The first four covariates from the original GWAS were included (19). Bonferroni correction was used for 3000 tests across the loci (q = 1.66E-5). Plasmids Myc-Flag-tagged C1QTNF4 on the pCMV6 vector and the empty pCMV6 vector were used (OriGene). The mutant pCMV6-C1QTNF4 C594G (p.His198Gln) was generated by site-directed mutagenesis (Quikchange II XL; Stratagene) according the manufacturer’s instructions: mutagenic primer: 5’-GCGAGTGGTTGCTGCCGCGGCCC-3’ (Sigma-Aldrich). The plasmids production was carried out in XL10-Gold Ultracompetent cells, isolated and purified using EndoFree Maxi Prep kit (Qiagen) and plasmid ORFs were confirmed by full Sanger sequencing (GATC-Biotech). The expression and secretion of the flagged proteins was confirmed by western blot on cell lysates and supernatants with monoclonal anti-FLAG antibody (clone M2; Sigma-Aldrich). Luciferase assays and TNF-induced programmed cell death GloResponse NF-κB-RE-luc2P HEK293 cell line (Promega) and TNF-sensitive L929 fibrosarcoma cell line (ATCC) were cultured in Dulbecco's Modified Eagle Medium (DMEM) supplemented with 10% fetal bovine serum (FBS) and 1% Penicillin/Streptomycin at 37 °C, 5% CO2. HEK293 were seeded 24 h before transfection in antibiotic free DMEM in 96 wells plate (2 × 104 cells/well), transfected with either C1QTNF4, C1QTNF4 C594G or Empty Vector via Fugene HD (Promega). Forty eight hours after transfection the cell were left unstimulated or stimulated with TNFα 5 ng/ml (PeproTech) for 4 h. Luciferase activity was assayed by One-Glo (Promega) on Berthold Orion luminometer, the values were normalized to cell viability measured by CellTiter Glo (Promega). L929 were challenged with TNFα 0.45 ng/ml and Actinomycin D 1 μg/ml (R&D) for 24 h in presence of C1QTNF4 or C1QTNF4 p.His198Gln containing media, cell viability was measured by CellTiter Glo. Size exclusion chromatography Supernatants (750 µl) of HEK293 producing C1QTNF4 or C1QTNF4 p.His198Gln were buffer exchanged in PBS on Zeba Spin Desalting Columns (Thermo Fisher) and 0.5 ml loaded on an AKTA FPLC with a Superdex 200 10/300 GL column (GE Healthcare). Absorbance was normalized to the maximum peak of each sample. Supplementary Material Supplementary Material is available at HMG online. Acknowledgements Sequencing was performed by the SNP&SEQ Technology Platform in Uppsala, which is part of the National Genomics Infrastructure (NGI) hosted by Science for Life Laboratory in Sweden. We thank Johanna Lagensjö and Olof Karlberg for assistance with sequencing. Conflict of Interest statement. None declared. Funding European Union FP7 programme (grant agreement 262055) via the European Sequencing and Genotyping Infrastructure (ESGI) and Arthritis Research UK (grant 20580), Swedish Research Council for Medicine and Health (grant E0226301), the Knut and Alice Wallenberg Foundation (KAW 2011.0073), National Institute for Health Research (NIHR) Biomedical Research Centre based at Guy’s and St Thomas’ NHS Foundation Trust and King’s College London. Funding to pay the Open Access publication charges for this article was provided by a block grant to KCL, to which various charities and research councils contribute. References 1 Manolio T.A. , Collins F.S., Cox N.J., Goldstein D.B., Hindorff L.A., Hunter D.J., McCarthy M.I., Ramos E.M., Cardon L.R., Chakravarti A. ( 2009 ) Finding the missing heritability of complex diseases . Nature , 461 , 747 – 753 . Google Scholar Crossref Search ADS PubMed WorldCat 2 Yang J. , Benyamin B., McEvoy B.P., Gordon S., Henders A.K., Nyholt D.R., Madden P.A., Heath A.C., Martin N.G., Montgomery G.W., Goddard M.E., Visscher P.M. et al. ( 2010 ) Common SNPs explain a large proportion of the heritability for human height . Nat. Genet ., 42 , 565 – 569 . Google Scholar Crossref Search ADS PubMed WorldCat 3 Boyle E.A. , Li Y.I., Pritchard J.K. ( 2017 ) Leading edge perspective an expanded view of complex traits: from polygenic to omnigenic . Cell , 169 , 1177 – 1186 . http://dx.doi.org/10.1016/j.cell.2017.05.038 Google Scholar Crossref Search ADS PubMed WorldCat 4 Marouli E. , Graff M., Medina-Gomez C., Lo K.S., Wood A.R., Kjaer T.R., Fine R.S., Lu Y., Schurmann C., Highland H.M. et al. ( 2017 ) Rare and low-frequency coding variants alter human adult height . Nature , 542 , 186 – 190 . Google Scholar Crossref Search ADS PubMed WorldCat 5 Shi H. , Kichaev G., Pasaniuc B. ( 2016 ) Contrasting the genetic architecture of 30 complex traits from summary association data . Am. J. Hum. Genet ., 99 , 139 – 153 . Google Scholar Crossref Search ADS PubMed WorldCat 6 Fuchsberger C. , Flannick J., Teslovich T.M., Mahajan A., Agarwala V., Gaulton K.J., Ma C., Fontanillas P., Moutsianas L., McCarthy D.J. ( 2016 ) The genetic architecture of type 2 diabetes . Nature , 536 , 41 – 47 . Google Scholar Crossref Search ADS PubMed WorldCat 7 Purcell S.M. , Moran J.L., Fromer M., Ruderfer D., Solovieff N., Roussos P., O'Dushlaine C., Chambert K., Bergen S.E., Kähler A. et al. ( 2014 ) A polygenic burden of rare disruptive mutations in schizophrenia . Nature , 506 , 185 – 190 . Google Scholar Crossref Search ADS PubMed WorldCat 8 Ripke S. , Neale B.M., Corvin A., Walters J.T.R., Farh K.-H., Holmans P. a., Lee P., Bulik-Sullivan B., Collier D. a., Huang H. et al. ( 2014 ) Biological insights from 108 schizophrenia-associated genetic loci . Nature , 511 , 421 – 427 . Google Scholar Crossref Search ADS PubMed WorldCat 9 Hunt K.A. , Mistry V., Bockett N. a., Ahmad T., Ban M., Barker J.N., Barrett J.C., Blackburn H., Brand O., Burren O. et al. ( 2013 ) Negligible impact of rare autoimmune-locus coding-region variants on missing heritability . Nature , 498 , 232 – 235 . Google Scholar Crossref Search ADS PubMed WorldCat 10 Bamshad M.J. , Ng S.B., Bigham A.W., Tabor H.K., Emond M.J., Nickerson D.A., Shendure J. ( 2011 ) Exome sequencing as a tool for Mendelian disease gene discovery . Nat. Rev. Genet ., 12 , 745 – 755 . Google Scholar Crossref Search ADS PubMed WorldCat 11 Moutsianas L. , Agarwala V., Fuchsberger C., Flannick J., Rivas M.A., Gaulton K.J., Albers P.K., McVean G., Boehnke M., Altshuler D. et al. ( 2015 ) The power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease . PLoS Genet ., 11 , e1005165. Google Scholar Crossref Search ADS PubMed WorldCat 12 Chakravarti A. , Turner T.N. ( 2016 ) Revealing rate-limiting steps in complex disease biology: The crucial importance of studying rare, extreme-phenotype families . BioEssays , 38 , 578 – 586 . http://dx.doi.org/10.1002/bies.201500203 Google Scholar Crossref Search ADS PubMed WorldCat 13 Rahbari R. , Wuster A., Lindsay S.J., Hardwick R.J., Alexandrov L.B., Al Turki S., Dominiczak A., Morris A., Porteous D., Smith B. et al. ( 2015 ) Timing, rates and spectra of human germline mutation . Nat. Genet ., 48 , 1 – 11 . Google Scholar OpenURL Placeholder Text WorldCat 14 Itan Y. , Shang L., Boisson B., Patin E., Bolze A., Moncada-Vélez M., Scott E., Ciancanelli M.J., Lafaille F.G., Markle J.G. et al. ( 2015 ) The human gene damage index as a gene-level approach to prioritizing exome variants . Proc. Natl. Acad. Sci. U. S. A , 112 , 13615 – 13620 . Google Scholar Crossref Search ADS PubMed WorldCat 15 Lek M. , Karczewski K.J., Samocha K.E., Banks E., Fennell T., O A.H., Ware J.S., Hill A.J., Cummings B.B., Birnbaum D.P. et al. ( 2016 ) Analysis of protein-coding genetic variation in 60, 706 humans . Nature , 536 , 285 – 291 . http://dx.doi.org/10.1038/nature19057 Google Scholar Crossref Search ADS PubMed WorldCat 16 Kircher M. , Witten D.M., Jain P., O'Roak B.J., Cooper G.M., Shendure J. ( 2014 ) A general framework for estimating the relative pathogenicity of human genetic variants . Nat. Genet ., 46 , 310 – 315 . Google Scholar Crossref Search ADS PubMed WorldCat 17 Chen L. , Morris D.L., Vyse T.J. ( 2017 ) Genetic advances in systemic lupus erythematosus . Curr. Opin. Rheumatol ., 29 , 423 – 433 . Google Scholar Crossref Search ADS PubMed WorldCat 18 Veltman J. a. , Brunner H.G. ( 2012 ) De novo mutations in human genetic disease . Nat. Rev. Genet ., 13 , 565 – 575 . Google Scholar Crossref Search ADS PubMed WorldCat 19 Bentham J. , Morris D.L., Cunninghame Graham D.S., Pinder C.L., Tombleson P., Behrens T.W., Martín J., Fairfax B.P., Knight J.C., Chen L. et al. ( 2015 ) Genetic association analyses implicate aberrant regulation of innate and adaptive immunity genes in the pathogenesis of systemic lupus erythematosus . Nat. Genet ., 47 , 1457 – 1464 . Google Scholar Crossref Search ADS PubMed WorldCat 20 Purcell S.M. PLINK/SEQ: A library for the analysis of genetic variation data. http://atgu.mgh.harvard.edu/plinkseq/; May 2017, date last accessed. 21 Roberts A.L. , Thomas E.R., Bhosle S., Game L., Obraztsova O., Aitman T.J., Vyse T.J., Rhodes B. ( 2014 ) Resequencing the susceptibility gene, ITGAM, identifies two functionally deleterious rare variants in systemic lupus erythematosus cases . Arthritis Res. Ther ., 16 , R114. Google Scholar Crossref Search ADS PubMed WorldCat 22 Li Q. , Wang L., Tan W., Peng Z., Luo Y., Zhang Y., Zhang G., Na D., Jin P., Shi T. et al. ( 2011 ) Identification of C1qTNF-related protein 4 as a potential cytokine that stimulates the STAT3 and NF-κB pathways and promotes cell survival in human cancer cells . Cancer Lett ., 308 , 203 – 214 . Google Scholar Crossref Search ADS PubMed WorldCat 23 Beigel F. , Schnitzler F., Paul Laubender R., Pfennig S., Weidinger M., Göke B., Seiderer J., Ochsenkühn T., Brand S. ( 2011 ) Formation of antinuclear and double-strand DNA antibodies and frequency of lupus-like syndrome in anti-TNF-α antibody-treated patients with inflammatory bowel disease . Inflamm. Bowel Dis ., 17 , 91 – 98 . Google Scholar Crossref Search ADS PubMed WorldCat 24 Eriksson C. , Engstrand S., Sundqvist K.-G., Rantapää-Dahlqvist S. ( 2005 ) Autoantibody formation in patients with rheumatoid arthritis treated with anti-TNF alpha . Ann. Rheum. Dis ., 64 , 403 – 407 . Google Scholar Crossref Search ADS PubMed WorldCat 25 Pink A.E. , Fonia A., Allen M.H., Smith C.H., Barker J.N.W.N. ( 2009 ) Antinuclear antibodies associate with loss of response to antitumour necrosis factor-alpha therapy in psoriasis: a retrospective, observational study . Br. J. Dermatol ., 162 , 780 – 785 . Google Scholar Crossref Search ADS PubMed WorldCat 26 Turner T.N. , Sharma K., Oh E.C., Liu Y.P., Collins R.L., Sosa M.X., Auer D.R., Brand H., Sanders S.J., Moreno-De-Luca D. et al. ( 2015 ) Loss of δ-catenin function in severe autism . Nature , 520 , 51 – 56 . Google Scholar Crossref Search ADS PubMed WorldCat 27 Jordan C.T. , Cao L., Roberson E.D.O., Duan S., Helms C. a., Nair R.P., Duffin K.C., Stuart P.E., Goldgar D., Hayashi G. et al. ( 2012 ) Rare and common variants in CARD14, encoding an epidermal regulator of NF-kappaB, in psoriasis . Am. J. Hum. Genet ., 90 , 796 – 808 . Google Scholar Crossref Search ADS PubMed WorldCat 28 Franke A. , McGovern D.P.B., Barrett J.C., Wang K., Radford-Smith G.L., Ahmad T., Lees C.W., Balschun T., Lee J., Roberts R. et al. ( 2010 ) Genome-wide meta-analysis increases to 71 the number of confirmed Crohn’s disease susceptibility loci . Nat. Genet ., 42 , 1118 – 1125 . Google Scholar Crossref Search ADS PubMed WorldCat 29 Jostins L. , Ripke S., Weersma R.K., Duerr R.H., McGovern D.P., Hui K.Y., Lee J.C., Schumm L.P., Sharma Y., Anderson C. a. et al. ( 2012 ) Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease . Nature , 491 , 119 – 124 . http://dx.doi.org/10.1038/nature11582 Google Scholar Crossref Search ADS PubMed WorldCat 30 Belot A. , Kasher P.R., Trotter E.W., Foray A.P., Debaud A.L., Rice G.I., Szynkiewicz M., Zabot M.T., Rouvet I., Bhaskar S.S. et al. ( 2013 ) Protein kinase cδ deficiency causes mendelian systemic lupus erythematosus with B cell-defective apoptosis and hyperproliferation . Arthritis Rheum ., 65 , 2161 – 2171 . Google Scholar Crossref Search ADS PubMed WorldCat 31 Sheng Y.-J. , Gao J.-P., Li J., Han J.-W., Xu Q., Hu W.-L., Pan T.-M., Cheng Y.-L., Yu Z.-Y., Ni C. et al. ( 2011 ) Follow-up study identifies two novel susceptibility loci PRKCB and 8p11.21 for systemic lupus erythematosus . Rheumatology (Oxford) , 50 , 682 – 688 . Google Scholar Crossref Search ADS PubMed WorldCat 32 Ballestar E. ( 2011 ) Epigenetic alterations in autoimmune rheumatic diseases . Nat. Rev. Rheumatol ., 7 , 263 – 271 . Google Scholar Crossref Search ADS PubMed WorldCat 33 Wen Z.K. , Xu W., Xu L., Cao Q.H., Wang Y., Chu Y.W., Xiong S.D. ( 2007 ) DNA hypomethylation is crucial for apoptotic DNA to induce systemic lupus erythematosus-like autoimmune disease in SLE-non-susceptible mice . Rheumatology , 46 , 1796 – 1803 . Google Scholar Crossref Search ADS PubMed WorldCat 34 Javierre B.M. , Fernandez A.F., Richter J., Al-Shahrour F., Martin-Subero J.I., Rodriguez-Ubreva J., Berdasco M., Fraga M.F., O’Hanlon T.P., Rider L.G. et al. ( 2010 ) Changes in the pattern of DNA methylation associate with twin discordance in systemic lupus erythematosus . Genome Res ., 20 , 170 – 179 . Google Scholar Crossref Search ADS PubMed WorldCat 35 Piotrowski P. , Grobelna M.K., Wudarski M., Olesińska M., Jagodziński P.P. ( 2015 ) Genetic variants of DNMT3A and systemic lupus erythematosus susceptibility . Mod. Rheumatol ., 25 , 96 – 99 . Google Scholar Crossref Search ADS PubMed WorldCat 36 Onengut-Gumuscu S. , Chen W.-M., Burren O., Cooper N.J., Quinlan A.R., Mychaleckyj J.C., Farber E., Bonnie J.K., Szpak M., Schofield E. ( 2015 ) Fine mapping of type 1 diabetes susceptibility loci and evidence for colocalization of causal variants with lymphoid gene enhancers . Nat. Genet ., 47 , 381 – 386 . Google Scholar Crossref Search ADS PubMed WorldCat 37 Murayama M.A. , Kakuta S., Inoue A., Umeda N., Yonezawa T., Maruhashi T., Tateishi K., Ishigame H., Yabe R., Ikeda S. et al. ( 2015 ) CTRP6 is an endogenous complement regulator that can effectively treat induced arthritis . Nat. Commun ., 6 , 8483 . Google Scholar Crossref Search ADS PubMed WorldCat 38 Langefeld C.D. , Ainsworth H.C., Cunninghame Graham D.S., Kelly J.A., Comeau M.E., Harley J.B., Wakeland E.K., Graham R.R., Gaffney P.M., et al. ( 2017 ) Transancestral mapping and genetic load in systemic lupus erythematosus . Nat. Commun , 17 , 6021 Google Scholar OpenURL Placeholder Text WorldCat 39 Cooper G.M. , Shendure J. ( 2011 ) Needles in stacks of needles: finding disease-causal variants in a wealth of genomic data . Nat. Rev. Genet ., 12 , 628 – 640 . Google Scholar Crossref Search ADS PubMed WorldCat 40 Luo Y. , de Lange K.M., Jostins L., Moutsianas L., Randall J., Kennedy N.A., Lamb C.A., McCarthy S., Ahmad T., Edwards C. et al. ( 2016 ) Exploring the genetic architecture of inflammatory bowel disease by whole genome sequencing identifies association at ADCY7 . Nat. Genet ., 49 , 186 – 192 . Google Scholar Crossref Search ADS WorldCat 41 Li H. , Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R. ( 2009 ) The sequence alignment/map format and SAMtools . Bioinformatics , 25 , 2078 – 2079 . Google Scholar Crossref Search ADS PubMed WorldCat 42 Ramu A. , Noordam M.J., Schwartz R.S., Wuster A., Hurles M.E., Cartwright R., Conrad D.F. ( 2013 ) DeNovoGear: de novo indel and point mutation discovery and phasing . Nat. Methods , 10 , 985 – 987 . Google Scholar Crossref Search ADS PubMed WorldCat 43 de Ligt J. , Willemsen M.H., van Bon B.W.M., Kleefstra T., Yntema H.G., Kroes T., Vulto-van Silfhout A.T., Koolen D. a., de Vries P., Gilissen C. et al. ( 2012 ) Diagnostic exome sequencing in persons with severe intellectual disability . N. Engl. J. Med ., 367 , 1921 – 1929 . Google Scholar Crossref Search ADS PubMed WorldCat 44 Howie B. , Fuchsberger C., Stephens M., Marchini J., Abecasis G.R. ( 2012 ) Fast and accurate genotype imputation in genome-wide association studies through pre-phasing . Nat. Genet ., 44 , 955 – 959 . Google Scholar Crossref Search ADS PubMed WorldCat 45 Roshyara N.R. , Horn K., Kirsten H., Ahnert P., Scholz M., An P., Leeuwen E.M., van Z., Lambert E., Olama J.C., Al A.A. et al. ( 2016 ) Comparing performance of modern genotype imputation methods in different ethnicities . Sci. Rep ., 6 , 34386. Google Scholar Crossref Search ADS PubMed WorldCat 46 Marchini J. , Howie B. ( 2010 ) Genotype imputation for genome-wide association studies . Nat. Rev. Genet ., 11 , 499 – 511 . Google Scholar Crossref Search ADS PubMed WorldCat 47 Kelley L.A. , Sternberg M.J.E. ( 2009 ) Protein structure prediction on the Web: a case study using the Phyre server . Nat. Protoc ., 4 , 363 – 371 . Google Scholar Crossref Search ADS PubMed WorldCat Author notes V.P and A.L.R contributed equally to this work. Present address: Department of Cancer Biology, The Institute of Cancer Research, London SW3 6JB, UK. Present address: Scientific Genomics Platforms, Laboratory of Functional Genomics, Nutrigenomics and Systems Biology, Max Delbrück Centre for Molecular Medicine (BIMSB/BIH), Berlin 13092, Germany. © The Author(s) 2017. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. © The Author(s) 2017. Published by Oxford University Press. TI - De novo mutations implicate novel genes in systemic lupus erythematosus JF - Human Molecular Genetics DO - 10.1093/hmg/ddx407 DA - 2018-02-01 UR - https://www.deepdyve.com/lp/oxford-university-press/de-novo-mutations-implicate-novel-genes-in-systemic-lupus-CnRUR4lp3d SP - 421 EP - 429 VL - 27 IS - 3 DP - DeepDyve ER -