Open Access

Clinical next generation sequencing reveals an H3F3A gene as a new potential gene candidate for microcephaly associated with severe developmental delay, intellectual disability and growth retardation


Cite

Introduction

The clinical sign of abnormally small head, usually defined as head circumference below the third percentile, or at least two standard deviations below the mean from an age-and sex-matched reference population, affects approximately 2.0-3.0% of the world population [1,2]. It is classified into primary if present at birth and secondary if developing thereafter [2]. Microcephaly can be an isolated sign or present as one of the associated features defining various genetic syndromes. Over 900 distinct phenotypes and about 800 genes have so far been recorded in the OMIM database [3], which implies significant clinical and genetic heterogeneity.

Consequently, establishing genetic etiology is challenging; genetic pathogenesis was reported in about 29.0% of cases based on metabolic and classical genetic testing [4], while using next generation sequencing (NGS) improved diagnostic yield. A recent study [3], employing molecular karyotyping and exome sequencing, improved diagnostic rates to 58.0% with primary microcephaly, 27.0% with secondary microcephaly, and to 15.0% in patients with unknown onset. Moreover, whole exome sequencing in the clinical setting may identify not only mutations in genes already known to be associated with investigated phenotypes, but also discover new potential genes and mechanisms for human disorders [5,6]. In this study, we describe a new association between microcephaly accompanied with severe developmental delay and intellectual disability and a potentially new gene H3F3A involved in maintaining heterochromatin and telomeric integrity.

Clinical Report

The proband is a girl, the only child born to healthy Caucasian, non consanguineous, healthy parents, who were at the time of birth 26 (mother) and 29 (father) years old. The pregnancy was uneventful, with no evidence of infection, drug exposure or radiation. Delivery was at the 41st week of pregnancy by cesarean section due to the breech presentation. At birth, the proband weighed 2550 g, 4th centile, 46 cm, 4th centile and head circumference 33 cm, 5th centile. She was hypotonic and had slight transitory breathing problems with no malformations except small (3 mm), hemodynamically insignificant atrial septum defect. Magnetic resonance imaging (MRI) demonstrated hypoplasia of corpus callosum and cerebellum as well as thin layer of frontal and parietal periventricular gliosis. Visual evoked potentials demonstrated abnormal cortical responses with low amplitudes, suggesting a dysfunction of afferent optical system. At age 5, the proband weighed 16.5 kg, 10th centile, height of 94 cm, p <1st centile, and head circumference 45.5 cm, p <1st centile. She has severe global developmental delay, she is able to sit independently, but has not developed walking or speech. Her social interaction is poor. She has a wide, depressed nasal bridge, hypertelorism, hypotonic face, poorly formed, posteriorly set ears (Figure 1) and mild flexion contractures of fingers.

Figure 1

Facial features of the patient with a heterozygous de novo H3F3A gene variant p.(Leu62Arg) include wide and depressed nasal bridge, hypertelorism, hypotonic face and poorly formed, posteriorly set ears.

Methods

Exome Sequencing. Exome sequencing was performed in the affected proband and unaffected parents. Briefly, the library preparation was performed using the TruSeq protocol (Illumina Inc., San Diego, CA, USA), followed by the capture of exonic targets using the xGen Exome Research Panel v1.0 using Integrated DNA Technologies (IDT), probes (Coralville, IA, USA). The exome capture targeted 39 Mb of exonic regions of 19,396 genes in the human genome (hg19). Sequencing was subsequently performed using the pair-end sequencing protocol on Next-Seq 550 (Illumina Inc.) in 2 × 150 cycles. Sequencing data was processed using an in-house analysis pipeline, based on the combination of Burrows-Wheeler (BWA) aligner (v0.7.2) (http://bio-bwa.sourceforge.net/) and GATK software (v3.2) (https://software.broadinstitute.rg/gatk/) for variant calling. Duplicate sequences were removed using Picard MarkDuplicates (https://broadinstitutegithub.io.picard/), followed by base quality score recalibration, variant calling, variant quality score recalibration and variant filtering using the tools in the GATK framework [7].

Variant Analysis. The resulting variants were collected using the VariantTools software (https:/github.com/vatlab/variant tools) and their transcript and protein consequences predicted using the ANNOVAR (http://annovar.openbioinformatics.org/en/latest/) and SNPeff software (http://snpeff.sourceforge.net/) [8, 9, 10]. Variant consequences were predicted based on Refseq gene models (https://www.ncbi.nlm.nih.gov/refseq/) Precomputed pathogenicity predictions for missense variants were obtained from the dbNSFP v2 database (https://google.com/site/popgen/dbNSFP) [11] and the MutationTaster, SIFT, Polyphen2, MetaSVM, CADD and REVEL (which are included in the dbNSFP database referred above) estimates were used in prioritization of pathogenic variants. Evolutionary conservation rates of the variant sites was based on GERP++ rejected substation (RS) scores [12].

Variant frequency information for worldwide populations was based on the data from gnomAD project (gnomad.broadinstitute.com) We used the in-house population variant frequency estimates based on the internal data of 3000 Slovenian exomes as a source of variation frequency in the background population. We used ClinVar as the source of information on variant-disease associations [13].

Variant Filtration Strategy. The annotated variants were filtered using three strategies, based on the assumed pattern of inheritance, autosomal dominant (de novo variants), autosomal recessive (homozygous and compound heterozygous variants) and X-linked (de novo and maternally inherited variants). We used the frequency threshold of 0.01% in any of the surveyed populations in the dominant scenario and the frequency threshold of 0.1% for the autosomal recessive and X-linked scenarios. We performed manual interpretation of the remaining variants with consideration of clinical overlap, variant rarity in the general population, theoretical pathogenicity predictions and evolutionary conservation. At this stage, all the variants were also manually inspected at read level to control the quality of the variant calls.

Results

Exome analysis. Initially, we focused the search for causative variants in the proband to panels of genes, possibly associated with the clinical condition, including 1607 genes in an expanded developmental delay panel, 611 genes in an undiagnosed metabolic disorders panel and 155 genes in an intellectual disability panel. This initial search failed to reveal any causative variants.

Subsequently, we performed the search for candidate de novo and biallelic variants in novel genes. This revealed the presence of a heterozygous de novo missense variant in the H3F3A gene (NM_002107.4: c.185T>G) detected in the proband and absent in the parents. This variant is predicted to result in a p.(Leu62Arg) amino acid substitution in the Histone H3 protein.

The identified variant was absent in all populations of the gnomAD project and was also not detected in our in-house database of 3000 exomes. Furthermore, we observed a depletion of the overall rate of missense variants in the H3F3A gene in the gnomAD database, indicating an evolutionary constraint against missense variation in this gene, with only five missense variants in H3F3A observed in the gnomAD database populations (Z score for missense variant constraint in gnomAD dataset equaled 3.2).

The identified variant affects a highly conserved leucine residue at position 62 of the Histone H3 protein and the GERP rejected substitution score indicated high conservation rate (GERP RS score 4.97) of the variant nucleotide position across species. The substitution of the aliphatic hydrophobic leucine residue with a positively charged hydrophilic arginine residue affects the polarity at position 62 and thus predicted to affect the physico-chemical properties of the affected protein domain in the proximity of the DNA binding site. All the utilized tools for in silico prediction of the variant’s effect on protein function predict its damaging effect in consensus, including MutationTaster, SIFT, Polyphen2, MetaSVM, CADD and REVEL (Figure 2).

Figure 2

The identified H3F3A gene variant affects a highly evolutionary conserved leucine at amino acid position 62 in the Histone H3 protein, which is invariant in all the surveyed species (according to Multiz alignments). Figure was generated using the University of CA Santa Cruz (UCSC) genome browser (https://genome.ucsc.edu/)

Discussion

We report the H3F3A gene as a new candidate gene, associated with microcephaly, severe global developmental delay and intellectual disability. Additionally, prenatal and postnatal growth retardation, structural abnormalities of central nervous system and dysmorphological features were present in the reported patient.

The H3F3A gene, together with the H3F3B gene, codes the replacement histone H3.3. The H3.3-containing nucleosomes remain highly dynamic and are involved in the control of neuronal- and glial-specific gene expression patterns throughout life, which are required to support synaptic connectivity and behavioral plasticity [14]. H3.3 knock down resulted in decreased numbers of dendritic spines on pyramidal cells and global reduction of both excitatory and inhibitory synapses in the mouse model [14]. Disruption of the H3F3A gene in transgenic mice resulted in stunted growth and neuromuscular deficits [15]. Somatic mutations in the H3F3 gene were reported in pediatric glioblastoma [16] and in giant cell tumors of the bone [17]. Furthermore, it was recently suggested that the pathogenic mechanism of germline histone mutations is distinct from that of the published cancer-associated somatic histone mutations, and may converge on control of cell proliferation [18].

In our patient, we identified a heterozygous de novo missense variant in the H3F3A gene (NM_002107.4: c. 185 T>G), which is absent from the gnomAD and from the Slovenian Genome databases. The identified variant affects a highly conserved leucine residue at position 62 of the Histone H3 protein and is predicted to affect the physico-chemical properties of the affected protein domain.

Further evidence for the potential role of the H 3F3A gene in neurodevelopment is also based on the ClinVar reports of two patients with likely pathogenic heterozygous missense mutations in the H3F3A gene, c.377A>G (p.Gln 126Arg), who both demonstrated a neurological phenotype including hypotonia, global developmental delay, intellectual disability, visual impairment and dysmorphic features (ClinVar accession: VCV000520774.1) [13].

The limitation of our case report is lack of functional studies and lack of additional patients with mutations in the H3F3A gene and the suggested phenotype. Nevertheless, we provide evidence on a potentially new mechanism involving histones in the regulation of gene expression in the brain as the cause of a syndromic microcephaly.

eISSN:
1311-0160
Language:
English
Publication timeframe:
2 times per year
Journal Subjects:
Medicine, Basic Medical Science, other