• Something wrong with this record ?

Delineation of functionally essential protein regions for 242 neurodevelopmental genes

S. Iqbal, T. Brünger, E. Pérez-Palma, M. Macnee, A. Brunklaus, MJ. Daly, AJ. Campbell, D. Hoksza, P. May, D. Lal

. 2023 ; 146 (2) : 519-533. [pub] 2023Feb13

Language English Country England, Great Britain

Document type Journal Article, Research Support, N.I.H., Extramural, Research Support, Non-U.S. Gov't

Neurodevelopmental disorders (NDDs), including severe paediatric epilepsy, autism and intellectual disabilities are heterogeneous conditions in which clinical genetic testing can often identify a pathogenic variant. For many of them, genetic therapies will be tested in this or the coming years in clinical trials. In contrast to first-generation symptomatic treatments, the new disease-modifying precision medicines require a genetic test-informed diagnosis before a patient can be enrolled in a clinical trial. However, even in 2022, most identified genetic variants in NDD genes are 'variants of uncertain significance'. To safely enrol patients in precision medicine clinical trials, it is important to increase our knowledge about which regions in NDD-associated proteins can 'tolerate' missense variants and which ones are 'essential' and will cause a NDD when mutated. In addition, knowledge about functionally indispensable regions in the 3D structure context of proteins can also provide insights into the molecular mechanisms of disease variants. We developed a novel consensus approach that overlays evolutionary, and population based genomic scores to identify 3D essential sites (Essential3D) on protein structures. After extensive benchmarking of AlphaFold predicted and experimentally solved protein structures, we generated the currently largest expert curated protein structure set for 242 NDDs and identified 14 377 Essential3D sites across 189 gene disorders associated proteins. We demonstrate that the consensus annotation of Essential3D sites improves prioritization of disease mutations over single annotations. The identified Essential3D sites were enriched for functional features such as intermembrane regions or active sites and discovered key inter-molecule interactions in protein complexes that were otherwise not annotated. Using the currently largest autism, developmental disorders, and epilepsies exome sequencing studies including >360 000 NDD patients and population controls, we found that missense variants at Essential3D sites are 8-fold enriched in patients. In summary, we developed a comprehensive protein structure set for 242 NDDs and identified 14 377 Essential3D sites in these. All data are available at https://es-ndd.broadinstitute.org for interactive visual inspection to enhance variant interpretation and development of mechanistic hypotheses for 242 NDDs genes. The provided resources will enhance clinical variant interpretation and in silico drug target development for NDD-associated genes and encoded proteins.

References provided by Crossref.org

000      
00000naa a2200000 a 4500
001      
bmc23004190
003      
CZ-PrNML
005      
20230425141209.0
007      
ta
008      
230418s2023 enk f 000 0|eng||
009      
AR
024    7_
$a 10.1093/brain/awac381 $2 doi
035    __
$a (PubMed)36256779
040    __
$a ABA008 $b cze $d ABA008 $e AACR2
041    0_
$a eng
044    __
$a enk
100    1_
$a Iqbal, Sumaiya $u The Center for the Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA $u Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA $u Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA $1 https://orcid.org/0000000177004374
245    10
$a Delineation of functionally essential protein regions for 242 neurodevelopmental genes / $c S. Iqbal, T. Brünger, E. Pérez-Palma, M. Macnee, A. Brunklaus, MJ. Daly, AJ. Campbell, D. Hoksza, P. May, D. Lal
520    9_
$a Neurodevelopmental disorders (NDDs), including severe paediatric epilepsy, autism and intellectual disabilities are heterogeneous conditions in which clinical genetic testing can often identify a pathogenic variant. For many of them, genetic therapies will be tested in this or the coming years in clinical trials. In contrast to first-generation symptomatic treatments, the new disease-modifying precision medicines require a genetic test-informed diagnosis before a patient can be enrolled in a clinical trial. However, even in 2022, most identified genetic variants in NDD genes are 'variants of uncertain significance'. To safely enrol patients in precision medicine clinical trials, it is important to increase our knowledge about which regions in NDD-associated proteins can 'tolerate' missense variants and which ones are 'essential' and will cause a NDD when mutated. In addition, knowledge about functionally indispensable regions in the 3D structure context of proteins can also provide insights into the molecular mechanisms of disease variants. We developed a novel consensus approach that overlays evolutionary, and population based genomic scores to identify 3D essential sites (Essential3D) on protein structures. After extensive benchmarking of AlphaFold predicted and experimentally solved protein structures, we generated the currently largest expert curated protein structure set for 242 NDDs and identified 14 377 Essential3D sites across 189 gene disorders associated proteins. We demonstrate that the consensus annotation of Essential3D sites improves prioritization of disease mutations over single annotations. The identified Essential3D sites were enriched for functional features such as intermembrane regions or active sites and discovered key inter-molecule interactions in protein complexes that were otherwise not annotated. Using the currently largest autism, developmental disorders, and epilepsies exome sequencing studies including >360 000 NDD patients and population controls, we found that missense variants at Essential3D sites are 8-fold enriched in patients. In summary, we developed a comprehensive protein structure set for 242 NDDs and identified 14 377 Essential3D sites in these. All data are available at https://es-ndd.broadinstitute.org for interactive visual inspection to enhance variant interpretation and development of mechanistic hypotheses for 242 NDDs genes. The provided resources will enhance clinical variant interpretation and in silico drug target development for NDD-associated genes and encoded proteins.
650    _2
$a lidé $7 D006801
650    _2
$a dítě $7 D002648
650    12
$a neurovývojové poruchy $x genetika $7 D065886
650    _2
$a genetické testování $7 D005820
650    _2
$a mutace $x genetika $7 D009154
650    12
$a mentální retardace $x genetika $7 D008607
650    _2
$a missense mutace $7 D020125
655    _2
$a časopisecké články $7 D016428
655    _2
$a Research Support, N.I.H., Extramural $7 D052061
655    _2
$a práce podpořená grantem $7 D013485
700    1_
$a Brünger, Tobias $u Cologne Center for Genomics, University of Cologne, 50923 Köln, Germany
700    1_
$a Pérez-Palma, Eduardo $u Universidad del Desarrollo, Centro de Genética y Genómica, Facultad de Medicina Clínica Alemana, 7610658 Las Condes, Santiago de Chile, Chile $1 https://orcid.org/0000000305465141
700    1_
$a Macnee, Marie $u Cologne Center for Genomics, University of Cologne, 50923 Köln, Germany
700    1_
$a Brunklaus, Andreas $u The Paediatric Neurosciences Research Group, Royal Hospital for Children, Glasgow G12 8QQ, UK $u School of Health and Wellbeing, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow G12 8QQ, UK $1 https://orcid.org/0000000277286903
700    1_
$a Daly, Mark J $u Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA $u Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA 02114, USA $u Institute for Molecular Medicine Finland (FIMM), Centre of Excellence in Complex Disease Genetics, University of Helsinki, 00100 Helsinki, Finland
700    1_
$a Campbell, Arthur J $u The Center for the Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA $u Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
700    1_
$a Hoksza, David $u Department of Software Engineering, Faculty of Mathematics and Physics, Charles University, 110 00 Staré Město, Czechia, Czech Republic $1 https://orcid.org/0000000346790557
700    1_
$a May, Patrick $u Luxembourg Centre for Systems Biomedicine, University of Luxembourg, 4365 Esch-sur-Alzette, Luxembourg $1 https://orcid.org/0000000186983770
700    1_
$a Lal, Dennis $u Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA $u Cologne Center for Genomics, University of Cologne, 50923 Köln, Germany $u Epilepsy Center, Neurological Institute, Cleveland Clinic, Cleveland, OH 44195, USA $u Genomic Medicine Institute, Lerner Research Institute Cleveland Clinic, Cleveland, OH 44106, USA $1 https://orcid.org/0000000251739636
773    0_
$w MED00009356 $t Brain : a journal of neurology $x 1460-2156 $g Roč. 146, č. 2 (2023), s. 519-533
856    41
$u https://pubmed.ncbi.nlm.nih.gov/36256779 $y Pubmed
910    __
$a ABA008 $b sig $c sign $y p $z 0
990    __
$a 20230418 $b ABA008
991    __
$a 20230425141206 $b ABA008
999    __
$a ok $b bmc $g 1924701 $s 1190399
BAS    __
$a 3
BAS    __
$a PreBMC-MEDLINE
BMC    __
$a 2023 $b 146 $c 2 $d 519-533 $e 2023Feb13 $i 1460-2156 $m Brain $n Brain $x MED00009356
LZP    __
$a Pubmed-20230418

Find record

Citation metrics

Loading data ...

Archiving options

Loading data ...