Genomics, Circuits, and Pathways in Clinical Neuropsychiatry
()
About this ebook
This foundational work comprehensively examines the current state of the genetics, genomics and brain circuitry of psychiatric and neurological disorders. It consolidates discoveries of specific genes and genomic regions associated with these conditions, the genetic and anatomic architecture of these syndromes, and addresses how recent advances in genomics are leading to a reappraisal of the biology underlying clinical neuroscience. In doing so, it critically examines the promise and limitations of these discoveries toward treatment, and to the interdisciplinary nature of understanding brain and behavior. Coverage includes new discoveries regarding autism, epilepsy, intellectual disability, dementias, movement disorders, language impairment, disorders of attention, schizophrenia, and bipolar disorder. Genomics, Circuits, and Pathways in Clinical Neuropsychiatry focuses on key concepts, challenges, findings, and methods in genetics, genomics, molecular pathways, brain circuitry, and related neurobiology of neurologic and psychiatric disorders.
- Provides interdisciplinary appeal in psychiatry, neurology, neuroscience, and genetics
- Identifies key concepts, methods, and findings
- Includes coverage of multiple disorders from autism to schizophrenia
- Reviews specific genes associated with disorders
- Discusses the genetic architecture of these syndromes
- Explains how recent findings are influencing the understanding of biology
- Clarifies the promise of these findings for future treatment
Related to Genomics, Circuits, and Pathways in Clinical Neuropsychiatry
Related ebooks
Functional Neuromarkers for Psychiatry: Applications for Diagnosis and Treatment Rating: 0 out of 5 stars0 ratingsNeuropsychiatric Disorders and Epigenetics Rating: 5 out of 5 stars5/5The Neurobiology of Schizophrenia Rating: 0 out of 5 stars0 ratingsEpigenetics in Psychiatry Rating: 0 out of 5 stars0 ratingsFrom Molecules to Networks: An Introduction to Cellular and Molecular Neuroscience Rating: 5 out of 5 stars5/5Quantitative EEG, Event-Related Potentials and Neurotherapy Rating: 5 out of 5 stars5/5Sleep and Neurologic Disease Rating: 0 out of 5 stars0 ratingsThe Neurology of Consciousness: Cognitive Neuroscience and Neuropathology Rating: 5 out of 5 stars5/5Epigenetic Regulation in the Nervous System: Basic Mechanisms and Clinical Impact Rating: 0 out of 5 stars0 ratingsNeuronal Networks in Brain Function, CNS Disorders, and Therapeutics Rating: 0 out of 5 stars0 ratingsNeurobiology of Sleep and Memory Rating: 0 out of 5 stars0 ratingsPersonalized Psychiatry Rating: 0 out of 5 stars0 ratingsComputational Epigenetics and Diseases Rating: 0 out of 5 stars0 ratingsGene Therapy in Neurological Disorders Rating: 0 out of 5 stars0 ratingsAddictive Substances and Neurological Disease: Alcohol, Tobacco, Caffeine, and Drugs of Abuse in Everyday Lifestyles Rating: 0 out of 5 stars0 ratingsSystems Neuroscience in Depression Rating: 0 out of 5 stars0 ratingsPsychophysiology: Today and Tomorrow Rating: 2 out of 5 stars2/5Executive Functions in Health and Disease Rating: 5 out of 5 stars5/5Stress: Physiology, Biochemistry, and Pathology: Handbook of Stress Series, Volume 3 Rating: 0 out of 5 stars0 ratingsChromatin Signaling and Neurological Disorders Rating: 0 out of 5 stars0 ratingsSleep and Affect: Assessment, Theory, and Clinical Implications Rating: 0 out of 5 stars0 ratingsThe Wiley Handbook of Evolutionary Neuroscience Rating: 0 out of 5 stars0 ratingsMolecular Aspects of Neurodegeneration, Neuroprotection, and Regeneration in Neurological Disorders Rating: 0 out of 5 stars0 ratingsTranslational Neuroimaging: Tools for CNS Drug Discovery, Development and Treatment Rating: 0 out of 5 stars0 ratingsConn's Translational Neuroscience Rating: 0 out of 5 stars0 ratingsElectromyography in CNS Disorders: Central EMG Rating: 0 out of 5 stars0 ratingsStress: Neuroendocrinology and Neurobiology: Handbook of Stress Series, Volume 2 Rating: 3 out of 5 stars3/5Environmental Factors in Neurodevelopmental and Neurodegenerative Disorders Rating: 0 out of 5 stars0 ratings
Biology For You
Gut: The Inside Story of Our Body's Most Underrated Organ (Revised Edition) Rating: 4 out of 5 stars4/5Why We Sleep: Unlocking the Power of Sleep and Dreams Rating: 4 out of 5 stars4/5Sapiens: A Brief History of Humankind Rating: 4 out of 5 stars4/5Anatomy 101: From Muscles and Bones to Organs and Systems, Your Guide to How the Human Body Works Rating: 4 out of 5 stars4/5Dopamine Detox: Biohacking Your Way To Better Focus, Greater Happiness, and Peak Performance Rating: 3 out of 5 stars3/5The Sixth Extinction: An Unnatural History Rating: 4 out of 5 stars4/5The Grieving Brain: The Surprising Science of How We Learn from Love and Loss Rating: 4 out of 5 stars4/5Peptide Protocols: Volume One Rating: 4 out of 5 stars4/5The Winner Effect: The Neuroscience of Success and Failure Rating: 5 out of 5 stars5/5Lifespan: Why We Age—and Why We Don't Have To Rating: 4 out of 5 stars4/5"Cause Unknown": The Epidemic of Sudden Deaths in 2021 & 2022 Rating: 5 out of 5 stars5/5The Soul of an Octopus: A Surprising Exploration into the Wonder of Consciousness Rating: 4 out of 5 stars4/5Other Minds: The Octopus, the Sea, and the Deep Origins of Consciousness Rating: 4 out of 5 stars4/5How Emotions Are Made: The Secret Life of the Brain Rating: 4 out of 5 stars4/5Ultralearning: Master Hard Skills, Outsmart the Competition, and Accelerate Your Career Rating: 4 out of 5 stars4/5The Obesity Code: the bestselling guide to unlocking the secrets of weight loss Rating: 4 out of 5 stars4/5Lies My Gov't Told Me: And the Better Future Coming Rating: 4 out of 5 stars4/5Jaws: The Story of a Hidden Epidemic Rating: 4 out of 5 stars4/5The Book of Fungi: A Life-Size Guide to Six Hundred Species from around the World Rating: 4 out of 5 stars4/5Mother of God: An Extraordinary Journey into the Uncharted Tributaries of the Western Amazon Rating: 4 out of 5 stars4/5A Crack In Creation: Gene Editing and the Unthinkable Power to Control Evolution Rating: 4 out of 5 stars4/5A Letter to Liberals: Censorship and COVID: An Attack on Science and American Ideals Rating: 3 out of 5 stars3/5The Code Breaker: Jennifer Doudna, Gene Editing, and the Future of the Human Race Rating: 4 out of 5 stars4/5Woman: An Intimate Geography Rating: 4 out of 5 stars4/5All That Remains: A Renowned Forensic Scientist on Death, Mortality, and Solving Crimes Rating: 4 out of 5 stars4/5Suicidal: Why We Kill Ourselves Rating: 4 out of 5 stars4/5The Blood of Emmett Till Rating: 4 out of 5 stars4/5This Will Make You Smarter: 150 New Scientific Concepts to Improve Your Thinking Rating: 4 out of 5 stars4/5The Coming Plague: Newly Emerging Diseases in a World Out of Balance Rating: 4 out of 5 stars4/5
Reviews for Genomics, Circuits, and Pathways in Clinical Neuropsychiatry
0 ratings0 reviews
Book preview
Genomics, Circuits, and Pathways in Clinical Neuropsychiatry - Thomas Lehner
Genomics, Circuits, and Pathways in Clinical Neuropsychiatry
Editors
Thomas Lehner
Office of Genomics Research Coordination, National Institute of Mental Health, Bethesda, MD, United States
Bruce L. Miller
Department of Neurology, School of Medicine, University of California San Francisco, San Francisco, CA, United States
Matthew W. State
Department of Psychiatry, School of Medicine, University of California San Francisco, San Francisco, CA, United States
Table of Contents
Cover image
Title page
Copyright
List of Contributors
Preface
Section I. The Genome Tools and Methods
Chapter 1. The Newly Emerging View of the Genome
Introduction
Conceptualizing DNA
Information Flow in Biological Systems
Key Definitions
The Genome (DNA)
Visualizing the Genome
Large-Scale Structures in the Genome
Small-Scale Structures in the Genome
Multiple Roles
Variation in the Genome
Ethnicity and Population Frequency of Variants in the Genome
Variation in Complex and Mendelian Disorders
Chromatin (DNA, RNA, and Protein) and Epigenetics
Regulatory Complexes
Transcription (RNA)
Posttranscriptional mRNA Modifications
Translation
Posttranslational Modifications
Summary
Chapter 2. Contribution of Genetic Epidemiology to Our Understanding of Psychiatric Disorders
Introduction
Background: Epidemiology
Genetic Epidemiology of Psychiatric Disorders
Applications of Genetic Epidemiology in Molecular Era
Chapter 3. Natural Selection and Neuropsychiatric Disease: Theory, Observation, and Emerging Genetic Findings
Introduction
Epidemiology of Neuropsychiatric Disease
Theoretical Considerations
Hypothesis Testing: Epidemiology and Emerging Genetic Data
Mendelian Randomization and Polygenic Risk
De Novo Mutation and Mutation–Selection Balance
Conclusions
Chapter 4. Genome Tools and Methods: Rare Genetic Variation
Rare Variants in Psychiatric Disease
Genetic Variation
Modern Technologies for Discovery of Rare Variants
Microarray Technology and Large-Scale Copy Number Variants in Neurodevelopmental and Neuropsychiatric Disease
Copy Number Variant Studies of Disease
Studies of De Novo Mutation in Trio Families
Case–Control Studies
Analysis Strategies
Detection of Rare Genetic Variation by High-Throughput Sequencing
Rare Genetic Variation in Noncoding DNA: A New Frontier
Rare Genetic Variants and the Potential to Affect Clinical Care
Chapter 5. Neuroepigenomics and Human Disease
Epigenetics
Epigenetics in Neurobiological Research
The Genomic Context of Transcriptional and Epigenetic Mechanisms
Transcriptional and Epigenetic Regulatory Mechanisms
Epigenetic Perturbation and Epigenome-Wide Assays
Cell Type Choices in Neuroepigenomic Studies
Interaction of the Genome With the Epigenome
Epigenetics Mechanisms of Brain Diseases: General Principles and Future Directions
Chapter 6. Bioinformatics in Neuropsychiatric Genomics
What is Bioinformatics?
Databases
Methods
Tools
Standards
Summary
List of URLs
Chapter 7. Imaging Genomics and ENIGMA
Introduction
Power of MRI
What Is Imaging Genomics?
Mapping Brain Diseases in Neurology
MRI Versus Cellular Measures
Mapping Brain Disorders: Psychiatry
Psychiatric Neuroimaging Expands Worldwide
ENIGMA Studies of Brain Disease
Defusing Controversy With Meta-Analysis
Imaging Genomics and Genome-Wide Association Studies
Genetic Influences on the Brain
Genes and Disease Risk
Manhattan Plots
Candidate Genes and Genome-Wide Association Studies
Genome-Wide Significance
Formation of ENIGMA in 2009
Surprises From Imaging Genomics Consortia
Other Neuroimaging Methods
Multivariate Imaging Genomics and Big Data
Searching the Brain for Gene Effects
Conclusions
Chapter 8. Brain in a Dish: Stem Cell Technologies to Study Disorders of the Central Nervous System
Generating Induced Pluripotent Stem Cells
Progress in Modeling Human Central Nervous System Pathology: Neurodevelopmental Disorders
Rett Syndrome
Fragile X Syndrome
Angelman Syndrome
Timothy Syndrome
Contribution of Different Cell Types to Neurodevelopmental Disorders
The Next Step of Induced Pluripotent Stem Cells: Three-Dimensional Cultures and Mini-Brains
Limitations of Induced Pluripotent Stem Cell Model
Genome Editing: Creating Diseased Embryonic Stem Cells
Drug Discovery With Induced Pluripotent Stem Cells
Conclusions
Chapter 9. Association Strategies
Common Variants
Rare Variants
Conclusions
Chapter 10. Reconstructing Causal Network Models of Human Disease
Introduction
Modeling Biological Data
Causality as a Statistical Inference
From Assessing Causal Relationships Among Trait Pairs to Predictive Gene Networks
Application of Predictive Network Models to High-Throughput Screening
Conclusion and Future Directions
Chapter 11. Gene Networks in Neuropsychiatric Disease
Introduction
RNA, Protein, and Epigenetic Molecular Levels in Neurobiology
The Challenge of Spatial and Temporal Heterogeneity in the Central Nervous System
Gene Networks Provide a Framework for Neurobiological Interpretation
Gene Networks in Neuropsychiatric Disorders
Conclusions and Future Directions
Chapter 12. Somatic Mosaicism and Neurological Diseases
Introduction
Cortical Clonal Architecture and Somatic Mutations
Somatic Mutations in Normal Brain
Somatic Mutation in Neurological Disease
Types of Somatic Variants
Tissue Type Considerations
Tools to Study Somatic Variation in the Brain
Conclusion
Section II. A New Neuroanatomy
Chapter 13. The Molecular Landscape of the Developing Human Central Nervous System
Introduction
The Cellular and Structural Complexity of the Human Brain
General Principles of Human Neocortical Development
Transcriptional Landscape of the Developing Human Brain
Epigenomic and Regulatory Landscapes of the Developing Human Brain
Insights Into Psychiatric and Neurological Disorders
Conclusions and Future Directions
Chapter 14. Optogenetic Approaches to Neural Circuit Analysis in the Mammalian Brain
Introduction
Tools for Functional Neural Circuit Dissection
Functional Circuit Analysis
Translational Medicine
Conclusions
Chapter 15. Brain Imaging With Magnetoencephalography During Rest and During Speech and Language Processing
Brain Imaging With Magnetoencephalography (MEG)
Sensing the Brain's Magnetic Fields
From Sensing to Imaging
Magnetoencephalography Studies in Aging and Dementia
Summary and Conclusions
Chapter 16. Resting-State Functional MRI: A Novel Tool for Understanding Brain Networks in Neuropsychiatric Disorders
A Brief History of Resting-State Functional MRI
Healthy Skepticism Drives Deeper Explorations
Attributes of Resting-State Networks
Default-Mode and Salience Networks
Applications to Brain Disease
Future Directions
Chapter 17. Neuroimaging Advances in Alzheimer's Disease
Introduction
Alzheimer's Disease Defined
Imaging Characteristics of Alzheimer's Disease
Progression of Neuroimaging Abnormalities: An Amyloid Cascade?
Underlying Mechanisms of Alzheimer's Disease Pathology
Conclusions
Chapter 18. Progressive Supranuclear Palsy and Related Parkinsonian Disorders
Introduction
Each Neurodegenerative Syndrome Reflects a Network
Parkinsonian Syndromes: Parkinson Disease and Others
Network Architecture of Neurodegenerative Parkinsonian Syndromes
Clinicopathological Correlation in Progressive Supranuclear Palsy and Corticobasal Degeneration
Genetic Factors in Progressive Supranuclear Palsy and Corticobasal Degeneration
Conclusion
Chapter 19. A New Neuroanatomy of Basal Ganglia Circuitry
Anatomic Overview of Basal Ganglia
Clinical Basal Ganglia Disorders
Classical Basal Ganglia Microanatomy
Probing Basal Ganglia Circuitry in Animal Models
Emerging Techniques to Study Basal Ganglia Circuitry in Animals
Probing Basal Ganglia Circuitry in Humans: Invasive Physiology
Noninvasive Studies of Basal Ganglia Circuitry in Humans
Conclusions
Chapter 20. Brainstem Circuitry and Emotions
The Structural Organization of the Brainstem
Reticular Formation
Reticular Formation Modulates Behavior
Conclusion
Chapter 21. Apathy: Frontal and Basal Ganglia Circuits
Introduction
Definition and Diagnosis of Apathy
Clinical Variants, Diagnosis, and Measurement of Apathy
Measurement and Rating Scale
Molecular and Neuroanatomical Basis of Apathy
Prefrontal Cortex in Primates
Prefrontal Cortex in Humans
Motivation and Reward System: Basal Ganglia Structures and Corticostriatal Loop
Structural and Functional Connectivity in Apathy
Dopamine Regulation in Motivation and Reward System
Apathy in Neuropsychiatric Diseases
Genetics Studies of Apathy
Summary
Chapter 22. Emotional Dysfunction in Psychopathology and Neuropathology: Neural and Genetic Pathways
Emotional Dysfunction in Psychopathology and Neuropathology: Neural and Genetic Pathways
Emotions and Emotional Processes Emotions: Short and Discrete
Emotional Processes
Emotional Functioning: Neural Pathways
Emotional Reactivity
Emotion Regulation
Emotional Affiliation
Emotion Functioning: Genetic Pathways
Model of the Genetic Pathway
Emotional Reactivity
Emotion Regulation
Emotional Affiliation
Neural and Genetic Pathways: Clinical Implications
Emotional Dysfunction in Psychopathology and Neuropathology
Emotion in Diagnosis
Emotion in Treatment
Chapter 23. The Anatomy of Delusion
Introduction and Definition
Phenomenology and Anatomy
Content-Specific Delusions
Genetics
Chapter 24. Beyond Dopamine: The Role of the Serotonergic System and Treatments in Understanding and Treating Visual Hallucinations in Parkinson Disease
Simple Versus Complex Visual Hallucinations
The Visual Pathway and Mechanisms of Disruption Leading to Visual Hallucinations
Complex Visual Hallucinations: The Role of Lesion Location
Synucleinopathy and Visual Hallucinations
The Serotonergic System and Visual Hallucinations
Pharmacological Interventions for Visual Hallucinations in Parkinson Disease
Summary
Section III. Clinical Phenomenologie
Chapter 25. Risk Overlap Between Clinical Disorders
Introduction
Clinical Neuroscience, Neurology, and Psychiatry
The Diseases and Disorders in Section III
Overlap
Genomics as a Way to Understand Risk Overlap
Wielding Genomic Tools
The Genetic Architectures of Section III Entities
Risk Overlap
Conclusion
Financial Conflicts of Interest
List of URLs
Chapter 26. The NIMH Research Domain Criteria Project: Toward an Integrated Neuroscience of Mental Disorders
Introduction
Why Research Domain Criteria?
Research Domain Criteria Process and Structure
Intermediate Phenotypes and Endophenotypes
Loosening the Constraints of Research
Research Domain Criteria and Genetics
Meeting in the Middle
Research Domain Criteria and Big Data
Summary and Conclusions
Competing Interests
Chapter 27. Schizophrenia
Introduction
Heritability and Genetic Architecture
The Genomics Era
Genome-Wide Association Studies
Structural Variation Studies
Sequencing Studies
Next Steps in Gene Discovery
Circuits and Pathways
Future Directions
Chapter 28. The Emergence and Underlying Neurobiology of Psychosis
Introduction
The Clinical Emergence of Psychosis
Clinical Phenotype to Endophenotype
Neurocognition
Neuroimaging
Neurodevelopmental Perspective
Links to Genomics: Challenges and Future Directions
Chapter 29. Autism Spectrum Disorder: Genes to Pathways to Circuits
Introduction
Genetics and Genomics
From Genes to Biology in Autism Spectrum Disorder
Conclusions
Chapter 30. Molecular Architecture and Neurobiology of Bipolar Disorder
Descriptive Epidemiology
Comorbidity
Genetic Epidemiology
Molecular Genetics
Intermediate Traits and the Biology of Bipolar Disorder
Neuroimaging Intermediate Traits
Summary and Conclusions
Chapter 31. Conceptualizing Major Depression: From Genes to Neuroanatomy to Epidemiology
Prevalence of Major Depressive Disorder
Risk Factors for Major Depressive Disorder
Diagnostic Heterogeneity in Major Depressive Disorder
Neuroanatomy of Major Depressive Disorder
Functional Neuroimaging in Major Depressive Disorder
Heritability
Endophenotypes
Genetics
Treatment for Major Depression
Conclusions
Chapter 32. Speech and Language Disorders
Introduction
Language Networks and Pathways
Neurodevelopmental Language Disorders
Primary Progressive Aphasia
Chapter 33. Molecular Pathways Leading to the Clinical Phenomenology of Frontotemporal Dementia
Introduction to Frontotemporal Dementia and Associated Clinical Syndromes
Three Unique Genetic Mechanisms Converging on Frontotemporal Dementia Spectrum Disorders
Neuroanatomical Circuits and Clinical Syndromes in Autosomal Dominant Frontotemporal Lobar Degeneration
Chapter 34. The Genetic Basis of Alzheimer's Disease: Findings From Genome-Wide Studies
Introduction
Genetics of Early-Onset Familial Alzheimer's Disease
Amyloid Precursor Protein
Presenilin Genes (PSEN1 and PSEN2)
Other Early-Onset Familial Alzheimer's Disease Genes
Late-Onset Sporadic Form of Alzheimer's Disease
Apolipoprotein E
ADAM Metallopeptidase Domain 10
Triggering Receptor Expressed on Myeloid Cells 2
CD33 (Myeloid Cell Surface Antigen CD33; Sialic Acid-Binding Immunoglobulin-Like Lectin 3)
Cas Scaffolding Protein Family Member 4
Complement Component (3b/4b) Receptor 1
Bridging Integrator 1
Clusterin
Adenosine Triphosphate-Binding Cassette, Subfamily A (ABC1), Member 7
Phosphatidylinositol-Binding Clathrin Assembly Protein
Membrane-Spanning 4 Domain Subfamily A Members 4A and 6A
CUGBP, Elav-Like Family Member 1
Sortilin-Related Receptor, LDLR Class A Repeats Containing
Inositol Polyphosphate-5-Phosphatase
CD2-Associated Protein
Major Histocompatibility Complex Class II, DRβ1 and 5
EPH Receptor A1
Myocyte Enhancer Factor 2C
NME/NM23 Family Member 8
Zinc Finger, CW Type With PWWP Domain 1
Fermitin Family Member 2
Protein Tyrosine Kinase 2β
Ras and Rab Interactor 3 (RIN3) and Solute Carrier Family 24 Sodium/Potassium/Calcium Exchanger, Member 4 (SLC24A4)
Conclusion
Chapter 35. Posttraumatic Stress Disorder: From Circuits to Genes
Introduction
Clinical Aspects of Posttraumatic Stress Disorder Diagnosis
Neural Circuits of Fear and Extinction, and Their Dysregulation in Posttraumatic Stress Disorder
Genetic Studies of Posttraumatic Stress Disorder
Conclusions
Acknowledgment
Disclosures
Chapter 36. Neurodevelopmental Disorders, Causes, and Consequences
Neurodevelopmental Disorders, Causes, and Consequences
Causes of Disorders of Neurodevelopment
Genetics of Neurodevelopmental Disorders
Mechanisms of Action and Consequences on the Circuits of Cognition and Behavior
Future Directions
Chapter 37. Molecular Architecture and Neurobiology of the Epilepsies
Introduction
Progress in Epilepsy Genetics
Genetic Architecture of the Epilepsies
Voltage-Gated Channelopathies
Ligand-Gated Channelopathies
Vesicle Trafficking Dysregulation
Miscellaneous Genes
Interpretation
Management/Therapeutic Implications
Future Studies and Conclusion
Chapter 38. Clinical Syndromes of Substance Use Disorder
Introduction
Overview of Substance Use Disorder
Circuits and Pathways That Mediate Substance Abuse
Cellular and Molecular Pathology of Substance Abuse
Genetic Factors Contributing to Substance Use Disorder
Conclusion
Chapter 39. Immunologic and Genetic Aspects of Type 1 Narcolepsy
Introduction to the Clinical Condition and Its Pathophysiology
Type 1 Narcolepsy Is Caused by a Loss of Hypocretin Neurons
Genetic Association of Narcolepsy With the Human Leukocyte Antigen
Non–Human Leukocyte Antigen Genes Are Also Associated With Narcolepsy
Narcolepsy in Association With Other Syndromic Genetic Disorders
Is Type 1 Narcolepsy an Autoimmune Disease?
Involvement of Environmental Factors in Narcolepsy
H1N1 Pandemrix Vaccination Leading to an Increase in Narcolepsy Incidence
Perspectives
Chapter 40. Genomic Landscape of Brain Tumors
Introduction
Meningiomas
Gliomas
Medulloblastomas
Ependymal Tumors
Summary
Chapter 41. White Matter Disorders
Introduction
History of White Matter Research
White Matter Anatomy
Clinical Syndromes
Chapter 42. Amyotrophic Lateral Sclerosis 1 and Many Diseases
Introduction
Epidemiology
Clinical Features
Clinical Subtypes of Amyotrophic Lateral Sclerosis
Amyotrophic Lateral Sclerosis Anatomy
Amyotrophic Lateral Sclerosis Pathology
Amyotrophic Lateral Sclerosis Genetics by Putative Mechanism
Amyotrophic Lateral Sclerosis Models
Discussion and Future Directions
Chapter 43. Eating Disorders
Clinical Syndromes
Human Genetic Research
Neural Circuit Dissection of Feeding Behavior Related to Eating Disorders
Concluding Remarks
Section IV. Clinical Translation and Drug Development
Chapter 44. Psychiatric Pharmacogenomics: Translating Genomics
Definition and Overview
Connection to Genomics in Psychiatry and Implications for Architecture
Challenges
Principal Findings by Therapeutic Area
Efforts at Clinical Translation
Applications in Clinical Investigation
Policy and Regulatory Considerations
Emerging Directions
Index
Copyright
Academic Press is an imprint of Elsevier
125 London Wall, London EC2Y 5AS, UK
525 B Street, Suite 1800, San Diego, CA 92101-4495, USA
50 Hampshire Street, 5th Floor, Cambridge, MA 02139, USA
The Boulevard, Langford Lane, Kidlington, Oxford OX5 1GB, UK
Copyright © 2016 Elsevier Inc. All rights reserved.
No part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying, recording, or any information storage and retrieval system, without permission in writing from the publisher. Details on how to seek permission, further information about the Publisher’s permissions policies and our arrangements with organizations such as the Copyright Clearance Center and the Copyright Licensing Agency, can be found at our website: www.elsevier.com/permissions.
This book and the individual contributions contained in it are protected under copyright by the Publisher (other than as may be noted herein).
Notices
Knowledge and best practice in this field are constantly changing. As new research and experience broaden our understanding, changes in research methods, professional practices, or medical treatment may become necessary.
Practitioners and researchers may always rely on their own experience and knowledge in evaluating and using any information, methods, compounds, or experiments described herein. In using such information or methods they should be mindful of their own safety and the safety of others, including parties for whom they have a professional responsibility.
To the fullest extent of the law, neither the Publisher nor the authors, contributors, or editors, assume any liability for any injury and/or damage to persons or property as a matter of products liability, negligence or otherwise, or from any use or operation of any methods, products, instructions, or ideas contained in the material herein.
British Library Cataloguing-in-Publication Data
A catalogue record for this book is available from the British Library
Library of Congress Cataloging-in-Publication Data
A catalog record for this book is available from the Library of Congress
ISBN: 978-0-12-800105-9
For information on all Academic Press publications visit our website at https://www.elsevier.com/
Publisher: Nikki Levy
Acquisition Editor: Nikki Levy
Editorial Project Manager: Barbara Makinster
Production Project Manager: Caroline Johnson
Designer: Matthew Limbert
Typeset by TNQ Books and Journals
www.tnq.co.in
List of Contributors
Nii A. Addy, Yale University School of Medicine, New Haven, CT, United States
Laura Almasy, University of Texas Rio Grand Valley, Brownsville, TX, United States
Lynn M. Almli
Emory University, Atlanta, GA, United States
Harvard Medical School, Belmont, MA, United States
Nicholas AuYong, University of California Los Angeles, Los Angeles, CA, United States
Hanwen Bai, Yale University School of Medicine, New Haven, CT, United States
Daniel J. Barrera, University of California San Diego, San Diego, CA, United States
Carrie E. Bearden, University of California Los Angeles, Los Angeles, CA, United States
James D. Berry, Neurological Clinical Research Institute (NCRI), Massachusetts General Hospital, Boston, MA, United States
Brianne M. Bettcher, University of Colorado School of Medicine, Aurora, CO, United States
Carolyn Bevan, University of California San Francisco, San Francisco, CA, United States
John Blangero, University of Texas Rio Grand Valley, Brownsville, TX, United States
Yvette M. Bordelon, University of California Los Angeles, Los Angeles, CA, United States
Jesse A. Brown, University of California San Francisco, San Francisco, CA, United States
Cynthia M. Bulik
University of North Carolina Center of Excellence for Eating Disorders, Chapel Hill, NC, United States
Karolinska Institutet, Stockholm, Sweden
Victoria E. Clark, Yale University School of Medicine, New Haven, CT, United States
Aiden Corvin, Trinity College Dublin, Dublin, Ireland
Joanne E. Curran, University of Texas Rio Grand Valley, Brownsville, TX, United States
Bruce N. Cuthbert, National Institute of Mental Health, National Institutes of Health, Bethesda, MD, United States
Alissa M. D'Gama
Howard Hughes Medical Institute, Boston, MA, United States
Harvard Medical School, Boston, MA, United States
Mark J. Daly
Massachusetts General Hospital, Boston, MA, United States
Broad Institute of MIT and Harvard, Cambridge, MA, United States
Ryan S. Dhindsa, Columbia University Medical Center, New York, NY, United States
Gül Dölen, Johns Hopkins University, Baltimore, MD, United States
Andrew Evans, Royal Melbourne Hospital, Melbourne, VIC, Australia
Nelson B. Freimer, University of California Los Angeles, Los Angeles, CA, United States
Daniel H. Geschwind, University of California Los Angeles, Los Angeles, CA, United States
Michael D. Geschwind, University of California San Francisco, San Francisco, CA, United States
David C. Glahn
Hartford Hospital, Hartford, CT, United States
Yale University School of Medicine, New Haven, CT, United States
David B. Goldstein, Columbia University Medical Center, New York, NY, United States
John M. Greally, Albert Einstein College of Medicine, Bronx, NY, United States
Michael Greicius, Stanford University School of Medicine, Stanford, CA, United States
Lea T. Grinberg
University of California San Francisco, San Francisco, CA, United States
University of São Paulo Medical School, São Paulo, Brazil
Jennifer M. Günel, Yale University School of Medicine, New Haven, CT, United States
Murat Günel, Yale University School of Medicine, New Haven, CT, United States
Raquel E. Gur, University of Pennsylvania, Philadelphia, PA, United States
Ruben C. Gur, University of Pennsylvania, Philadelphia, PA, United States
Claudia M. Haase, Northwestern University, Evanston, IL, United States
Helmut Heinsen
University of São Paulo Medical School, São Paulo, Brazil
University Hospital Würzburg, Würzburg, Germany
Derrek P. Hibar, University of Southern California, Los Angeles, CA, United States
Karen Hodgson, Yale University School of Medicine, New Haven, CT, United States
Basavaraj Hooli
Massachusetts General Hospital, Boston, MA, United States
Harvard Medical School, Boston, MA, United States
Branka Hrvoj-Mihic, University of California San Diego, La Jolla, CA, United States
Louis Jacob, Stanford University Center for Narcolepsy, Palo Alto, CA, United States
Neda Jahanshad, University of Southern California, Los Angeles, CA, United States
Saumya S. Jamuar
KK Women's and Children's Hospital, Singapore
Howard Hughes Medical Institute, Boston, MA, United States
Harvard Medical School, Boston, MA, United States
Adrienne M. Keener, University of California Los Angeles, Los Angeles, CA, United States
Emma E.M. Knowles, Yale University School of Medicine, New Haven, CT, United States
Stephan Lammel, University of California Berkeley, Berkeley, CA, United States
Suzee E. Lee, University of California San Francisco, San Francisco, CA, United States
Robert W. Levenson, University of California Berkeley, Berkeley, CA, United States
Mingfeng Li, Yale School of Medicine, New Haven, CT, United States
Mauro F.C. Lins, Clínica-Escola de Autismo, Itaboraí, Rio de Janeiro, Brazil
Daniel H. Lowenstein, University of California San Francisco, San Francisco, CA, United States
Joshua Mahlios, Stanford University Center for Narcolepsy, Palo Alto, CA, United States
Robert C. Malenka, Stanford University School of Medicine, Stanford, CA, United States
Christopher E. Mason, Cornell University, New York, NY, United States
Samuel R. Mathias, Yale University School of Medicine, New Haven, CT, United States
Mario F. Mendez
University of California Los Angeles, Los Angeles, CA, United States
Greater Los Angeles VA Healthcare System, Los Angeles, CA, United States
Kathleen R. Merikangas, National Institutes of Health, Bethesda, MD, United States
Alison K. Merikangas, University of Pennsylvania, Philadelphia, PA, United States
Emmanuel Mignot, Stanford University Center for Narcolepsy, Palo Alto, CA, United States
Ramon Mocellin
Royal Melbourne Hospital, Melbourne, VIC, Australia
University of Melbourne and Melbourne Health, Melbourne, VIC, Australia
Rebecca A. Muhle, Yale University School of Medicine, New Haven, CT, United States
Alysson R. Muotri, University of California San Diego, La Jolla, CA, United States
Georges Naasan, University of California San Francisco, San Francisco, CA, United States
Srikantan S. Nagarajan, University of California San Francisco, San Francisco, CA, United States
Benjamin M. Neale
Massachusetts General Hospital, Boston, MA, United States
Broad Institute of MIT and Harvard, Cambridge, MA, United States
Priscilla D. Negraes, University of California San Diego, La Jolla, CA, United States
Eric J. Nestler, Icahn School of Medicine at Mount Sinai, New York, NY, United States
Hanna M. Ollila, Stanford University Center for Narcolepsy, Palo Alto, CA, United States
Rene L. Olvera, University of Texas Health Science Center San Antonio, San Antonio, TX, United States
Pongsatorn Paholpak
Khon Kaen University, Khon Kaen, Thailand
University of California Los Angeles, Los Angeles, CA, United States
Neelroop N. Parikshak, University of California Los Angeles, Los Angeles, CA, United States
Roy H. Perlis, Harvard Medical School, Boston, MA, United States
Sirisha Pochareddy, Yale School of Medicine, New Haven, CT, United States
Carlos Portera-Cailliau, University of California Los Angeles, Los Angeles, CA, United States
Nader Pouratian, University of California Los Angeles, Los Angeles, CA, United States
Shaun M. Purcell
Harvard Medical School, Boston, MA, United States
Icahn School of Medicine at Mount Sinai, New York City, NY, United States
Gil D. Rabinovici, University of California San Francisco, San Francisco, CA, United States
Kamalini G. Ranasinghe, University of California San Francisco, San Francisco, CA, United States
Elena Ratti, Neurological Clinical Research Institute (NCRI), Massachusetts General Hospital, Boston, MA, United States
Hannah E. Reed, Yale University School of Medicine, New Haven, CT, United States
Kerry J. Ressler
Emory University, Atlanta, GA, United States
Harvard Medical School, Belmont, MA, United States
Elise B. Robinson
Massachusetts General Hospital, Boston, MA, United States
Broad Institute of MIT and Harvard, Cambridge, MA, United States
Stephan J. Sanders, University of California San Francisco, San Francisco, CA, United States
Eric E. Schadt, Icahn School of Medicine at Mount Sinai, New York City, NY, United States
Daniel R. Schonhaut, University of California San Francisco, San Francisco, CA, United States
Jonathan Sebat, University of California San Diego, San Diego, CA, United States
William W. Seeley, University of California San Francisco, San Francisco, CA, United States
Katerina Semendeferi, University of California San Diego, La Jolla, CA, United States
Nenad Sestan, Yale School of Medicine, New Haven, CT, United States
Kevin A. Shapiro, University of California San Francisco, San Francisco, CA, United States
Elliot H. Sherr, University of California San Francisco, San Francisco, CA, United States
John C. Silbereis, Yale School of Medicine, New Haven, CT, United States
Wojciech B. Solecki
Institute of Pharmacology PAS, Krakow, Poland
Jagiellonian University, Krakow, Poland
Edoardo G. Spinelli
University of California San Francisco, San Francisco, CA, United States
Vita-Salute San Raffaele University, Milan, Italy
Matthew W. State, University of California San Francisco, San Francisco, CA, United States
Jason L. Stein, University of North Carolina, Chapel Hill, NC, United States
Garret D. Stuber, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
Virginia E. Sturm, University of California San Francisco, San Francisco, CA, United States
Patrick F. Sullivan
University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
Karolinska Institutet, Stockholm, Sweden
Rudolph E. Tanzi
Massachusetts General Hospital, Boston, MA, United States
Harvard Medical School, Boston, MA, United States
Maria L.G. Tempini, University of California San Francisco, San Francisco, CA, United States
Panos Theofilas, University of California San Francisco, San Francisco, CA, United States
Paul M. Thompson, University of Southern California, Los Angeles, CA, United States
Cleber A. Trujillo, University of California San Diego, La Jolla, CA, United States
Dennis Velakoulis
Royal Melbourne Hospital, Melbourne, VIC, Australia
University of Melbourne, Melbourne, VIC, Australia
Keith A. Vossel, University of California San Francisco, San Francisco, CA, United States
Christopher A. Walsh
Howard Hughes Medical Institute, Boston, MA, United States
Harvard Medical School, Boston, MA, United States
Broad Institute of MIT and Harvard, Cambridge, MA, United States
Mark Walterfang
Royal Melbourne Hospital, Melbourne, VIC, Australia
University of Melbourne, Melbourne, VIC, Australia
Aliza P. Wingo
Emory University, Atlanta, GA, United States
Harvard Medical School, Belmont, MA, United States
Nailin Yao, Yale University School of Medicine, New Haven, CT, United States
Jennifer S. Yokoyama, University of California San Francisco, San Francisco, CA, United States
Mark W. Youngblood, Yale University School of Medicine, New Haven, CT, United States
Peter Zandi, Johns Hopkins School of Medicine, Baltimore, MD, United States
Ying Zhu, Yale School of Medicine, New Haven, CT, United States
Preface
Clinical neuroscience is in the midst of a profound transformation. Parallel advances in genomics and neurobiology are revealing the pathophysiological mechanisms underlying both common and rare disorders and empowering the search for novel and more effective therapies. Over the last few years, a revolution in gene discovery has changed the landscape for many neurological and psychiatric conditions. The path to reliable and systematic gene discovery has been firmly established, providing a solid molecular foundation upon which to build a deeper understanding of the pathophysiology of multiple clinical syndromes. The next wave of the -omics
revolution is already upon us, providing the opportunity to study germline and somatic variation, gene expression and, soon, the proteome, at scale, offering the ability to readily edit the genome and thereby illuminate function, and increasingly leveraging powerful informatics and systems biological approaches. Recent advances in neuroscience have been no less dramatic: the field is steadily characterizing brain in a wide range of species at cellular resolution and across development; leveraging an array of tools to assess and manipulate circuits in unprecedented ways; integrating multiple levels of analysis from developmental and molecular to computational and systems neuroscience; and developing stem cell technologies promising to provide the opportunity to study previously inaccessible biological processes restricted to the human central nervous system.
These advances are driving the reintegration of the clinical neurosciences most dramatically neurology and psychiatry. The demands on the contemporary clinician, regardless of discipline, to understand a wide range of fields from genomics to molecular, cellular, and circuit level neuroanatomy has never been greater and will undoubtedly increase over the coming years. For example, genetic testing is becoming a standard of care for multiple diagnoses across psychiatry and neurology, and recent findings promise to make these assays even more commonplace in clinical and clinical research settings. And recent efforts to transform the psychiatric diagnostic nosology are pushing investigators and clinicians to prioritize the understanding of CNS anatomy and function, and their links to psychopathology, in an unprecedented fashion.
This is the motivation for this volume: to provide relevant foundational scientific knowledge to clinicians, researchers, and trainees in a wide range of neuroscience disciplines as a resource for understanding the relevance of recent advances for clinical syndromes.
The first section provides a comprehensive review of genetics and genomics relevant to clinical neuroscience. Chapters address the contemporary view of the human genome, provide an overview of the state-of-the-art methods to detect and analyze common and rare variation both in clinical and epidemiological samples, describe emerging tools to examine and manipulate the genome, and offer an introduction to a range of related topics including epigenetics, bioinformatics, stem cell models, imaging genomics, pathway analysis, and systems biology.
The second section of the book focuses on the neuroanatomy and neurocircuitry associated with psychiatric and neurological conditions. Chapters in this section address molecular, cellular, and circuit level analyses, describe state-of-the-art methods to study and manipulate circuits, and review specific functional and behavioral domains and their anatomical underpinnings—including mood and emotion, apathy, delusions, hallucinations, and movement.
The third and fourth sections endeavor to bring this foundational knowledge to bear on a discussion of clinical syndromes and therapeutics development. This begins with a discussion of the overlap of genetic risk for a wide-range disorders and the challenges of and emerging alternatives to categorical diagnosis—particularly in psychiatry. Disorders and syndromes are then covered individually and range from those that have been traditionally considered the domain of psychiatry, to neurological and neurodegenerative and neurooncological disorders. As the title of the volume suggests, all of the authors address what is currently known about the genomic, pathway, and circuit level aspects of pathology. Not surprisingly, there is considerable variation in the state-of-the-science for various conditions. As a general proposition, disorders that have traditionally been the remit of neurology and neurosurgery have already accumulated a stronger knowledge base with regard to genetics, molecular pathology, and anatomy. Of course, there are several syndromes traditionally tied to psychiatry in which tremendous recent progress has been made in elaborating molecular, cellular, and circuit level mechanism.
The use of the term neuropsychiatry in the title of this volume is a prediction. As the basic science of brain and behavior continues to advance rapidly, easy distinctions between neurological and psychiatric disorders will continue to erode and a new synthesis—and new professional pathways—will emerge. It is not, however, meant to reify the term versus other similar ones. Indeed, it is likely that any current distinctions between, for instance, neuropsychiatry, clinical neuroscience, and behavioral neurology will also become less and less clear with time. Moreover, our view of the future celebrates the differences in emphasis and culture that currently characterize distinct subfields focused on mind and brain. Recent advances do not necessitate a reductionist view. The astonishing scientific progress that is challenging the conventional wisdom leaves plenty of room for the types of differences that drive aspiring surgeons versus psychotherapists, neuroimmunologist, neurogeneticists, and basic neuroscientists. Our hope is that this text will serve as a useful resource for anyone interested in a core set of principles, practices, and methods relevant to the contemporary understanding of disorders of the human brain.
Thomas Lehner, Bruce Miller, and Matthew State
Section I
The Genome Tools and Methods
Outline
Chapter 1. The Newly Emerging View of the Genome
Chapter 2. Contribution of Genetic Epidemiology to Our Understanding of Psychiatric Disorders
Chapter 3. Natural Selection and Neuropsychiatric Disease: Theory, Observation, and Emerging Genetic Findings
Chapter 4. Genome Tools and Methods: Rare Genetic Variation
Chapter 5. Neuroepigenomics and Human Disease
Chapter 6. Bioinformatics in Neuropsychiatric Genomics
Chapter 7. Imaging Genomics and ENIGMA
Chapter 8. Brain in a Dish: Stem Cell Technologies to Study Disorders of the Central Nervous System
Chapter 9. Association Strategies
Chapter 10. Reconstructing Causal Network Models of Human Disease
Chapter 11. Gene Networks in Neuropsychiatric Disease
Chapter 12. Somatic Mosaicism and Neurological Diseases
Chapter 1
The Newly Emerging View of the Genome
Stephan J. Sanders University of California San Francisco, San Francisco, CA, United States
Christopher E. Mason Cornell University, New York, NY, United States
Abstract
Advances in methods for sequencing, characterizing, and profiling human genomes have provided a wealth of opportunities for understanding the genetic basis of neuropsychiatric disorders. Alongside the sequencing of the human genome, large-scale efforts have mapped DNA variation in healthy populations, identified variants associated with numerous neuropsychiatric disorders, revealed complex machinery that regulates DNA and protein synthesis, and cataloged the genes expressed in various tissues and throughout development. Intricate knowledge of these genomic advances will help scientists make further progress in research and clinicians to apply these resources and methods to patient populations. This chapter will build a foundation of knowledge for the interpretation of genomics research, application of genomics methods, and creation of genome-guided medicine.
Keywords
Chromatin; Coding; Enhancer; Epigenome; Exon; Genetics; Genome; Genomics; Noncoding; Promoter; Transcription; Transcriptome; Translation
Introduction
A genomic revolution has been underway since the beginning of this century. A series of transformative advances, starting with the sequencing of the human genome (Lander et al., 2001; Venter et al., 2001), have enabled high-resolution, genome-wide analysis to become a routine technique in both research and clinical practice. After generation of the human reference genome by the Human Genome Project, the Haplotype Mapping (HapMap) Consortium began the process of cataloging human genetic variation. The combination of these HapMap variants and microarray technology enabled genome-wide association studies (GWAS) to find common variants associated with human disease.
The subsequent invention of high-throughput sequencing reduced the price of DNA sequencing by 50% every 5 months between 2006 and 2014: data generation required for the 10-year, $3 billion human genome project thus could be achieved in 3 days for $2000. This advance has enabled gene discovery in complex disorders such as intellectual disability and autism spectrum disorder (ASD), provided high-resolution maps of human variation (1000 Genomes Project and Exome Aggregation Consortium), and enabled tissue-specific maps of gene expression (Genotype-Tissue Expression Project [GTEx] and BrainSpan).
However, rather than yielding a working model of the genome, these advances have simply begun to map the sheer complexity of the dynamic role played by DNA in cells. On top of the DNA genome lies an epigenome composed of myriad interacting factors that regulate how, when, and where DNA is expressed as RNA. The extent of this noncoding regulatory architecture correlates with organismal complexity (Taft, Pheasant, & Mattick, 2007) to a far greater degree than the protein-coding regions; furthermore, many of the genetic loci identified in human disease through GWAS and sequencing analyses target regulatory regions or the genes involved in DNA regulation. The Encyclopedia of DNA Elements (ENCODE) Project has generated tissue-specific maps of chromatin, the first-level of this regulation, yet the list of regulatory processes is expanding faster than the understanding of how these processes function or interact.
Therefore, the genomic revolution has transformed our understanding of biology, human diversity, and human disease, laying a foundation for a future era of genome-guided precision medicine. To achieve this goal the modern clinician will require familiarity not just with the 1.5% of protein-coding DNA but also the 98.5% of noncoding DNA that describes how the resulting proteins create a human and a human brain.
Conceptualizing DNA
Early descriptions of DNA describe a passive molecule that stores information to be read by the cell as required, analogous to a book. Contemporary molecular analysis has revealed a vastly more dynamic and active molecule capable of both storing and processing information, analogous to a computer. To extend this computer analogy, the genome contains hardware
in the form of nucleotides, software
in the form of epigenetic modifications, such as methylation, and information processing capabilities in the form of factors that integrate internal and external signals to regulate mRNA transcription. Furthermore, this genomic computer
can be reconfigured to run different operating systems (eg, Windows, Mac OS, Linux) in the form of hundreds or possibly thousands of cell types (eg, stem cells, neurons, or lymphocytes). This entire process takes place in concert with the other molecular and structural components of the cell, in a manner similar to the bidirectional feedback between the central nervous system and the body.
However, several aspects of the genome go beyond a computer analogy. Foremost among these is that the genome contains the information to reproduce itself, including replication of the information and development from a single copy (ie, zygote) to a fully developed adult. In addition, the components of a computer are usually discrete, with a hard drive for storage and a processor for data integration, whereas in the genome these processes are fully integrated with the regulatory machinery acting directly on the nucleotides and their surroundings. Finally, whereas computers are based on the reliable transmission of bits of information (ie, 0 or 1), the genome acts through inherently stochastic chemical interactions which are essentially continuous (0–1 rather than 0 or 1). Therefore, whereas computers act by enforcing conformity, biological systems act by manipulating stochastic processes, constraining them to the degree that is required to achieve the necessary outcome reliably.
Information Flow in Biological Systems
To make sense of the data generated by genomics, it is necessary to understand how the information encoded in DNA leads to an observable human phenotype. This chapter will describe the first four molecular steps in this process in detail (Fig. 1.1A), identifying specific examples relevant to neuropsychiatry where possible. However, the information from the DNA continues to exert an influence through the actions of proteins across further levels of organization, ultimately resulting in an observable phenotype (Fig. 1.1B). This includes the formation of circuits and pathways in the brain, in tandem with environmental and regulatory influences.
Key Definitions
Cis and Trans
These terms describe whether a DNA sequence acts on a nearby region, eg, a promoter acting in cis to initiate transcription of a gene, or a distant region, eg, a gene encoding a transcription factor that acts in trans at multiple binding sites across the genome.
Gene
The word gene
was used initially to describe a particle that is inherited to form a specific trait (pangene,
1889, Hugo de Vries) or an independent determinant of a hereditary characteristic (gene,
1909, Wilhelm Johannsen). After the discovery of transcription, translation, and the triplet code, the word gene
became synonymous with protein-coding gene,
and this is still the most common usage of the term. However, one gene can have multiple isoforms, a noncoding enhancer could be considered part of the gene unit, and noncoding transcripts can have an influence on a hereditary characteristic. Two contemporary definitions have been proposed: (1) A gene is a union of genomic sequences encoding a coherent set of potentially overlapping functional products
(Gerstein et al., 2007), and (2) A gene is a discrete genomic region whose transcription is regulated by one or more promoters and distal regulatory elements and which contains the information for the synthesis of functional proteins or noncoding RNAs, related by the sharing of a portion of genetic information at the level of the ultimate products (proteins or RNAs)
(Pesole, 2008).
Transcript
This is a catchall phrase used to describe lengths of RNA that are produced from DNA. Many, but not all of these transcripts will be the mRNAs transcribed from genes.
High-Throughput Sequencing
This is also called next-generation sequencing and it has revolutionized genomics. The previous generation, called Sanger sequencing or dideoxy sequencing, involved amplifying a specific region of DNA using the polymerase chain reaction (PCR), inserting one of four fluorescently labeled nucleotides with chain terminators that prevent further DNA replication, and then comparing the fluorescent color with the length of the DNA to read
the output. This method was used to map the human genome in 2001 (Lander et al., 2001). In contrast, high-throughput methods read multiple strands of DNA simultaneously: for example, using the Illumina sequencing by synthesis method that involves amplifying a strand of DNA to form a cluster of identical DNA strands and then replicating all of the strands in the cluster using fluorescently labeled nucleotides with reversible chain terminators. An image of the cluster is taken to record the fluorescent color present, and then the chain termination is removed, allowing the next nucleotide to be added and imaged in a similar manner. These high-throughput methods produce numerous reads
of sequencing that have to be aligned to the genome for interpretation. High-throughput sequencing has been applied to many different research methods, including:
Figure 1.1 Information flow in biological systems.
(A) The genome describes the totality of information encoded in DNA nucleotides. Proteins and RNAs surround the DNA to form chromatin; the complete set of these reversible chemical changes in a cell or groups of cells is called the epigenome. Chromatin regulates the transcription of DNA into RNA; the complete set of RNA transcripts in a cell or groups of cells is called the transcriptome. Protein-coding mRNA is translated into proteins through the action of ribosomes and tRNAs; the complete set of proteins in the cell or groups of cells is called the proteome. (B) Information encoded by DNA nucleotides is amplified through multiple molecular and structural stages, eventually resulting in an observable phenotype in the organism. Throughout this amplification process the information flow can be regulated by upstream or downstream processes and further influenced by environmental factors. Finally, the phenotype of the organism governs its ability to reproduce, leading to selective pressure that determines whether the DNA is passed on to the next generation.
• Whole exome sequence (WES): The 1.5% of DNA in protein-coding exons is captured then sequenced to identify DNA variants in at least 80% of protein-coding nucleotides.
• Whole genome sequence (WGS): All of the DNA in the genome is sequenced to identify DNA variants in at least 95% of nucleotides.
• RNA sequencing (RNA-Seq): RNA transcripts are captured (eg, by selecting for a poly(A) tail) and converted to complementary DNA, which can be sequenced. This is used to assess the quantity and type of RNAs in the cell.
• Chromatin immunoprecipitation sequencing (ChIP-Seq): Chromatin immunoprecipitation creates cross-links between proteins and DNA, and then uses an antibody to recognize a specific marker (eg, a histone mark or transcription factor). The antibody and bound protein/DNA are isolated. Then the DNA is released and can be sequenced. This allows identification of the location of epigenetic markers in the genome.
• Methyl sequencing: This technique identifies regions of DNA that are methylated. This can be achieved by either bisulfite treatment, which converts unmethylated cytosine (C) to uracil (U) and comparing treated to untreated DNA, or immunoprecipitation (methylated DNA immunoprecipitation) with an antibody specific to methylated DNA.
• DNase sequencing: DNase I is an enzyme that digests DNA. Active regions of DNA that have been exposed through chromatin remodeling are hypersensitive to this enzyme so comparing treated to untreated DNA identifies these regions.
• Assay for Transposase-Accessible Chromatin sequencing (ATAC-Seq): This technique uses a highly active transposase enzyme to insert sequencing adapters into exposed (open) DNA. The technique is complementary to DNase sequencing, but requires less starting DNA and provides higher resolution of the open DNA (Buenrostro, Wu, Chang, & Greenleaf, 2015).
The Genome (DNA)
The term genome
describes the complete set of DNA found in the cells of an organism. DNA is made up of two polymer chains arranged in a double helix and bound together by hydrogen bonds. Each polymer chain is composed of multiple copies of four nucleotides, each of which is composed of a phosphate group, a deoxyribose sugar, and one of four nucleobases: adenine (A), cytosine (C), guanine (G), and thymine (T). Both polymer chains start with a phosphate group on the fifth carbon in the deoxyribose sugar (called the 5′ end) and ends with a hydroxyl group on the third carbon in the deoxyribose sugar (3′); this distinction gives directionality to the DNA which influences function, because both DNA and RNA polymerases act only in a 5′ to 3′ direction. The two polymer chains are arranged in opposite directions, so that the 5′ end of one chain opposes the 3′ end of the other. The nucleotides on each chain are paired, so that A binds to T with two hydrogen bonds, whereas C binds to G with three hydrogen bonds. Along with creating redundancy (ie, each nucleotide can be imputed from the opposite strand) the differing number of hydrogen bonds makes CG pairing slightly stronger than AT pairing. This GC bias has an influence on numerous molecular techniques, eg, PCR and whole-genome sequencing.
Each chromosome is composed of a single DNA molecule; after the Human Genome Project (Lander et al., 2001; Venter et al., 2001), most of the human genome was mapped. Every nucleotide pair is numbered starting at the 5′ end of the short arm of the chromosome (p arm) (Fig. 1.2) and continuing to the 3′ end at the end of the long arm of the chromosome (q arm). The polymer chain (that is, 5′ to 3′ in this arrangement) is called the positive strand, whereas the opposing chain (3′ to 5′) is the negative strand. By convention, the nucleotide at each position is expressed as that on the positive strand.
In humans, the genome is composed of 46 chromosomes (two pairs of 22 autosomes and two sex chromosomes: XX or XY) and the mitochondrial chromosome, of which there may be numerous copies per cell. In total, this equates to about 12.8 billion nucleotides (including both strands and both chromosome copies), 6.4 billion nucleotide pairs (the diploid genome, including both chromosome copies), or 3.2 billion numbered nucleotide pairs (the haploid genome, including only one chromosome copy). The haploid genome measure is commonly used to measure DNA or describe a position in the chromosome, with a single nucleotide pair being described as 1 base pair (bp); 1 kbp is 1000 nucleotide pairs, and 1 Mbp is 1 million nucleotide pairs.
Figure 1.2 Chromosomal structure.
An ideogram of human chromosome 16 with Giemsa banding is shown as an example of chromosomal structure. The centromere splits the chromosome into a short p arm and a longer q arm. Each arm is capped by a subtelomeric region and telomere.
Visualizing the Genome
Three of the most frequently used tools for visualizing the genome are UCSC Genome Browser (genome.ucsc.edu), the Ensembl Genome Browser (www.ensembl.org), and the Integrative Genome Viewer (www.broadinstitute.org/igv/). All of these are publicly accessible and have a wealth of information about the genome of humans and those of other species, including known genes and common variants.
Large-Scale Structures in the Genome
To visualize chromosomes, cell division is arrested during metaphase, when the chromosomes that have been replicated are at their most condensed, forming the familiar X shape. Giemsa banding is frequently used to stain the condensed chromosomes; this solution binds to the phosphate in DNA with a preference for AT nucleotides rather than GC nucleotides (Fig. 1.2). This leads to a series of discrete bands, each of a few million nucleotides that can be observed through a microscope. Broadly, the AT-rich darkly staining bands tend to be heterochromatin with fewer genes and more repetitive DNA, whereas the GC-rich, lightly staining bands tend to be euchromatin; however, this distinction rarely has practical applications. The human chromosomes are numbered from 1 to 22 based on their length in this condensed state, with 1 being the largest (247 million nucleotide pairs) and 21 being the smallest (50 million nucleotide pairs).
Telomeres
Much like the hard cover on the front and back of a book, at each end of the chromosome there is a region of repetitive DNA called the telomere. In all vertebrates, including humans, the telomere is composed of 1000 to 3000 six-nucleotide TTAGGG
repeats (Fig. 1.2). At the end of the telomere there is an overhanging 5′ strand that forms a T-loop, shaped like a paperclip and protecting the end of the DNA polymer, and preventing chromosomes from joining together. Of note, the length of telomeres decreases with each cell division in differentiated cells, preventing further cell division once a critical length is reached. This limitation of 50–70 cell divisions is called the Hayflick limit.
The length of telomeres varies among chromosomes, tissues, individuals, and species, and according to exposure to environmental factors (Gomez et al., 2012; Riethman, 2008).
Subtelomeres
Immediately after the telomeres lie the subtelomeric regions. These are 100 to 500 kbp long and composed of numerous segmental duplications (long stretches of DNA that are nearly identical to other stretches of DNA on the same or other chromosomes), subtelomeric repeats, and stretches of the TTAGGG telomeric sequence (Fig. 1.2) (Riethman, 2008). These regions are prone to mutation, particularly translocation between chromosomes, and vary widely between individuals and species. Despite the repetitive sequence, the subtelomeres initiate transcription of both the subtelomeric and telomeric regions, resulting in numerous transcripts including Telomeric Repeat-containing RNA, a long noncoding RNA (lncRNA) that has an important role in regulating the telomere, and Wiskott-Aldrich syndrome protein and SCAR Homolog genes which have an important role in the actin cytoskeleton. These subtelomeric genes are related to the Wiskott–Aldrich Syndrome Protein (WASP) that is mutated in Wiskott–Aldrich X-linked immunodeficiency, though WASP itself is not subtelomeric (Linardopoulou et al., 2007).
Centromeres
The middle
of the chromosome is called the centromere and it is the point at which the kinetochore protein complex forms to allow the attachment of spindle fibers that pull sister chromatids apart during cell division (Fig. 1.2). This region is largely composed of repetitive DNA, particularly α-satellite repeats, but it is the histone markers, rather than the DNA sequence, that are recognized by the kinetochore. Specifically, the usual H3 histone protein is replaced by centromere protein A (Fukagawa & Earnshaw, 2014).
Small-Scale Structures in the Genome
Understanding the information content and functional role of specific DNA regions requires integrating numerous types of data. Comparisons between the genomes of different organisms allow regions of conserved DNA to be detected, indicating key information content. Analysis of transcribed RNA through expressed sequence tags and RNA-Seq has identified active transcribed regions of DNA and numerous types of RNA (Table 1.5). The GTEx project has brought tissue-level resolution to this endeavor (www.gtexportal.org). Bioinformatic analysis of DNA sequences has identified protein-coding genes through open reading frames, transcription factor motifs, and binding sites, and the ENCODE project (www.encodeproject.org) has begun to form a comprehensive map of epigenetic markers in various tissues. This endeavor is proceeding rapidly, and undoubtedly there are further insights to be found that will fundamentally alter our perception of how DNA functions, such as an investigation of three-dimensional (3D-structure). The following discussion summarizes some key components that have been identified to date.
Protein-Coding Genes
Regions of DNA are transcribed to form messenger RNAs (mRNAs), which are translated by ribosomes to form protein products. Several gene standard groups, including the HUGO Gene Nomenclature Committee (www.genenames.org), the Consensus Coding Sequence project (www.ncbi.nlm.nih.gov/CCDS/), and the Genes of ENCODE (www.gencodegenes.org), identify 18,826–19,797 distinct protein-coding genes. The ambiguity is due to the multiple isoforms that each gene can have, with differing transcription start sites (TSS), exons, and ends, and these names and annotations are continuously being updated. Protein-coding genes are made up of the following components (Fig. 1.3).
Promoters
These are regions of DNA that are not transcribed, in which the preinitiation complex (PIC) can form. The classic
core promoter sequence is the TATA-box, an eight-nucleotide sequence that begins with TATA and has three variable sites shown by single-nucleotide polymorphism (SNP) codes (Table 1.3): TATAWAWR. However, only 10% of human genes have this exact sequence whereas a further 14% of genes have a TATA-like element (Yang, Bolotin, Jiang, Sladek, & Martinez, 2007). A higher proportion of human genes (46%) contain the eight-nucleotide initiator (INR) element (INR: YYANWYY), although 46% lack either TATA or INR elements. Along with the core promoter sequences, the surrounding nucleotides can modulate PIC binding (Sandelin et al., 2007). Although variation in the promoter region can have significant impacts on gene expression, it is rarely possible to predict what this effect will be without experimental testing. The promoter is commonly defined as 1000 bp upstream of the transcription start site (TSS).
Figure 1.3 Features of a protein-coding gene.
(A) A positive-strand gene is read from left to right on the positive strand. The promoter region is immediately upstream of the transcription start site (TSS). The first exon begins at the TSS, usually with a 5′ UTR, which may contain an intron (not shown). The 5′ UTR is followed a protein-coding sequence that begins with the ATG initiation codon. Exons and introns alternate along the transcript with the canonical splice sites marking the start and end of each intron. The protein-coding sequence ends with one of three stop codons, which are usually followed by a 3′ UTR, which may contain an intron as shown here. (B) The negative-strand gene is read from right to left on the positive strand. It contains the same elements and start/splicing/stop sequences as a positive-strand gene; however, because DNA is represented on the positive strand, the reverse complement of these sequences is shown, as would be observed in a genome browser.
Untranslated Regions
The PIC binds to the promoter and brings the RNA polymerase II (Pol II) enzyme to the TSS. DNA is transcribed into mRNA from the TSS; however, the first protein-coding sequence is usually some distance downstream. The transcribed region before the first protein-coding sequence is called the 5′ UTR, and DNA variants in this region have the potential to influence the resulting protein through transcriptional pausing, the formation of loops of RNA that affect ribosomal binding, and as a binding for site for regulators of transcription, eg, micro RNAs (miRNAs). Similarly, at the end of the gene, the transcribed region after the last first protein-coding sequence is called the 3′ UTR, and DNA variants in this region can also influence the resulting protein through variation in polyadenylation and translational efficiency. Notably, the 3′ UTRs of genes are significantly longer in neuronal and brain tissues than in other tissue types (Miura, Shenker, Andreu-Agullo, Westholm, & Lai, 2013).
Exons
These are regions of DNA that are not removed from the pre-mRNA when it is spliced to form the mature mRNA. They may be either protein-coding sequence or UTRs.
Introns
These are regions of DNA that are removed from the pre-mRNA when it is spliced to form the mature mRNA. They alternate with exons and similarly may be present between either protein-coding sequence or untranslated regions. While introns are removed before translation, they can still perform a wide range of regulatory functions, including variation in exon splicing and the presence of other types of RNA, eg, lncRNA (Chorev & Carmel, 2012).
Splice Sites
These are the boundaries between exons and introns. Most splice sites (about 99%) are marked by the two-nucleotide canonical splice site sequences: the GT donor
at the start of each intron and the AG acceptor
at the end of each intron. Variants in these canonical sites frequently lead to exon skipping and an altered reading frame. The intronic nucleotides near the canonical splice site are also highly conserved and can contribute to splicing behavior.
Protein-Coding Sequence
This includes all DNA in exons that are translated into proteins and it accounts for about 1.5% of the genome. The coding sequence is made up of three nucleotide blocks, each of which encodes one amino acid in the ribosome through the binding of tRNA (Tables 1.1 and 1.2). DNA variants in protein-coding sequence frequently change the resulting protein (a nonsynonymous variant) by changing one amino acid (a missense variant) or creating a premature stop codon (a nonsense variant). If the DNA change does not alter the resulting amino acid, it is described as synonymous or silent; however, these variants can still alter RNA stability, ribosomal pausing, and protein folding (Sauna & Kimchi-Sarfaty, 2011).
Noncoding Transcripts/Genes
Most transcripts and genes do not encode proteins; to date, over 40,000 noncoding transcripts and genes have been described, compared with about 20,000 protein-coding genes (www.gencodegenes.org). These noncoding genes are split into four categories of approximately similar size.
Table 1.1
Triplet Code of Amino Acids From Positive Strand Genes as Observed in the DNA on the Positive Strand of a Genome Browser
Table 1.2
Triplet Code of Amino Acids From Negative Strand Genes as Observed in the DNA on the Positive Strand of a Genome Browser
Pseudogenes
These genes are similar to functional genes but lack coding potential. There are three main classes: (1) processed, mature mRNA that has been reintegrated into the genome as DNA and lacks introns; (2) unprocessed, duplicated versions of existing genes; and (3) unitary, nonfunctional genes that are the only copy of previously functional genes (eg, premature stop codons in olfactory receptor genes). Although these genes do not become translated into proteins, they can still be transcribed and have regulatory roles (Ding, Qu, & Fang, 2014). Despite their degenerate state, DNA variants in pseudogenes may influence human disorders, possibly by acting as a sponge to suppress the actions of miRNA. A small number of pseudogenes have been identified as being differentially expressed in neurodegenerative disorders, including Alzheimer disease (Costa, Esposito, Aprile, & Ciccodicola, 2012).
Long Noncoding RNAs
These are defined as transcripts greater than 200 bp that have been processed and polyadenylated, but which lack any suggestion of an open reading frame that defines a protein-coding gene or that is degenerate in a pseudogene. Many lncRNAs represent antisense transcripts of protein-coding genes and therefore bind to the complementary mRNAs. Understanding of the role of lncRNAs continues to advance; however, they appear to have an important regulatory role, especially in fine-tuning mRNA transcription and protein translation. This regulatory role is facilitated by their ability to bind to DNA/RNA, bind to proteins, and form complex 3D structures. They are frequently observed to act as intermediaries guiding chromatin regulators to specific genomic loci (eg, methyltransferase genes). They can also act to edit mRNA, alter mRNA stability, influence protein folding, and influence posttranslational modification of proteins. Finally, lncRNAs have a critical role in X chromosome inactivation (X Inactive Specific Transcript) and genomic imprinting (eg, Antisense of IGF2R Non-protein-coding RNA) (Fatica & Bozzoni, 2014). There are several databases of lncRNAs, with some transcript annotations varying between databases (Fritah, Niclou, & Azuaje, 2014). Because of their regulatory roles, it is likely that DNA variants in lncRNAs have important roles in human disorders. One such example is an expansion of a triplet repeat in the lncRNA, ATXN8 opposite strand, that is the cause of spinocerebellar ataxia 8, possibly through the toxic effects of excess RNA.
Small Noncoding RNAs
These are RNAs smaller than 200 bp with varying degrees of posttranslational processing. Numerous classes have been described (see Table 1.5 for a complete list). They frequently act by targeting mRNA to regulate stability and translation. For example, miRNAs form part of the RNA-induced silencing complex that binds to mRNA to prevent translation. Like lncRNAs, it is likely that DNA variants in small noncoding RNAs have key roles in human disorders. Examples of this include both the miRNA MIR137 and its regulatory targets that are associated with schizophrenia through two large GWAS (Ripke et al., 2011; Schizophrenia Working Group of the Psychiatric Genomics Consortium, 2014). In addition, absence of the paternal copy of the small nucleolar RNA, SNORD116, is the cause of Prader–Willi syndrome, which is associated with developmental delay and overeating (Duker et al., 2010). There are several databases of small noncoding RNAs, including miRNAs (www.mirbase.org, mirdb.org/miRDB/), noncoding elements (www.noncode.org), and genome browsers.
Circular RNAs
These RNAs form a circle without exposed 5′ and 3′ ends, are consequently resistant to RNA degradation, and therefore are highly stable. They are usually formed from mRNA by a process of back-splicing
and are often composed of one or more exons joined at the canonical splice sites. Their expression pattern is similar to that of mRNAs, including the degree of cell type specificity and abundance. It has been suggested that they act as miRNA sponges,
serving to prevent mRNA degradation as a form of posttranscriptional regulation (Guo, Agarwal, Guo, & Bartel, 2014). The role of circRNA DNA variants in human disorders remains to be evaluated; however, it is likely to be difficult to distinguish the contribution of the variant through a circRNA from that of the parent protein-coding gene. CircBase is a database of circRNAs (www.circbase.org).
Enhancers
These are regions of DNA that act as binding sites for transcription factors that increase (activate) the transcription of nearby genes (ie, acting in cis). Whereas there are about 20,000 protein-coding genes in the human genome, there are estimated to be over 100,000 enhancers (E. P. Consortium, 2012) and possibly millions (Arnold et al., 2013). The location of many of these has been predicted (www.encodeproject.org), although only a small number of these have been experimentally validated (enhancer.lbl.gov). Enhancers are thought to be one of the main determinants of cell type by determining which genes are expressed. Most enhancers are within 100 kbp of the transcription start site of a gene; however, there are well-documented examples over 1000 kbp away: for example, the zone of polarizing activity regulatory sequence (ZRS), which is an enhancer of the Sonic Hedgehog (SHH) and in which DNA variants can lead to polydactyly (Lettice et al., 2003). A 28-bp deletion of the enhancer of the gene proteolipid protein 1 leads to X-linked recessive hypomyelinative leukodystrophy, characterized by nystagmus, spastic quadriplegia, ataxia, and developmental delay (Hobson et al., 2002).
Silencers
These are the opposite of enhancers; they act as transcription factor binding sites that repress the transcription of nearby genes; like enhancers, they are usually found in close proximity to the genes they regulate in cis. Silencers are considerably less well characterized than enhancers. However, some enhancers may double as silencers in different cell types (Kim et al., 2008). One common example of silencers is the neuron-restrictive silencer elements (NRSEs; also called Repressor Element-1 [RE1]) that silence neuronal genes in nonneuronal cell types. The RE1-Silencing Transcription Factor (REST; also called Neural-Restrictive Silencer Factor) gene encodes a transcription factor that acts as a repressor when bound to these NRSEs. Altered expression of REST in neurons has been implicated in Down syndrome (Canzonetta et al., 2008), Huntington disease (Zuccato et al., 2003), and Alzheimer disease (Lu et al., 2014).
Insulators
These are regions that block the activity of enhancers and silencers on nearby genes. A common mechanism of this insulation is through the formation of DNA loops that physically prevent interaction between neighboring regions of DNA, potentially creating regulatory domains in the genome. These loops are held in place through the action of cohesins that form a ringlike structure over two strands of DNA; these are often associated with the protein CTCF.
Repetitive Elements
Although most biological investigation focuses on nonrepetitive DNA, about 50% of the genome is duplicated to some extent (Treangen & Salzberg, 2012). Regions of dense repetitive DNA remain as about 160 gaps in the human reference genome, because they cannot be mapped with short DNA reads. Their repetitive nature interferes with DNA replication, making these regions hypermutable and therefore highly polymorphic between individuals.
About 3% of these regions are variable