Bioinformatics for Beginners: Genes, Genomes, Molecular Evolution, Databases and Analytical Tools

Ebook649 pages6 hours

Bioinformatics for Beginners: Genes, Genomes, Molecular Evolution, Databases and Analytical Tools

Name: Bioinformatics for Beginners: Genes, Genomes, Molecular Evolution, Databases and Analytical Tools
Brand: Academic Press
Rating: 4.7 (3 reviews)

By Supratim Choudhuri

Rating: 4.5 out of 5 stars

4.5/5

()

Read preview

About this ebook

Bioinformatics for Beginners: Genes, Genomes, Molecular Evolution, Databases and Analytical Tools provides a coherent and friendly treatment of bioinformatics for any student or scientist within biology who has not routinely performed bioinformatic analysis.

The book discusses the relevant principles needed to understand the theoretical underpinnings of bioinformatic analysis and demonstrates, with examples, targeted analysis using freely available web-based software and publicly available databases. Eschewing non-essential information, the work focuses on principles and hands-on analysis, also pointing to further study options.

Avoids non-essential coverage, yet fully describes the field for beginners
Explains the molecular basis of evolution to place bioinformatic analysis in biological context
Provides useful links to the vast resource of publicly available bioinformatic databases and analysis tools
Contains over 100 figures that aid in concept discovery and illustration

Skip carousel

LanguageEnglish

PublisherAcademic Press

Release dateMay 9, 2014

ISBN9780124105102

Author

Supratim Choudhuri

Dr. Supratim Choudhuri is a toxicologist at the Center for Food Safety and Applied Nutrition, U.S. Food and Drug Administration. Dr. Choudhuri has extensively published in the fields of molecular toxicology, metabolism, genomics, and epigenetics. He has previously edited a book, titled Genomics: Fundamentals and Applications with a colleague Dr. David B. Carlson.

Related authors

Skip carousel

Related to Bioinformatics for Beginners

Related ebooks

Skip carousel

DNA and Biotechnology
Ebook
DNA and Biotechnology
byMolly Fitzgerald-Hayes
Rating: 5 out of 5 stars
5/5
Bioinformatics Algorithms: Design and Implementation in Python
Ebook
Bioinformatics Algorithms: Design and Implementation in Python
byMiguel Rocha
Rating: 0 out of 5 stars
0 ratings
Protein Bioinformatics: From Sequence to Function
Ebook
Protein Bioinformatics: From Sequence to Function
byM. Michael Gromiha
Rating: 5 out of 5 stars
5/5
Statistics for Bioinformatics: Methods for Multiple Sequence Alignment
Ebook
Statistics for Bioinformatics: Methods for Multiple Sequence Alignment
byJulie Thompson
Rating: 0 out of 5 stars
0 ratings
Concepts and Techniques in Genomics and Proteomics
Ebook
Concepts and Techniques in Genomics and Proteomics
byN Saraswathy
Rating: 0 out of 5 stars
0 ratings
Bioinformatics for Everyone
Ebook
Bioinformatics for Everyone
byMohammad Yaseen Sofi
Rating: 0 out of 5 stars
0 ratings
Introduction to Bioinformatics Using Action Labs
Ebook
Introduction to Bioinformatics Using Action Labs
byJean-Louis Lassez
Rating: 0 out of 5 stars
0 ratings
Synthetic Biology: Tools and Applications
Ebook
Synthetic Biology: Tools and Applications
byHuimin Zhao
Rating: 0 out of 5 stars
0 ratings
Deep Learning in Bioinformatics: Techniques and Applications in Practice
Ebook
Deep Learning in Bioinformatics: Techniques and Applications in Practice
byHabib Izadkhah
Rating: 0 out of 5 stars
0 ratings
Probabilistic Methods for Bioinformatics: with an Introduction to Bayesian Networks
Ebook
Probabilistic Methods for Bioinformatics: with an Introduction to Bayesian Networks
byRichard E. Neapolitan
Rating: 0 out of 5 stars
0 ratings
Metagenomics for Microbiology
Ebook
Metagenomics for Microbiology
byJacques Izard
Rating: 5 out of 5 stars
5/5
Diagnostic Molecular Biology
Ebook
Diagnostic Molecular Biology
byChang-Hui Shen
Rating: 0 out of 5 stars
0 ratings
Challenges in Delivery of Therapeutic Genomics and Proteomics
Ebook
Challenges in Delivery of Therapeutic Genomics and Proteomics
byAmbikanandan Misra
Rating: 0 out of 5 stars
0 ratings
Chemoinformatics and Bioinformatics in the Pharmaceutical Sciences
Ebook
Chemoinformatics and Bioinformatics in the Pharmaceutical Sciences
byNavneet Sharma
Rating: 0 out of 5 stars
0 ratings
Fundamentals of Molecular Structural Biology
Ebook
Fundamentals of Molecular Structural Biology
bySubrata Pal
Rating: 0 out of 5 stars
0 ratings
Metagenomics: Perspectives, Methods, and Applications
Ebook
Metagenomics: Perspectives, Methods, and Applications
byMuniyandi Nagarajan
Rating: 0 out of 5 stars
0 ratings
Computational Non-coding RNA Biology
Ebook
Computational Non-coding RNA Biology
byYun Zheng
Rating: 0 out of 5 stars
0 ratings
Advances in Cell and Molecular Diagnostics
Ebook
Advances in Cell and Molecular Diagnostics
byP.B. Raghavendra
Rating: 5 out of 5 stars
5/5
Practical Guide for Biomedical Signals Analysis Using Machine Learning Techniques: A MATLAB Based Approach
Ebook
Practical Guide for Biomedical Signals Analysis Using Machine Learning Techniques: A MATLAB Based Approach
byAbdulhamit Subasi
Rating: 5 out of 5 stars
5/5
Methods in Biomedical Informatics: A Pragmatic Approach
Ebook
Methods in Biomedical Informatics: A Pragmatic Approach
byIndra Neil Sarkar
Rating: 0 out of 5 stars
0 ratings
Genome Editing: A Practical Guide to Research and Clinical Applications
Ebook
Genome Editing: A Practical Guide to Research and Clinical Applications
byKiran Musunuru
Rating: 0 out of 5 stars
0 ratings
All About Bioinformatics: From Beginner to Expert
Ebook
All About Bioinformatics: From Beginner to Expert
byYasha Hasija
Rating: 0 out of 5 stars
0 ratings
Genomic Control Process: Development and Evolution
Ebook
Genomic Control Process: Development and Evolution
byIsabelle S. Peter
Rating: 5 out of 5 stars
5/5
PCR Applications: Protocols for Functional Genomics
Ebook
PCR Applications: Protocols for Functional Genomics
byMichael A. Innis
Rating: 4 out of 5 stars
4/5
Bioinformatics with Python Cookbook
Ebook
Bioinformatics with Python Cookbook
byTiago Antao
Rating: 0 out of 5 stars
0 ratings
Clinical Genomics
Ebook
Clinical Genomics
byShashikant Kulkarni
Rating: 5 out of 5 stars
5/5
Basic Methods in Molecular Biology
Ebook
Basic Methods in Molecular Biology
byLeonard Davis
Rating: 5 out of 5 stars
5/5
Molecular Biology and Genomics
Ebook
Molecular Biology and Genomics
byCornel Mulhardt
Rating: 3 out of 5 stars
3/5
Understanding PCR: A Practical Bench-Top Guide
Ebook
Understanding PCR: A Practical Bench-Top Guide
bySarah Maddocks
Rating: 5 out of 5 stars
5/5
The Dictionary of Cell and Molecular Biology
Ebook
The Dictionary of Cell and Molecular Biology
byJohn M. Lackie
Rating: 5 out of 5 stars
5/5

Medical For You

Skip carousel

The Lost Book of Simple Herbal Remedies: Discover over 100 herbal Medicine for all kinds of Ailment Inspired By Barbara O'Neill
Ebook
The Lost Book of Simple Herbal Remedies: Discover over 100 herbal Medicine for all kinds of Ailment Inspired By Barbara O'Neill
byBlossom Davis
Rating: 0 out of 5 stars
0 ratings
Peptide Protocols: Volume One
Ebook
Peptide Protocols: Volume One
byMD William A. Seeds
Rating: 4 out of 5 stars
4/5
The Obesity Code: Unlocking the Secrets of Weight Loss (Why Intermittent Fasting Is the Key to Controlling Your Weight)
Ebook
The Obesity Code: Unlocking the Secrets of Weight Loss (Why Intermittent Fasting Is the Key to Controlling Your Weight)
byDr. Jason Fung
Rating: 4 out of 5 stars
4/5
The Emotion Code: How to Release Your Trapped Emotions for Abundant Health, Love, and Happiness (Updated and Expanded Edition)
Ebook
The Emotion Code: How to Release Your Trapped Emotions for Abundant Health, Love, and Happiness (Updated and Expanded Edition)
byDr. Bradley Nelson
Rating: 4 out of 5 stars
4/5
Women With Attention Deficit Disorder: Embrace Your Differences and Transform Your Life
Ebook
Women With Attention Deficit Disorder: Embrace Your Differences and Transform Your Life
bySari Solden
Rating: 5 out of 5 stars
5/5
Mediterranean Diet Meal Prep Cookbook: Easy And Healthy Recipes You Can Meal Prep For The Week
Ebook
Mediterranean Diet Meal Prep Cookbook: Easy And Healthy Recipes You Can Meal Prep For The Week
byLisa Rainolds
Rating: 5 out of 5 stars
5/5
The 40 Day Dopamine Fast
Ebook
The 40 Day Dopamine Fast
byGreg Kamphuis
Rating: 4 out of 5 stars
4/5
The Amazing Liver and Gallbladder Flush
Ebook
The Amazing Liver and Gallbladder Flush
byAndreas Moritz
Rating: 5 out of 5 stars
5/5
Adult ADHD: How to Succeed as a Hunter in a Farmer's World
Ebook
Adult ADHD: How to Succeed as a Hunter in a Farmer's World
byThom Hartmann
Rating: 4 out of 5 stars
4/5
Native American Herbalist's Bible - 10 Books in 1: Create Your Green Paradise of Medicinal Plants and Herbal Remedies to Unleash Your Vitality: Herbal Apotecary Collection
Ebook
Native American Herbalist's Bible - 10 Books in 1: Create Your Green Paradise of Medicinal Plants and Herbal Remedies to Unleash Your Vitality: Herbal Apotecary Collection
byLomasi Ahusaka
Rating: 5 out of 5 stars
5/5
Tight Hip Twisted Core: The Key To Unresolved Pain
Ebook
Tight Hip Twisted Core: The Key To Unresolved Pain
byChristine Koth
Rating: 4 out of 5 stars
4/5
The Diabetes Code: Prevent and Reverse Type 2 Diabetes Naturally
Ebook
The Diabetes Code: Prevent and Reverse Type 2 Diabetes Naturally
byDr. Jason Fung
Rating: 4 out of 5 stars
4/5
Sick to Fit: 3 Simple Techniques that Got Me from 420 Pounds to the Cover of Runner's World, Good Morning America, and The Today Show
Ebook
Sick to Fit: 3 Simple Techniques that Got Me from 420 Pounds to the Cover of Runner's World, Good Morning America, and The Today Show
byJosh LaJaunie, Jr
Rating: 5 out of 5 stars
5/5
What Happened to You?: Conversations on Trauma, Resilience, and Healing
Ebook
What Happened to You?: Conversations on Trauma, Resilience, and Healing
byOprah Winfrey
Rating: 4 out of 5 stars
4/5
Healthy Gut, Healthy You: The Personalized Plan to Transform Your Health from the Inside Out
Ebook
Healthy Gut, Healthy You: The Personalized Plan to Transform Your Health from the Inside Out
byMichael Ruscio
Rating: 4 out of 5 stars
4/5
Lies My Gov't Told Me: And the Better Future Coming
Ebook
Lies My Gov't Told Me: And the Better Future Coming
byRobert W. Malone
Rating: 4 out of 5 stars
4/5
Holistic Herbal: A Safe and Practical Guide to Making and Using Herbal Remedies
Ebook
Holistic Herbal: A Safe and Practical Guide to Making and Using Herbal Remedies
byDavid Hoffmann
Rating: 4 out of 5 stars
4/5
The Hormone Reset Diet: Heal Your Metabolism to Lose Up to 15 Pounds in 21 Days
Ebook
The Hormone Reset Diet: Heal Your Metabolism to Lose Up to 15 Pounds in 21 Days
bySara Szal Gottfried M.D.
Rating: 4 out of 5 stars
4/5
Gut: The Inside Story of Our Body's Most Underrated Organ (Revised Edition)
Ebook
Gut: The Inside Story of Our Body's Most Underrated Organ (Revised Edition)
byGiulia Enders
Rating: 4 out of 5 stars
4/5
ATOMIC HABITS:: How to Disagree With Your Brain so You Can Break Bad Habits and End Negative Thinking
Ebook
ATOMIC HABITS:: How to Disagree With Your Brain so You Can Break Bad Habits and End Negative Thinking
byDr. Dan Builfford
Rating: 5 out of 5 stars
5/5
Period Power: Harness Your Hormones and Get Your Cycle Working For You
Ebook
Period Power: Harness Your Hormones and Get Your Cycle Working For You
byMaisie Hill
Rating: 4 out of 5 stars
4/5
Taking Charge of Your Fertility: The Definitive Guide to Natural Birth Control, Pregnancy Achievement, and Reproductive Health
Ebook
Taking Charge of Your Fertility: The Definitive Guide to Natural Birth Control, Pregnancy Achievement, and Reproductive Health
byToni Weschler
Rating: 4 out of 5 stars
4/5
The Vagina Bible: The Vulva and the Vagina: Separating the Myth from the Medicine
Ebook
The Vagina Bible: The Vulva and the Vagina: Separating the Myth from the Medicine
byDr. Jen Gunter
Rating: 5 out of 5 stars
5/5
The Big Fat Surprise: Why Butter, Meat and Cheese Belong in a Healthy Diet
Ebook
The Big Fat Surprise: Why Butter, Meat and Cheese Belong in a Healthy Diet
byNina Teicholz
Rating: 4 out of 5 stars
4/5
Herbal Healing for Women
Ebook
Herbal Healing for Women
byRosemary Gladstar
Rating: 4 out of 5 stars
4/5
The Song of the Cell: An Exploration of Medicine and the New Human
Ebook
The Song of the Cell: An Exploration of Medicine and the New Human
bySiddhartha Mukherjee
Rating: 4 out of 5 stars
4/5
The Abandonment Recovery Workbook: Guidance through the Five Stages of Healing from Abandonment, Heartbreak, and Loss
Ebook
The Abandonment Recovery Workbook: Guidance through the Five Stages of Healing from Abandonment, Heartbreak, and Loss
bySusan Anderson
Rating: 4 out of 5 stars
4/5
Living Daily With Adult ADD or ADHD: 365 Tips o the Day
Ebook
Living Daily With Adult ADD or ADHD: 365 Tips o the Day
byDouglas A Puryear MD
Rating: 5 out of 5 stars
5/5
Summary of Dr. Gundry's Diet Evolution: Turn off the Genes That Are Killing You and Your Waistline
Ebook
Summary of Dr. Gundry's Diet Evolution: Turn off the Genes That Are Killing You and Your Waistline
byBookSuma Publishing
Rating: 3 out of 5 stars
3/5
The Art of Dying Well: A Practical Guide to a Good End of Life
Ebook
The Art of Dying Well: A Practical Guide to a Good End of Life
byKaty Butler
Rating: 4 out of 5 stars
4/5

Related podcast episodes

Skip carousel

Ep. 220: “Developmental and Synthetic Biology” Featuring Dr. Michael Levin: Dr. Michael Levin is the Director of the Allen Discovery Center and a Distinguished Professor of Biology at Tufts University. He is a cognitive biologist who utilizes model systems such as Xenopus to answer fundamental questions in developmental biolog...
Podcast episode
Ep. 220: “Developmental and Synthetic Biology” Featuring Dr. Michael Levin: Dr. Michael Levin is the Director of the Allen Discovery Center and a Distinguished Professor of Biology at Tufts University. He is a cognitive biologist who utilizes model systems such as Xenopus to answer fundamental questions in developmental biolog...
byThe Stem Cell Podcast
0 ratings
0% found this document useful
Proteomics and Deep Learning with Melih Yilmaz
Podcast episode
Proteomics and Deep Learning with Melih Yilmaz
byAxial Podcast
0 ratings
0% found this document useful
Episode 52. Neurological Disorders: Sara Manning Peskin: The brain is the most mysterious and complex organ of the body, and when things go awry, we may be confronted with personal tragedy and we may gain insights on what it means to be human. With us to discuss neurological disorders and the history of...
Podcast episode
Episode 52. Neurological Disorders: Sara Manning Peskin: The brain is the most mysterious and complex organ of the body, and when things go awry, we may be confronted with personal tragedy and we may gain insights on what it means to be human. With us to discuss neurological disorders and the history of...
byScience History Podcast
0 ratings
0% found this document useful
OrgaMapper: A robust and easy-to-use workflow for analyzing organelle positioning
Podcast episode
OrgaMapper: A robust and easy-to-use workflow for analyzing organelle positioning
byPaperPlayer biorxiv cell biology
0 ratings
0% found this document useful
Exploring Loops in the Human Genome: Dr. Erez Lieberman Aiden Explains His Research: Dr. Aiden works on analyzing the bending of our human genome, a 3-D complex arrangement that, in part, regulates our cells. This conversation explores how each chromatid forms unique loops and bends while patterns emerge across similar cell types,...
Podcast episode
Exploring Loops in the Human Genome: Dr. Erez Lieberman Aiden Explains His Research: Dr. Aiden works on analyzing the bending of our human genome, a 3-D complex arrangement that, in part, regulates our cells. This conversation explores how each chromatid forms unique loops and bends while patterns emerge across similar cell types,...
byFinding Genius Podcast
0 ratings
0% found this document useful
Optogenetics, RNA Editing and CRISPR with Dr. Feng Zhang
Podcast episode
Optogenetics, RNA Editing and CRISPR with Dr. Feng Zhang
byFYI - For Your Innovation
0 ratings
0% found this document useful
Ep 03: Glyphosate, Adjuvants & Viruses with Dr. Stephanie Seneff
Podcast episode
Ep 03: Glyphosate, Adjuvants & Viruses with Dr. Stephanie Seneff
byThe Way Forward with Alec Zeck
0 ratings
0% found this document useful
Making Sense of Sequencing: Exome Sequencing Analysis and What's Next with Dr. Daniel Geschwind: Dr. Daniel Geschwind Bio: Dr. Geschwind is the Gordon and Virginia MacDonald Distinguished Professor of Human Genetics, Neurology and Psychiatry at UCLA. In his capacity as Senior Associate Dean and Associate Vice Chancellor of Precision...
Podcast episode
Making Sense of Sequencing: Exome Sequencing Analysis and What's Next with Dr. Daniel Geschwind: Dr. Daniel Geschwind Bio: Dr. Geschwind is the Gordon and Virginia MacDonald Distinguished Professor of Human Genetics, Neurology and Psychiatry at UCLA. In his capacity as Senior Associate Dean and Associate Vice Chancellor of Precision...
byFinding Genius Podcast
0 ratings
0% found this document useful
Revolutionizing and Democratizing the Accessibility of Genomic Data—Paul Mola—Roswell Biotechnologies: Imagine visiting your primary care doctor and receiving a whole read out of your genomic data within 10 minutes for as little as $10, equipping your doctor with the information needed in order to deliver you individually tailored medical treatments...
Podcast episode
Revolutionizing and Democratizing the Accessibility of Genomic Data—Paul Mola—Roswell Biotechnologies: Imagine visiting your primary care doctor and receiving a whole read out of your genomic data within 10 minutes for as little as $10, equipping your doctor with the information needed in order to deliver you individually tailored medical treatments...
byFinding Genius Podcast
0 ratings
0% found this document useful
Understanding Our Era of Biological Evolution: Eugene V. Koonin Shares His Knowledge: Computational biologist and evolutionary genomics researcher Eugene Koonin touches on several timely topics about biology, evolution, and what computational biology can teach us. In this podcast, he discusses How the molecular clock works as a null...
Podcast episode
Understanding Our Era of Biological Evolution: Eugene V. Koonin Shares His Knowledge: Computational biologist and evolutionary genomics researcher Eugene Koonin touches on several timely topics about biology, evolution, and what computational biology can teach us. In this podcast, he discusses How the molecular clock works as a null...
byFinding Genius Podcast
0 ratings
0% found this document useful
Episode: 45 - Advances in Targeted Protein Degradation
Podcast episode
Episode: 45 - Advances in Targeted Protein Degradation
byThe Chain: Protein Engineering Podcast
0 ratings
0% found this document useful
How exponentials on top of exponentials in single-cell analysis is transforming biology today
Podcast episode
How exponentials on top of exponentials in single-cell analysis is transforming biology today
by"Securities"
0 ratings
0% found this document useful
The intersection of biology and technology: In this episode, Gabriel and Steve share a conversation with Dr. Ben Sun about multi-omics and the role AI and machine learning are playing in this space for biomarker discovery and disease prediction. Ben shares his insights from being both a clinician and research scientist, and the conversation delves into some really interesting aspects of using AI, including the consideration of the carbon footprint needed to train an algorithm.
Podcast episode
The intersection of biology and technology: In this episode, Gabriel and Steve share a conversation with Dr. Ben Sun about multi-omics and the role AI and machine learning are playing in this space for biomarker discovery and disease prediction. Ben shares his insights from being both a clinician and research scientist, and the conversation delves into some really interesting aspects of using AI, including the consideration of the carbon footprint needed to train an algorithm.
bySpeaking of Mol Bio
0 ratings
0% found this document useful
It's Mutual: Viruses and Immune System Adaptations with Nils G. Walter: Returning guest Nils G. Walter examines RNA molecules from every possible angle, literally and metaphorically. His lab at the University of Michigan brings multidisciplinary minds together to discover what RNA research may offer the medical community....
Podcast episode
It's Mutual: Viruses and Immune System Adaptations with Nils G. Walter: Returning guest Nils G. Walter examines RNA molecules from every possible angle, literally and metaphorically. His lab at the University of Michigan brings multidisciplinary minds together to discover what RNA research may offer the medical community....
byFinding Genius Podcast
0 ratings
0% found this document useful
Natural Prodcast Ep 9 - Roger Linington
Podcast episode
Natural Prodcast Ep 9 - Roger Linington
byNatural Prodcast
0 ratings
0% found this document useful
Biology at the Brink – Pamela Silver, Principal Investigator, Silver Lab – The Bionic Leaf and Advanced Biological Experimentation That Could Help Solve Monumental Environmental and Sustainability Problems: Pamela Silver is the principal investigator of the Silver Lab (<a href="http://silver.med.harvard.edu">silver.med.harvard.edu</a>) and an Elliot T and Onie H Adams Professor of Biochemistry and Systems Biology at Harvard Medical School....
Podcast episode
Biology at the Brink – Pamela Silver, Principal Investigator, Silver Lab – The Bionic Leaf and Advanced Biological Experimentation That Could Help Solve Monumental Environmental and Sustainability Problems: Pamela Silver is the principal investigator of the Silver Lab (<a href="http://silver.med.harvard.edu">silver.med.harvard.edu</a>) and an Elliot T and Onie H Adams Professor of Biochemistry and Systems Biology at Harvard Medical School....
byFinding Genius Podcast
0 ratings
0% found this document useful
Improving Drug Development through Better Use of Biomarkers: The use of biomarkers has long held the promise accelerating drug development and producing safer and more targeted drugs to meet the needs of patients. The explosion of genetic, proteomic, and metabolomic data, as well as the emergence of the human...
Podcast episode
Improving Drug Development through Better Use of Biomarkers: The use of biomarkers has long held the promise accelerating drug development and producing safer and more targeted drugs to meet the needs of patients. The explosion of genetic, proteomic, and metabolomic data, as well as the emergence of the human...
byThe Bio Report
0 ratings
0% found this document useful
Organoids: Multi-Dimensional Standards for Three Dimensional Models
Podcast episode
Organoids: Multi-Dimensional Standards for Three Dimensional Models
byThe Stem Cell Report with Martin Pera
0 ratings
0% found this document useful
Computing Positional Cues: From Single Cells to Embryo Development
Podcast episode
Computing Positional Cues: From Single Cells to Embryo Development
byThe Stem Cell Report with Martin Pera
0 ratings
0% found this document useful
Stem Cells in Translation: Focusing on the Eye
Podcast episode
Stem Cells in Translation: Focusing on the Eye
byThe Stem Cell Report with Martin Pera
0 ratings
0% found this document useful
Lessons Learnt, and Still to Learn, in Stem Cell Trials
Podcast episode
Lessons Learnt, and Still to Learn, in Stem Cell Trials
byThe Stem Cell Report with Martin Pera
0 ratings
0% found this document useful
206. UW Engage Science 2023: Rory Mcguire, Keenan Ganz, & Rasika Venkataraman: sees a future where every graduate student has access to science communication training, and therefore good science communication becomes the norm. The outcome is an increased public trust and positive attitude toward science, ultimately...
Podcast episode
206. UW Engage Science 2023: Rory Mcguire, Keenan Ganz, & Rasika Venkataraman: sees a future where every graduate student has access to science communication training, and therefore good science communication becomes the norm. The outcome is an increased public trust and positive attitude toward science, ultimately...
byTown Hall Seattle Science Series
0 ratings
0% found this document useful
Episode: 42 - Machine Learning Informatics for Antibody Discovery
Podcast episode
Episode: 42 - Machine Learning Informatics for Antibody Discovery
byThe Chain: Protein Engineering Podcast
0 ratings
0% found this document useful
R. Taylor Raborn: evolutionary genetics, good enough for government work: A biologist's journey from academia to government
Podcast episode
R. Taylor Raborn: evolutionary genetics, good enough for government work: A biologist's journey from academia to government
byRazib Khan's Unsupervised Learning
0 ratings
0% found this document useful
Searching for Alzheimer’s Biomarkers: David Wishart Describes Foundational Research: How do doctors measure your liver function, kidney function, cholesterol levels, and heart disease? They use biomarkers, and David Wishart helps identify with analytical chemistry, mass spectrometry, and other bioinformatics tools. He and his...
Podcast episode
Searching for Alzheimer’s Biomarkers: David Wishart Describes Foundational Research: How do doctors measure your liver function, kidney function, cholesterol levels, and heart disease? They use biomarkers, and David Wishart helps identify with analytical chemistry, mass spectrometry, and other bioinformatics tools. He and his...
byFinding Genius Podcast
0 ratings
0% found this document useful
Chromatin Looping: Seeing DNA in 3D: New tools are making it easier to understand not only our genetic code but also the ways that the code's three-dimensional structure contributes to gene expression. This understanding will be vital in the search for cures to diseases with multiple and c
Podcast episode
Chromatin Looping: Seeing DNA in 3D: New tools are making it easier to understand not only our genetic code but also the ways that the code's three-dimensional structure contributes to gene expression. This understanding will be vital in the search for cures to diseases with multiple and c
byBioScience Talks
0 ratings
0% found this document useful
Directed Evolution of Antibodies with Doug Chapnick
Podcast episode
Directed Evolution of Antibodies with Doug Chapnick
byAxial Podcast
0 ratings
0% found this document useful
High-throughput transcriptomics and AI for drug discovery: We kick off Season 2 with a great conversation featuring an entrepreneur using high-throughput transcriptomics and AI to innovate drug discovery. Join us to learn how miniaturization and automation are generating astoundingly large transcriptomics data sets and how generative AI methods are being applied to link the structure and activity of small molecules with RNA fingerprints.
Podcast episode
High-throughput transcriptomics and AI for drug discovery: We kick off Season 2 with a great conversation featuring an entrepreneur using high-throughput transcriptomics and AI to innovate drug discovery. Join us to learn how miniaturization and automation are generating astoundingly large transcriptomics data sets and how generative AI methods are being applied to link the structure and activity of small molecules with RNA fingerprints.
bySpeaking of Mol Bio
0 ratings
0% found this document useful
Unlikely Paths: Stories from the Institute for Genomic Biology: Featuring Heng Ji and Brendan Harley
Podcast episode
Unlikely Paths: Stories from the Institute for Genomic Biology: Featuring Heng Ji and Brendan Harley
byThe Story Collider
0 ratings
0% found this document useful
Using Synthetic Biology To Extend Lifespan With Dr. Nan Hao
Podcast episode
Using Synthetic Biology To Extend Lifespan With Dr. Nan Hao
byLongevity by Design
0 ratings
0% found this document useful

Skip carousel

You Had Questions For David Liu About CRISPR, Prime Editing, And Advice To Young Scientists. He Has Answers
STAT
Article
You Had Questions For David Liu About CRISPR, Prime Editing, And Advice To Young Scientists. He Has Answers
Nov 6, 2019
You had questions for David Liu about CRISPR, prime editing, and advice to young scientists. He has answers.
17 min read
Why Synthetic Protein Research Needs More Funding
Nautilus
Article
Why Synthetic Protein Research Needs More Funding
Jun 4, 2017
4 min read
Why Data Matters For Tracking Biodiversity Changes
Futurity
Article
Why Data Matters For Tracking Biodiversity Changes
Oct 3, 2018
New research highlights the importance of trait variability within species in measuring biodiversity changes and how ecologists can incorporate that data into their assessments. Around the world, ecologists are studying how species are responding to
2 min read
Cell ‘Powerhouses’ Are Like Empty Swimming Pools
Futurity
Article
Cell ‘Powerhouses’ Are Like Empty Swimming Pools
Jul 3, 2017
3 min read
Synthetic Genetic Circuits Control Plant Roots
Futurity
Article
Synthetic Genetic Circuits Control Plant Roots
Aug 19, 2022
3 min read
Protein-RNA Droplets Act Like Silly Putty
Futurity
Article
Protein-RNA Droplets Act Like Silly Putty
Nov 16, 2021
2 min read
The National Academies Illustrates the More Nuanced Value of Transparency in Science
Union of Concerned Scientists
Article
The National Academies Illustrates the More Nuanced Value of Transparency in Science
May 13, 2019
4 min read
‘Legos of Life’ Stack Together to Build Proteins
Futurity
Article
‘Legos of Life’ Stack Together to Build Proteins
Jan 23, 2018
After smashing and dissecting nearly 10,000 proteins to understand their component parts, scientists have discovered the “Legos of life”—four core chemical structures that can be stacked together to build the myriad proteins inside every organism. Th
2 min read
RNA Tweak Leads To 50% More Food From Crops
Futurity
Article
RNA Tweak Leads To 50% More Food From Crops
Jul 23, 2021
4 min read
New Way To Study Protein Complexes Doesn’t Break Them Up
Futurity
Article
New Way To Study Protein Complexes Doesn’t Break Them Up
Mar 23, 2018
Researchers have developed a new way to study protein complexes. Proteins are the worker bees of the cell, mainly ganging up to form macromolecular, multicomponent complexes to perform intricate cellular tasks. “Proteins are the main drivers of prett
2 min read
Decoding the Origami That Drives All Life
The Atlantic
Article
Decoding the Origami That Drives All Life
Jan 19, 2017
5 min read
To Find New Drugs, Make ‘Libraries’ From DNA
Futurity
Article
To Find New Drugs, Make ‘Libraries’ From DNA
Jun 27, 2017
A new technology can clone thousands of genes at once and compile libraries of proteins from DNA samples, potentially speeding up the search for new drugs. Discovering the function of a gene requires cloning a DNA sequence and expressing it. Until no
2 min read
Metabolomes: A New Way To Store Data In Little Space
Futurity
Article
Metabolomes: A New Way To Store Data In Little Space
Jul 5, 2019
3 min read
DNA ‘Signposts’ Direct Gene Shutoff in Plants
Futurity
Article
DNA ‘Signposts’ Direct Gene Shutoff in Plants
Aug 23, 2017
3 min read
A Pluralistic Approach to the HUMAN MICROBIOME
The Art of Healing
Article
A Pluralistic Approach to the HUMAN MICROBIOME
Sep 1, 2019
4 min read
Circuit Programs Human Cells to Add and Subtract
Futurity
Article
Circuit Programs Human Cells to Add and Subtract
Apr 15, 2017
A new platform offers a fast and more efficient way to target and program mammalian cells as genetic circuits, even complex ones. “The problem synthetic biologists are trying to solve is how we ask cells to make decisions and try to design a strategy
2 min read
What Happens When You Put 500,000 People's DNA Online
The Atlantic
Article
What Happens When You Put 500,000 People's DNA Online
Nov 6, 2017
5 min read
Broken ‘Rules’ Lead To Protein Clumps In Diseases Like ALS
Futurity
Article
Broken ‘Rules’ Lead To Protein Clumps In Diseases Like ALS
Feb 26, 2020
3 min read
Opinion: Machine Learning For Clinical Decision-making: Pay Attention To What You Don’t See
STAT
Article
Opinion: Machine Learning For Clinical Decision-making: Pay Attention To What You Don’t See
Dec 12, 2019
Don't take results from machine learning algorithms at face value. Ask what information isn't available. What subgroups haven't been prioritized? Who is on the research team?
4 min read
Updated Restricted Science Rule Spells Reanalysis Paralysis for the EPA
Union of Concerned Scientists
Article
Updated Restricted Science Rule Spells Reanalysis Paralysis for the EPA
Nov 12, 2019
7 min read
For Better Diagnosis, Watch Proteins Fold In Real-time
Futurity
Article
For Better Diagnosis, Watch Proteins Fold In Real-time
Jun 10, 2018
10A new method can examine protein assembly in real time, in living cells, to find problems in the process and diagnose the resulting diseases, according to new research. Proteins in the body need to be perfectly arranged, or folded, to do their jobs
2 min read
Next Antibiotic May Come From Dirt Bacteria
Futurity
Article
Next Antibiotic May Come From Dirt Bacteria
Aug 2, 2019
3 min read
How Proteins Stabilize And Repair Broken DNA
Futurity
Article
How Proteins Stabilize And Repair Broken DNA
Nov 1, 2019
2 min read
Opinion: All Study Participants Have A Right To Know Their Own Results. My Lab Has Been Doing That For Years
STAT
Article
Opinion: All Study Participants Have A Right To Know Their Own Results. My Lab Has Been Doing That For Years
Sep 5, 2018
Giving study participants their individual results can drive greater public participation in research, increased support for science, and better health.
6 min read
Team Uncovers Mechanisms That Can Lead To Blindness
Futurity
Article
Team Uncovers Mechanisms That Can Lead To Blindness
Jan 5, 2022
1 min read
Opinion: The Trouble With Mice As Behavioral Models For Alzheimer’s And Other Neurologic Diseases
STAT
Article
Opinion: The Trouble With Mice As Behavioral Models For Alzheimer’s And Other Neurologic Diseases
Apr 16, 2019
There's too much reliance on using mice as behavioral models to guide drug development for #Alzheimers and other neurologic diseases. We need to find ways to move beyond this flawed…
4 min read
Genes May Hold Clues To Keep Broccoli Fresh
Futurity
Article
Genes May Hold Clues To Keep Broccoli Fresh
Nov 18, 2021
2 min read
BPA May Do Long-Lasting Damage to Painted Turtles
Futurity
Article
BPA May Do Long-Lasting Damage to Painted Turtles
May 17, 2017
2 min read
Team Solves Medical Mystery Of Child’s Fatal Lung Disease
Futurity
Article
Team Solves Medical Mystery Of Child’s Fatal Lung Disease
Feb 4, 2022
3 min read
Cell Atlases Reveal Biology’s Frontiers
Quanta
Article
Cell Atlases Reveal Biology’s Frontiers
Jul 12, 2017
5 min read

Related categories

Skip carousel

Reviews for Bioinformatics for Beginners

Rating: 4.666666666666667 out of 5 stars

4.5/5

3 ratings1 review

Rating: 5 out of 5 stars
5/5
amazing is the perfect word for this, thank you I enjoyed it so much

Book preview

Bioinformatics for Beginners - Supratim Choudhuri

needed.

Chapter 1 Fundamentals of Genes and Genomes*

This chapter briefly discusses the structure and function of genes and genomes. Some topics covered here are not usually discussed in textbooks of molecular biology. The obvious beginning is from the double-helical structure of DNA. The discussion on hydrogen bonding and the standard base-pairing principle is extended to include Hoogsteen hydrogen bonding and triple helix formation. The importance of intron phase in alternative splicing is discussed in detail; it lays the foundation for understanding exon shuffling during genome evolution, discussed in Chapter 2. Various types of noncoding RNAs (ncRNAs), such as small ncRNA, long ncRNA, competing endogenous RNA, and circular RNA are highlighted. The chemical basis of the instability of RNA is also discussed. The relationship between protein function and the location of amino acids in the polypeptide chain is explained with examples. Some important features of the human genome and characterization of its functional elements by the Encyclopedia of the DNA Elements (ENCODE) project are highlighted. A discussion on the epigenetic modification of the genome is also included.

Keywords

chirality; DNA structure; ENCODE; epigenetics; gene structure; Hoogsteen H-bonding; human genome; intrinsically disordered proteins; intron phase; noncoding RNA; triple helix

Outline

1.1 Biological Macromolecules, Genomics, and Bioinformatics 2

1.2 DNA as the Universal Genetic Material 2

1.3 DNA Double Helix 2

1.3.1 Structural Units of DNA 2

1.3.2 Linkage between Nucleotides 3

1.3.3 Base-Pairing Rules, Double Helix, and Triple Helix 4

1.3.4 Single-Stranded DNA 4

1.3.5 Base Sequence and the Genetic Code 5

1.4 Conformations of DNA 5

1.5 Typical Eukaryotic Gene Structure 5

1.5.1 Transcribed Region 7

1.5.1.1 Intron-Splicing Signals 7

1.5.1.2 Effect of Intron Phase on Alternative Splicing 9

1.5.1.3 Evolution of Introns 10

1.5.2 5′-Flanking Region of Transcribed Genes 11

1.5.3 3′-Flanking Region of Transcribed Genes 11

1.6 Mutations in the DNA Sequence 12

1.7 Some Features of RNA 12

1.7.1 Instability of mRNA 12

1.7.2 5′- and 3′-Untranslated Regions of mRNA 12

1.7.3 Secondary Structures in RNA 13

1.8 Coding Versus Noncoding RNA 14

1.8.1 Small Noncoding RNA, Long Noncoding RNA, Competing Endogenous RNA, and Circular RNA 14

1.9 Protein Structure and Function 15

1.9.1 Configuration and Chirality of Amino Acids 15

1.9.2 Ionic Character of Amino Acids 16

1.9.3 Relationship between Protein Function and the Location of Amino Acids in the Polypeptide Chain 16

1.9.4 Linkage between Amino Acids—The Peptide Bond 17

1.9.5 Four Levels of Protein Structure 17

1.9.6 Acidic and Basic Proteins 17

1.9.7 Nonstandard Amino Acids in Polypeptide Chains 18

1.10 Genome Structure and Organization 18

1.10.1 The Structure of a Representative Genome—The Human Genome 19

1.10.2 Functional Sequence Elements in the Genome 21

1.10.2.1 Promoters 21

1.10.2.2 Enhancers 21

1.10.2.3 Locus Control Regions 21

1.10.2.4 Insulators 22

1.10.3 Epigenetic Modifications of the Genome Can Edit the Language Written in the DNA Sequence and Add an Extra Layer of Complexity in Genome Expression 22

1.10.3.1 Histone Code 23

1.10.3.2 The Dynamics of Epigenetic Changes 24

1.10.4 Lessons Learned from the Second Phase of the ENCODE Project about the DNA Elements in the Human Genome and its Epigenetic Modifications 24

References 25

1.1 Biological Macromolecules, Genomics, and Bioinformatics

Genetic information is stored in the cell in the form of biological macromolecules, such as nucleic acids and proteins. The genetic information not only drives the functioning of the whole organism, but also drives the evolutionary engine. Thus, an understanding of the molecular basis of life is fundamental to understanding how genetic information shapes life and drives its evolution. The following discussion captures some fundamental aspects of the structure and function of genes and genomes with special notes (in boxes) on the applications of this information.

1.2 DNA as the Universal Genetic Material

With some exceptions, deoxyribonucleic acid (DNA) is the universal genetic material. In some viruses, termed RNA viruses, RNA is the genetic material. The term ribovirus is used for viruses with single- and double-stranded RNA genomes, including retroviruses, which are RNA-based for a portion of their life cycle.¹

Among the RNA viruses, retroviruses are well known; they include the notorious AIDS virus. Retroviruses are unique because in their life cycle they have both RNA and DNA versions of their genome. A complete retrovirus contains an RNA genome. The RNA genome encodes some protein products that are necessary for converting the single-stranded RNA genome into a double-stranded DNA genome and then its subsequent integration into the host genome. One such protein product of the retroviral genome is the reverse transcriptase (RT) enzyme. Upon entry into the cell, the reverse transcriptase is produced from the viral RNA genome using the host cellular machinery. The RT then copies the single-stranded RNA genome into a single-stranded DNA, which then produces a double-stranded viral DNA genome. The double-stranded viral DNA genome is referred to as the provirus, which gets incorporated into the host genome from where it keeps producing more retrovirus particles with single-stranded RNA genomes.

1.3 DNA Double Helix

The structure of the DNA double helix and its building blocks are described in all biology textbooks. Here, some other aspects are also highlighted, including the information in Box 1.1. DNA is a double-stranded right-handed helix; the two strands are complementary because of complementary base pairing, and antiparallel because the two strands have opposite 5′−3′ orientation (Figure 1.1A). The diameter of the helical DNA molecule is 20 Å (=2 nm). The helical conformation of DNA creates the alternate major groove and minor groove (Figure 1.1B).

Box 1.1

1. The major grooves in DNA can bind proteins. This is an important property of DNA structure because the major grooves in the upstream regulatory regions of a gene bind transcription-regulatory proteins. For example, for Zn-finger transcription factors, each Zn finger recognizes and binds to a specific trinucleotide sequence in the major groove of DNA.²

2. Any double-stranded nucleic acid (whether DNA double strand, DNA–RNA hybrid double strand, or RNA–RNA double strand) is antiparallel in nature. The complementary and antiparallel nature of double-stranded nucleic acids is an important property to remember while designing synthetic oligonucleotides for hybridization (probes or primers).

3. By convention, nucleic acid (DNA or RNA) sequence is written 5′→3′ from left to right, such as 5′-ATGTAAGCAC-3′. If the 5′→3′ designation is not mentioned, it is assumed that the sequence has been written in a 5′→3′ direction, following convention.

Figure 1.1 DNA structure.

(A) Two nucleotides of the DNA double helix, showing their antiparallel orientation, two H-bonds between A and T and three H-bonds between G and C; (B) the DNA double helix showing the major and minor grooves as well as the diameter of the molecule; (C) the convention of classifying the two sides of the phosphodiester bond and the products generated from their cleavage; (D) the front side (Watson–Crick edge) and the back side (Hoogsteen edge) of a purine; (E) how Hoogsteen H-bonding aids in the formation of the triple helix (see Section 1.3.3); (F) the anti and the syn conformations of bases around the N-glycosidic bond.

1.3.1 Structural Units of DNA

DNA is composed of structural units called nucleotides (deoxyribonucleotides). Each nucleotide is composed of a pentose sugar (2′-deoxy-D-ribose); one of the four nitrogenous bases—adenine (A), thymine (T), guanine (G), or cytosine (C); and a phosphate. The pentose sugar has five carbon atoms and they are numbered 1′ (1-prime) through 5′ (5-prime). The base is attached to the 1′ carbon atom of the sugar, and the phosphate is attached to the 5′ carbon atom (Figure 1.1A). The sugar and base form a nucleoside, whereas nucleoside plus phosphate makes a nucleotide. Hence, nucleoside=sugar+base, whereas nucleotide=sugar+base+phosphate. Table 1.1 shows the naming of nucleosides and nucleotides. Each nucleotide in DNA (as well as in RNA) has one replaceable hydrogen, which is what makes the DNA (and RNA) acidic.

Table 1.1

Naming of Nucleosides and Nucleotides

1.3.2 Linkage between Nucleotides

The nucleotides are joined by 5′–3′ phosphodiester linkage; that is, the 5′-phosphate of a nucleotide is linked to the 3′-OH of the preceding nucleotide by a phosphodiester linkage. In a linear DNA molecule, the 5′-end has a free phosphate and the 3′-end has a free OH group (Figure 1.1A). Each phosphodiester bond has two sides: a 3′-side that is linked to the 3′-end of the preceding nucleotide, and a 5′-side that is linked to 5′-end of the following nucleotide. The 3′-side is called the A side by convention and its cleavage generates a 5′-PO4 product. The 5′-side is called the B side by convention and its cleavage generates a 3′-PO4 product (Figure 1.1C).

1.3.3 Base-Pairing Rules, Double Helix, and Triple Helix

In the double-stranded DNA, A pairs with T by two hydrogen bonds and G pairs with C by three hydrogen bonds (Figures 1.1A and 1.1B); thus GC-rich regions of DNA have more hydrogen bonds and consequently are more resistant to thermal denaturation. Each nucleotide pair C) has a molecular weight of approximately 660 Da (sodium salt; 610 without sodium). In the helical double-stranded DNA molecule, the sugar–phosphate backbone lies outside and the bases are inside. Base pairs are stacked and horizontal; hence they are perpendicular to the axis of DNA. Because of the stacked nature of the base pairs in DNA, spatially flat molecules can intercalate between them. Of the four bases, A and G are purines whereas T and C are pyrimidines. In double-stranded DNA, a purine pairs with a pyrimidine (A with T and G with C). Therefore, total amount of purine should equal total amount of pyrimidine; in other words, the purine/pyrimidine ratio should be 1.0 or close to 1.0. This purine–pyrimidine equivalence in double-stranded DNA is called Chargaff’s rule.

In the bases, the side with the N1 position of the heterocyclic ring is the front, also called the Watson–Crick edge (Figure 1.1D); the opposite side is the back, also called the Hoogsteen edgeC base pairs in the normal double helix can form additional hydrogen bonds (Hoogsteen hydrogen bonds) to give rise to a triple helix involving the Hoogsteen edge of the purines, i.e. N7 of A and G for the third strand (Figure 1.1E). Hoogsteen hydrogen bonds can also form in RNA. In nucleic acids, the presence of a stretch of homopurine allows a stretch of homopyrimidine to hybridize through Hoogsteen hydrogen bonding to form a section of DNA triple helix. The homopyrimidine-containing third strand is oriented parallel to the oligopurine strand (Figure 1.1E), whereas the homopurine-containing third strand is oriented antiparallel to the oligopurine strand (see Box 1.2).³–⁵

Box 1.2

1. Each phosphate has three replaceable H+; phosphodiester-bond formation between two nucleotides leaves one replaceable H+. These replaceable H+ make the DNA (and RNA) acidic (Figures 1.1 and 1.3).

2. The intercalation property of spatially flat molecules is utilized to visualize DNA (and RNA) in a gel using flat aromatic molecules that fluoresce under UV, such as ethidium bromide and acridine orange. The intercalation of these molecules can also cause frameshift mutation during DNA replication.

3. The purine–pyrimidine equivalence can be utilized to determine if a DNA molecule from an unknown source is double stranded or single stranded. In a double-stranded DNA molecule, the purine/pyrimidine ratio should be 1.0 (or close to 1.0); in contrast, in a single-stranded DNA molecule this equivalence is lacking.

4. The differential thermal stability of AT-rich versus GC-rich regions in double-stranded nucleic acids is taken into consideration while designing oligonucleotides for hybridization for different purposes, such as high-stringency hybridization, primers for polymerase chain reaction (PCR), or for sequencing. For example, an oligoprobe that will be used for high-stringency hybridization can have≥55% G+C content.

5. If the molecular weight of an unknown double-stranded DNA is determined, the total base-pair content of the DNA can be calculated based on the fact that each nucleotide pair has an approximate molecular weight of 660 Da. By the same token, if the total number of base pairs in a DNA molecule is known, its molecular weight can be determined as well.

6. Hoogsteen hydrogen bonding can create short transient stretches of triple helix in vivo; triple helix formation can also be induced under experimental conditions. Synthetic oligodeoxynucleotides that can form triple helix have been used in vitro to inhibit gene expression in cells. Triple-helix-forming oligonucleotides coupled to DNA-modifying agents can be introduced into cells to modify the DNA target in a highly sequence-specific manner. This tool can be used to introduce genome modification, modulate specific gene expression, or even repair DNA.⁶,⁷

For bases, two conformational variations are possible. The bond joining the 1′-carbon of the deoxyribose sugar to the base is the N-glycosidic bond. Rotation about this base-to-sugar glycosidic bond gives rise to syn and anti conformations. The anti conformation is the most common one (Figure 1.1F); however, the syn conformation can trigger the formation of triple helix (Figure 1.1E) and also play a role in transversion mutation (see Molecular basis of mutation, Section 2.3.1 in Chapter 2).

1.3.4 Single-Stranded DNA

Many DNA viruses have single-stranded DNA (for example, φX-174, parvoviruses). RNA viruses have RNA as the genetic material, and the RNA genome can be single or double stranded. Single-stranded DNA does not have base equivalence and hence does not follow Chargaff’s base equivalence rule.

1.3.5 Base Sequence and the Genetic Code

The genetic information—that is, the genetic code with information for the amino acid sequence of the protein—lies in the sequence of bases in DNA. Genetic code exists in the form of a sequence of three bases; each three-base sequence is called a codon, which codes for an amino acid. Transcription of mRNA copies the codons from DNA to mRNA, which is translated to yield the protein (polypeptide) product. ATG in DNA (corresponding to AUG in RNA) is the start codon that codes for methionine. Translation begins by recognizing the start codon and incorporating methionine as the first amino acid. Similarly, TAG (amber), TGA (opal), and TAA (ochre) (corresponding to UAG, UGA, and UAA, respectively, in mRNA) are the three stop codons that do not code for any amino acids (exceptions to this rule are discussed below). In addition to being triplet (read as three-nucleotide codons), genetic code is (almost) universal, non-overlapping (adjacent codons do not share nucleotides), and degenerate (most amino acids can be coded by more than one codon). There are 64 (4³) possible codons (61 coding and 3 noncoding). Genetic code normally codes for 20 standard amino acids. The two known cases of direct incorporation of non-standard amino acids are that of selenocysteine (the 21st amino acid) and pyrrolysine (22nd amino acid). Selenocysteine has been found in lower as well as higher organisms, including mammals, while pyrrolysine has so far been found in certain archaebacteria. Both these amino acids are encoded by stop codons; selenocysteine is encoded by UGA and pyrrolysine is encoded by UAG in mRNA.

1.4 Conformations of DNA

There are three major conformations of DNA: B-DNA, A-DNA, and Z-DNA. The DNA structure that Watson and Crick proposed was the B form of DNA (B-DNA), and this is the physiological form of DNA. In B-DNA, the diameter of the helix is 2 nm (=20 Å). Each pitch—that is, one complete turn (360°)—is 3.4 nm (=34 Å) long and contains 10 base pairs. A-DNA has been identified in vitro under different salt concentrations, as well as in DNA–RNA hybrids. It is also a right-handed helix. The diameter of the helix is 2.3 nm (=23 Å). Each pitch is 2.6 nm (=26 Å) and contains 11 base pairs. So, for a given length, the A-form is wider and shorter than the B-form. Z-DNA is a left-handed helix (Z=zigzag). This form has been identified both in vitro and within the cell. Small, localized regions within the physiological B-form of DNA can attain a left-handed conformation. Formation of the left-handed Z-DNA conformation is dictated by regions of alternating purines and pyrimidines residues, such as 5′-GCGCGCGCGCGCGCGC-3′. In Z-DNA, the diameter of the helix is 1.8 nm (=18 Å). Each pitch is 3.7 nm (=37 Å) long and contains 12 base pairs. Thus, the Z-form is narrower and longer than the B-form. It is thought that local Z-DNA conformations may play important roles in gene transcription.

1.5 Typical Eukaryotic Gene Structure

According to the classical view of transcription, for any given gene, one of the two strands of DNA is transcribed, the other is nota. The DNA strand that is NOT transcribed is called the sense or plus (+) or coding strand because it has the same sequence as that of the mRNA (except for U in RNA and T in DNA)—that is, the same sequence of codons in the same 5′→3′ direction, so that the polypeptide sequence can be predicted from the sense strand sequence (see Box 1.3). In contrast, the strand that is transcribed is called the template or antisense or minus (−) or noncoding strand because its sequence is complementary to the coding sequence; hence, the polypeptide sequence cannot be predicted from the template strand sequence. A typical mRNA-coding eukaryotic gene has three major parts: a transcribed region, a 5′-flanking region, and a 3′-flankng region (Figure 1.2). In eukaryotes, different types of RNAs are transcribed from the DNA by different RNA polymerases: RNA polymerase I (pol I) transcribes ribosomal RNA (rRNA), RNA polymerase II (pol II) transcribes messenger RNA (mRNA), RNA polymerase III (pol III) transcribes transfer RNA (tRNA). For mRNA, the primary transcript that contains both exons and introns is called the heterogeneous nuclear RNA (hnRNA) or pre-mRNA. The hnRNA is processed to remove the introns (splicing), add a 7-methyl guanine cap at the 5′-end by 5′–5′ linkage (Figure 1.2 inset), and add a poly(A) tail at the 3′-end, which is about 200 bp long in mammals.

Box 1.3

1. An easy way to remember the sense and antisense designations is to remember just one fact: that the sequence of mRNA is sense. This is because the codons can be found in the coding sequence of mRNA; as a result the amino acid sequence of the polypeptide can be predicted from the mRNA coding sequence. Hence, any sequence that is same as the mRNA sequence along with the same 5′→3′ polarity is also sense. That is why the DNA strand that has the same sequence and polarity as the mRNA is also sense. Likewise, any sequence that is complementary to the mRNA sequence, along with the opposite 5′→3′ polarity, is antisense. Hence, the template DNA strand is antisense (Figure 1.4A).

2. By the same token, the probe used to detect mRNA in northern blot or in situ hybridization is antisense because it is complementary and has an opposite polarity to the mRNA. When designing antisense DNA oligoprobes for RNA or DNA hybridization, the complementary and antiparallel sequence of the sense strand of DNA is used. For example, in Figure 1.4, the mRNA partial sequence shown is 5′-AUG UGU AGA UCG AUG A-3′. That region of the antisense DNA probe will have the sequence 3′-TAC ACA TCT AGC TAC T-5′. Following convention, the DNA probe sequence has to be rewritten in a 5′→3′ direction from left to right. Hence, this DNA probe partial sequence will be rewritten (for reporting the sequence) as 5′-TCA TCG ATC TAC ACA T-3′ (Figure 1.4B).

3. In the nucleotide databases, such as in National Center for Biotechnology Information (NCBI), DNA Data Bank of Japan (DDBJ), or The European Molecular Biology Laboratory (EMBL), the reported mRNA sequences do not contain U but instead contain T. This is because the mRNA sequence is reported as the sense strand of the cloned complementary DNA (cDNA).

Figure 1.2 Gene–hnRNA–mRNA–protein relationship.

Exon 1 is noncoding. Thus, the 5′-untranslated region (5′-UTR) is derived from exon 1, and the 3′-UTR is derived from the noncoding part of exon 5, which is the last and the longest exon. The sense strand of DNA has a T where the mRNA has U—for example, the poly(A) signal sequence in the sense strand is AATAAA, but in RNA it is AAUAAA. The transcription initiation site is +1 and the base to the left (upstream) of it is −1; there is no 0 position. Also, note that RNA polymerase transcribes well beyond the poly(A) site; this extra part of the transcript is degraded and does not form part of the last exon. Inset shows the mRNA cap (7-MeG) and its 5′–5′ linkage with the first base of mRNA. nt, nucleotide; ORF, open reading frame.

1.5.1 Transcribed Region

The nucleotide sequence of a gene that is transcribed into mRNA is composed of discrete sequences called exons and introns. Introns are also known as intervening sequences (abbreviated as IS) (Figure 1.2). After transcription of the gene, a longer primary transcript (the hnRNA or pre-mRNA) is produced. The hnRNA has the same exon–intron organization as the gene: exons are interrupted by introns. The hnRNA is processed to produce the mature mRNA. Exons are maintained in the mature mRNA, while introns are spliced out (in most cases). The structural unit of mRNA is the ribonucleotide (Figure 1.3). Introns do not contain information for the coding of the polypeptide. However, some introns, usually at the 5′-end of the gene, contain signals for transcriptional regulation. Introns of many genes also contain nested genes that have distinct expression profiles.⁸ In mRNAs, a few terminal exons are noncoding, whereas the internal exons code for amino acids. These terminal noncoding exons form the 5′- and 3′-untranslated regions (UTRs) of the mRNA. In most mRNAs, the last exon (at the 3′-end) is usually the longest of all exons, and is partially coding (see Box 1.4).

Figure 1.3 Alkaline hydrolysis of RNA.

O−, which carries out a nucleophilic attack on the δ+ P of the phosphate. This results in the cleavage of the phosphodiester bond and the formation of 2′−3′ cyclic nucleotide; the cyclic nucleotide hydrolyzes into ribonucleoside 2′- and 3′-monophosphate end products.

Figure 1.4 Sense and antisense strands of DNA.

(A) The two strands of DNA have been drawn in different colors so that their respective 5′- and 3′-ends could be easily distinguished. The figure shows that mRNA and the sense strand have the same sequence (except for U in RNA and T in DNA) and the same 5′–3′ polarity. (B) The mRNA and antisense probe relationship.

Box 1.4

1. Sometimes an intron may be retained in the mature mRNA and perform specific regulatory functions. For example, migration stimulatory factor (MSF) is a truncated oncofetal isoform of fibronectin. Two types of MSF mRNAs have been detected: a shorter 2.1-kbb transcript and a longer 5.9-kb transcript, which differ only in the length of their 3′-UTRs. In the smaller transcript, the intron-derived 30-nucleotide (nt) coding sequence is followed by a 165-nt intron-derived 3′-UTR. This makes a total of 195-nt intron-derived sequence in the smaller transcript.⁹ This intron-derived 3′-UTR also provides the polyadenylation signal. The smaller transcript is transported to the cytoplasm and eventually secreted, while the larger transcript is retained in the nucleus.

2. After a gene is cloned and sequenced, the exon–intron boundaries are identified by comparing the gene sequence with its complementary DNA (cDNA) (mRNA) sequence. Identification of the exon–intron boundaries of a gene is essential when attempting to manipulate the DNA, such as making a gene-targeting construct.

3. The majority of internal exons in vertebrate genes are less than 300 bp; the average length being 135 bp; exons larger than 800 bp are rare.¹⁰

4. For most genes, the last exon (at the 3′-end) is the longest exon (could be well over 1 kb) and partially coding.

5. For most genes, the 5′-UTR is derived from more than one exon. Of these 5′ noncoding exons, the most downstream one is usually partially noncoding because the open reading frame (ORF) begins at some place in this exon, making it partially noncoding and partially coding.

6. For most genes, the 3′-UTR is three to five times longer than the 5′-UTR, particularly in vertebrates.

7. In vertebrates, exons are small and introns are large. In contrast, in lower eukaryotes, the opposite is

Enjoying the preview?

Page 1 of 1

Bioinformatics for Beginners: Genes, Genomes, Molecular Evolution, Databases and Analytical Tools

About this ebook

Supratim Choudhuri

Related authors

Related to Bioinformatics for Beginners

Related ebooks

Medical For You

Related podcast episodes

Related articles

Related categories

Reviews for Bioinformatics for Beginners

What did you think?

Book preview

Bioinformatics for Beginners - Supratim Choudhuri

Chapter 1

Fundamentals of Genes and Genomes*

Keywords

Outline

1.1 Biological Macromolecules, Genomics, and Bioinformatics

1.2 DNA as the Universal Genetic Material

1.3 DNA Double Helix

Box 1.1

1.3.1 Structural Units of DNA

1.3.2 Linkage between Nucleotides

1.3.3 Base-Pairing Rules, Double Helix, and Triple Helix

Box 1.2

1.3.4 Single-Stranded DNA

1.3.5 Base Sequence and the Genetic Code

1.4 Conformations of DNA

1.5 Typical Eukaryotic Gene Structure

Box 1.3

1.5.1 Transcribed Region

Box 1.4