Computation and Storage in the Cloud: Understanding the Trade-Offs

Ebook, 228 pages, 2 hours

Brand: Elsevier Science
Rating: 5.0 (2 reviews)

By Dong Yuan, Yun Yang and Jinjun Chen

About this ebook

Computation and Storage in the Cloud is the first comprehensive and systematic work to investigate the trade-off between computation and storage in the cloud as a way to reduce overall application cost. Scientific applications are usually computation- and data-intensive: complex computation tasks take a long time to execute, and the generated datasets are often terabytes or petabytes in size. Storing valuable generated datasets saves the cost of regenerating them when they are reused, as well as the waiting time that regeneration causes. However, the sheer size of scientific datasets makes storing them a major challenge. By proposing innovative concepts, theorems and algorithms, this book will help bring costs down dramatically for both cloud users and service providers running computation- and data-intensive scientific applications in the cloud.

Covers cost models and benchmarking that explain the necessary trade-offs for both cloud providers and users
Describes several novel strategies for storing application datasets in the cloud
Includes real-world case studies of scientific research applications
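
The store-or-regenerate trade-off described above can be illustrated with a minimal sketch. This is not the book's cost model: the function name, the parameters, and all prices below are hypothetical, chosen only to show how storage cost and regeneration cost pull in opposite directions.

```python
def cheaper_to_store(size_gb, storage_price_per_gb_month,
                     regeneration_cost, uses_per_month):
    """Return True when keeping a dataset costs less per month
    than regenerating it every time it is reused.

    All inputs are illustrative assumptions, not figures from the book.
    """
    # Monthly cost of simply keeping the dataset in cloud storage.
    monthly_storage_cost = size_gb * storage_price_per_gb_month
    # Expected monthly cost of recomputing the dataset on each reuse.
    monthly_regeneration_cost = regeneration_cost * uses_per_month
    return monthly_storage_cost < monthly_regeneration_cost


# A 1,000 GB dataset at a hypothetical 0.02 per GB per month,
# costing a hypothetical 50.0 in compute to regenerate.
frequently_used = cheaper_to_store(1000, 0.02, 50.0, uses_per_month=1)
rarely_used = cheaper_to_store(1000, 0.02, 50.0, uses_per_month=0.1)
```

Under these assumed numbers, the frequently reused dataset is worth storing, while the one reused roughly once every ten months is cheaper to delete and regenerate on demand.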

Language: English

Publisher: Elsevier Science

Release date: Dec 31, 2012

ISBN: 9780124078796

Author

Dong Yuan

Dong Yuan is currently a research fellow in the School of Software and Electrical Engineering at Swinburne University of Technology, Melbourne, Australia. His research interests include data management in parallel and distributed systems, scheduling and resource management, and grid and cloud computing.

Skip carousel


Book preview

Computation and Storage in the Cloud - Dong Yuan


Preface

Scientific research increasingly relies on information technology: communities of researchers run their applications on large-scale, high-performance computing systems (e.g. clusters, grids and supercomputers). Scientific applications are usually computation and data-intensive: complex computation tasks take a long time to execute, and the generated data sets are often terabytes or petabytes in size. Storing valuable generated application data sets saves the cost of regenerating them when they are reused, not to mention the waiting time caused by regeneration. However, the large size of scientific data sets makes their storage a big challenge.

In recent years, cloud computing has emerged as the latest distributed computing paradigm, providing redundant, inexpensive and scalable resources on demand to meet system requirements. It offers researchers a new way to deploy computation and data-intensive applications (e.g. scientific applications) without any infrastructure investment. Large generated application data sets can be flexibly stored in the cloud, or deleted and regenerated whenever needed, since theoretically unlimited storage and computation resources can be obtained from commercial cloud service providers.

With the pay-as-you-go model, the total application cost for generated data sets in the cloud depends chiefly on the method used for storing them. For example, storing all the generated application data sets in the cloud may result in a high storage cost, since some data sets may be large in size but seldom used. If, on the other hand, we delete all the generated data sets and regenerate them every time they are needed, the computation cost may also be very high. Hence, there is a trade-off between computation and storage in the cloud. To reduce the overall application cost, a good strategy is to strike a balance: selectively store the popular data sets and regenerate the rest when needed. This book focuses on cost-effective storage of the data sets of scientific applications in the cloud, which is currently a leading-edge and challenging topic. By investigating the computation and storage trade-off, we (1) propose a new cost model for data set storage in the cloud; (2) develop novel benchmarking approaches to find the minimum cost of storing the application data; and (3) design innovative runtime storage strategies to store the application data in the cloud.
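
The trade-off described above can be made concrete with a small sketch. It is not the book's cost model: the prices, the two data sets and the names `DataSet`, `STORAGE_PRICE` and `COMPUTE_PRICE` are hypothetical, and each data set is treated independently, ignoring any dependency between data sets. It compares the daily cost of storing everything, deleting everything, and storing selectively.

```python
from dataclasses import dataclass

STORAGE_PRICE = 0.005  # hypothetical price: $ per GB per day
COMPUTE_PRICE = 0.10   # hypothetical price: $ per CPU hour


@dataclass
class DataSet:
    name: str
    size_gb: float       # size when stored
    gen_hours: float     # CPU hours needed to regenerate it once
    uses_per_day: float  # how often it is reused

    def storage_rate(self) -> float:
        """Daily cost of keeping the data set in cloud storage."""
        return self.size_gb * STORAGE_PRICE

    def regeneration_rate(self) -> float:
        """Daily cost of regenerating the data set on every use."""
        return self.gen_hours * COMPUTE_PRICE * self.uses_per_day


def daily_cost(datasets, stored_names) -> float:
    """Total daily cost: storage for stored sets, regeneration for the rest."""
    return sum(
        d.storage_rate() if d.name in stored_names else d.regeneration_rate()
        for d in datasets
    )


# Two made-up data sets: one large but seldom used, one small but popular.
datasets = [
    DataSet("large_rarely_used", size_gb=1000, gen_hours=2, uses_per_day=0.1),
    DataSet("small_popular", size_gb=10, gen_hours=50, uses_per_day=1),
]

store_all = daily_cost(datasets, {d.name for d in datasets})
delete_all = daily_cost(datasets, set())
# Selective policy: store a data set only if storing is cheaper than regenerating.
selective_names = {
    d.name for d in datasets if d.storage_rate() < d.regeneration_rate()
}
selective = daily_cost(datasets, selective_names)

print(f"store all:  {store_all:.2f} $/day")
print(f"delete all: {delete_all:.2f} $/day")
print(f"selective:  {selective:.2f} $/day")
```

With these invented numbers, storing everything and deleting everything each cost roughly five dollars a day, while the selective policy costs a few cents, because it keeps only the small, frequently reused data set.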

We start by introducing a motivating example from astrophysics and analysing the problems of the computation and storage trade-off in the cloud. Based on the requirements identified, we propose the novel concept of a Data Dependency Graph (DDG) and an effective cost model for data set storage in the cloud. A DDG is based on data provenance, which records the generation relationships of all the data sets. With a DDG, we know how to effectively regenerate data sets in the cloud and can further calculate their generation costs. The total application cost for the generated data sets includes both their generation cost and their storage cost.
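
As an illustration of how a dependency graph lets us calculate generation costs, here is a minimal sketch. It assumes one plausible reading of the idea: regenerating a data set requires recomputing it plus every deleted ancestor, back to the nearest stored data sets. The function `generation_cost`, the graph and the cost figures are all hypothetical and are not taken from the book.

```python
def generation_cost(target, preds, compute_cost, stored):
    """Cost of regenerating `target` from the nearest stored ancestors.

    preds:        maps each data set to the data sets it is generated from
    compute_cost: cost of the computation that produces each data set
    stored:       data sets currently kept in storage
    Assumption: a deleted predecessor must itself be regenerated first,
    and each deleted ancestor is recomputed only once.
    """
    needed = set()
    stack = [target]
    while stack:
        node = stack.pop()
        if node in needed:
            continue
        needed.add(node)
        for p in preds.get(node, ()):
            if p not in stored:  # stored predecessors are read, not recomputed
                stack.append(p)
    return sum(compute_cost[n] for n in needed)


# Hypothetical graph: d1 -> d2 -> {d3, d4} -> d5
preds = {"d2": ["d1"], "d3": ["d2"], "d4": ["d2"], "d5": ["d3", "d4"]}
compute_cost = {"d1": 10, "d2": 20, "d3": 5, "d4": 8, "d5": 3}

print(generation_cost("d5", preds, compute_cost, stored={"d1"}))
print(generation_cost("d5", preds, compute_cost, stored={"d1", "d2"}))
print(generation_cost("d5", preds, compute_cost, stored={"d1", "d3", "d4"}))
```

The more of a data set's ancestors are stored, the cheaper it is to regenerate; that dependence between storage decisions is what makes the trade-off non-trivial.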

Based on the cost model, we develop novel algorithms that calculate the minimum cost of storing data sets in the cloud, i.e. the best trade-off between computation and storage. This minimum cost is a benchmark for evaluating the cost-effectiveness of different storage strategies in the cloud. For different situations, we develop different benchmarking approaches with polynomial time complexity for a seemingly NP-hard problem: (1) the static on-demand approach is for situations in which only occasional benchmarking is requested; and (2) the dynamic on-the-fly approach is suitable for situations in which more frequent benchmarking is requested at runtime.
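
The book's own algorithms are not shown in this preview. The following sketch only illustrates what a minimum cost benchmark means, under strong assumptions: the data sets form a single linear chain, the original input is always available, and a deleted data set is regenerated from its nearest stored predecessor on every use. The function `min_cost_rate` and all figures are hypothetical. It uses a simple dynamic programme over the position of the last stored data set, which runs in polynomial time for a chain.

```python
def min_cost_rate(x, y, v):
    """Minimum daily cost for a linear chain of data sets d1..dn.

    x[i]: computation cost of generating data set i+1 from its predecessor
    y[i]: daily storage cost of data set i+1
    v[i]: uses per day of data set i+1
    Returns (minimum cost rate, 1-based positions of the stored data sets).
    """
    n = len(x)
    inf = float("inf")
    # best[j]: cheapest cost of d1..dj given that dj is stored.
    # Position 0 is a virtual start (the original input), n + 1 a virtual end.
    best = [inf] * (n + 2)
    choice = [0] * (n + 2)
    best[0] = 0.0
    for j in range(1, n + 2):
        for i in range(j):
            # Data sets i+1 .. j-1 are deleted; each use regenerates them
            # starting from the stored data set at position i.
            regen, chain = 0.0, 0.0
            for k in range(i + 1, j):
                chain += x[k - 1]
                regen += v[k - 1] * chain
            store = y[j - 1] if j <= n else 0.0
            candidate = best[i] + regen + store
            if candidate < best[j]:
                best[j], choice[j] = candidate, i
    # Walk back from the virtual end to recover which data sets are stored.
    stored, j = [], choice[n + 1]
    while j:
        stored.append(j)
        j = choice[j]
    return best[n + 1], sorted(stored)


cost, stored = min_cost_rate(x=[10, 20, 5], y=[1, 30, 0.5], v=[0.5, 0.1, 1.0])
print(cost, stored)
```

For this three-element chain, the cheapest option is to store the first and third data sets and regenerate the second, whose storage is expensive and whose use is rare. Any runtime strategy can then be judged by how close its cost comes to this benchmark.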

We also develop novel cost-effective storage strategies for users to apply at runtime in the cloud. These strategies differ from the minimum cost benchmarking approaches: users may have preferences regarding the storage of particular data sets for reasons other than cost, e.g. guaranteeing immediate access to certain data sets. Hence, users’ preferences should also be considered in a storage strategy. Based on these considerations, we develop two cost-effective storage strategies for different situations: (1) the cost-rate-based strategy is highly efficient with fairly reasonable cost-effectiveness; and (2) the local-optimisation-based strategy is highly cost-effective with very reasonable time complexity.
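
To show how a runtime strategy might combine cost with user preferences, here is a hedged sketch. It is not the book's cost-rate-based strategy, whose details are not given in this preview. It assumes a linear chain of data sets processed in generation order and a simple rule: store a data set if the user has pinned it, or if regenerating it on every use would cost more per day than storing it. The function `decide_storage` and the figures are hypothetical.

```python
def decide_storage(datasets, pinned=frozenset()):
    """Greedy store-or-delete decision for each newly generated data set.

    datasets: (name, compute_cost, storage_rate, uses_per_day) tuples
              in generation order, forming a linear chain
    pinned:   names the user wants stored regardless of cost
    """
    stored = []
    carried = 0.0  # compute cost of deleted predecessors since the last stored set
    for name, compute_cost, storage_rate, uses_per_day in datasets:
        gen_cost = carried + compute_cost
        if name in pinned or gen_cost * uses_per_day > storage_rate:
            stored.append(name)
            carried = 0.0       # later data sets can start from this one
        else:
            carried = gen_cost  # later regeneration must redo this step too
    return stored


chain = [
    ("d1", 10, 1.0, 0.5),
    ("d2", 20, 30.0, 0.1),
    ("d3", 5, 0.5, 1.0),
]

print(decide_storage(chain))                 # cost-driven decisions only
print(decide_storage(chain, pinned={"d2"}))  # user wants immediate access to d2
```

Pinning a data set raises the cost above what the cost-driven rule alone would choose, which is why such a strategy is evaluated against a benchmark and not expected to match it.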

To the best of our knowledge, this book is the first comprehensive and systematic work investigating the issue of the computation and storage trade-off in the cloud in order to reduce the overall application cost. Its major contribution is that, by proposing innovative concepts, theorems and algorithms, it helps bring the cost down dramatically for both cloud users and service providers running computation and data-intensive scientific applications in the cloud.

1 Introduction

This book investigates the trade-off between computation and storage in the cloud. This is a brand new and significant issue for deploying applications with the pay-as-you-go model in the cloud, especially computation and data-intensive scientific applications. The novel research reported in this book helps both cloud service providers and users to reduce the cost of storing large generated application data sets in the cloud. A suite consisting of a novel cost model, benchmarking approaches and storage strategies is designed and developed with the support of new concepts, solid theorems and innovative algorithms. Experimental evaluations and a case study demonstrate that our work helps bring the cost down dramatically for running computation and data-intensive scientific applications in the
