Professional Documents
Culture Documents
TRENDS REPORT
2017
Data Warehousing
Preferences and Challenges
Panoply Annual Survey
Table of Contents
Executive Summary 3
Final Notes 11
About Panoply 11
Copyright Notice. Any Panoply information that is to be used in advertising, press releases, or promotional
materials requires prior written approval from the Panoply CEO. A draft of the proposed document should
accompany any such request. Panoply reserves the right to deny approval of external usage for any reason.
Copyright 2017 Panoply Ltd. Reproduction without written permission is completely forbidden.
2 0 1 7 PA N O P LY DATA WA R E H O U S E T R E N D S R E P O RT
DATA WA R E H O U S I N G P R E F E R E N C E S A N D C H A L L E N G E S
Executive Summary
Amazon re:Invent is always a rewarding experience, providing not only opportunities
to demonstrate Panoplys automated data warehouse solutions to thousands of IT
professionals, but also to gather feedback from industry professionals, as a means to
gauge cloud-industry trends and understand the markets most pressing needs. In
that spirit, we conducted a survey during our time at re:Invent 2016 that offers insight
into todays use of data warehousing systems, in particular Redshift, while also
exposing users biggest challenges and advantages.
Our survey includes input from more than 800 re:Invent attendees who answered a
ten-question survey, investigating level of satisfaction with their current data
warehousing solution and exploring more deeply the grounds for their responses.
The respondents come from diverse sectors, and hold a wide variety of positions or
specialties within their organizations.
Findings from the survey support what industry experts such as Gartner have been
saying:
A data warehouse is a collection of data in which multiple disparate data sources can
be loaded and integrated together into the same repository. The systems logical
3
2 0 1 7 PA N O P LY DATA WA R E H O U S E T R E N D S R E P O RT
DATA WA R E H O U S I N G P R E F E R E N C E S A N D C H A L L E N G E S
design facilitates the integration of data sources and allows the generation of new,
additional valuable data sources without signicant structural adjustment.
Ultimately, a data warehouse should be larger than the sum of its data, and serve as
an ongoing intelligent resource for use by multiple members of an organization,
large or small.
In addition to the explosive growth in the amount of data and data sources weve
seen in recent years, another motivation for creating even more sophisticated data
warehousing systems is the ever-increasing need for customizable business
intelligence and analytics.
In this paper, we will examine what these results imply about the current state of
affairs in the data warehouse community, as well as what industry leaders must
address in order to adapt to and keep up with their data demands. We will see how
Amazon Redshift continues to satisfy its core base of customers, and how the results
indicate what experts predict; that AWS Redshift is gaining traction amongst its
strongest competitors.
1 https://aws.amazon.com/redshift/
2 https://cloud.google.com/bigquery/ 4
2 0 1 7 PA N O P LY DATA WA R E H O U S E T R E N D S R E P O RT
DATA WA R E H O U S I N G P R E F E R E N C E S A N D C H A L L E N G E S
5
DATA WA R E H O U S E T R E N D S R E P O RT 2 0 1 7
DATA WA R E H O U S I N G P R E F E R E N C E S A N D C H A L L E N G E S
All in all, organizations are becoming increasingly aware of the value of data
warehousing beyond simple storage; theyre calling for better ways to extract
information from their data and analyzing it. We believe that Amazon Redshift holds
the key to simplifying that task, making data warehouse accessible and effective not
only for large and well-funded enterprises, but also medium and, perhaps, even
small ones.
The fact is that companies are employing a data warehouse solution but still suffer
as a result of its complexity, plus dont use ETL tools. And while the survey results
indicate that Amazon Redshift is experiencing traction, we also conclude that the
use of cloud-based data warehousing, complemented by a rich BI tool such as
Tableau, is still in its early stages.
6
DATA WA R E H O U S E T R E N D S R E P O RT 2 0 1 7
DATA WA R E H O U S I N G P R E F E R E N C E S A N D C H A L L E N G E S
60% 21% 9% 7% 3%
Redshift On premise Azure Other BigQuery
59% Difcult
27% Easy
10% Very Difcult
4% Very Easy
64% 25% 8% 3%
7
DATA WA R E H O U S E T R E N D S R E P O RT 2 0 1 7
DATA WA R E H O U S I N G P R E F E R E N C E S A N D C H A L L E N G E S
4% 9%
19%
25% 47%
26%
51%
19%
PERFORMANCE COMPLEXITY
OTHER COST
SIMPLICITY PERFORMANCE
COST OTHER
REDSHIFT USERS
UNSATISSFIED-REASONS
8
DATA WA R E H O U S E T R E N D S R E P O RT 2 0 1 7
DATA WA R E H O U S I N G P R E F E R E N C E S A N D C H A L L E N G E S
61% NONE
10% TALEND
9% SEGMENT
9% OTHER
6% STITCH
3% FIVETRAN
2% ATOM
USE OF BI TOOLS
35%
25%
8% 7% 6% 5% 5% 4% 3% 2%
Tableau None PowerBI DataBricks Other QlikView Domo Looker Chartio Sisense
67%
Redshift
17% 9%
On Premise Other 7%
Azure
9
DATA WA R E H O U S E T R E N D S R E P O RT 2 0 1 7
DATA WA R E H O U S I N G P R E F E R E N C E S A N D C H A L L E N G E S
Noting that re:Invent participants, in general, and especially visitors to our booth,
were most likely cloud users, we assume the population from which we gathered
information skews in the direction of cloud-based data warehouse users; and
possibly even towards Amazon Redshift users, in particular.
Despite the possible biases of our respondents in terms of their preference toward
Amazon services, the individuals who participated in the survey do span various
industries from software to nance, apparel to healthcare. Their roles within the
companies range from IT to management, data engineering to sales and marketing.
Therefore, we believe the results to be comprehensive and reective of a wide
spectrum of users, inuencers, and decision-makers within data organizations.
COMPANY SIZE
300
250
200
150
100
50
10
DATA WA R E H O U S E T R E N D S R E P O RT 2 0 1 7
DATA WA R E H O U S I N G P R E F E R E N C E S A N D C H A L L E N G E S
RESPONDANTS BY INDUSTRY
3% 13% 8% 7%
22% 3% 7% 30%
2% 2% 2% 1%
11
DATA WA R E H O U S E T R E N D S R E P O RT 2 0 1 7
DATA WA R E H O U S I N G P R E F E R E N C E S A N D C H A L L E N G E S
Final Notes
Survey Results Align with Gartners Magic Quadrant for Data Warehouse and Data
Management Solutions for Analytics
In March, in conjunction with its release of 2016 Magic Quadrant for Data Warehouse
and Data Management Solutions for Analytics, leading industry analyst Gartner
cautioned all market leaders, including IBM, Microsoft, Oracle, SAP, and Teradata, to
recognize the competition facing them as data warehousing has moved into the
cloud, in particular from Amazon Redshift, who we see has gained great traction. Our
survey supports Gartners conclusions and shows that due to a growing desire for
better use of data, as well as system management challenges data professionals still
face, Amazon Redshift continues to provide users with a more satisfying data
warehouse experience, especially when complemented with products like those
offered by Panoply, that ll the holes in AWS Redshifts service.
Gartner notes the continued impact the public cloud is having on the way IT
professionals approach their organizational responsibilities, as well as users
expectations for logical data warehouses. They predict a data warehouse
transformation as a result; another reason Amazon Web Services is getting closer and
closer to becoming a real contender for market leadership, especially when supported
by end-to-end data management platforms such as Panoply.
About Panoply
Our story begins with an idea: In the Big Data era, free up your data engineers and scientists, and you create
value for your customers and your business. Its simple, right? We believe in taking the load off the IT and data
engineers that have long been mired in time-intensive tasks like schema building, data mining, complex
modelling, performance tuning... Our easy-to-use platform gives small and medium businesses the tools to
harness Big Data and get analytics quickly, so they can make faster and better business decisions.