Professional Documents
Culture Documents
Abstract
Technological advances gave birth to a million software to facilitate work in com-
plex data analysis. SWANSTAT is a user-friendly web application developed from the
R programming language that compiles from the best various packages and is built with
shinydashboard Package. The motivation to create SWANSTAT is to streamline the rou-
tine work flow of data analysis so that users unfamiliar with R can perform the analysis
interactively in a web browser with a high-ability server using a shiny server. SWANSTAT
is a new statistical software with various features including best visualization, the latest
statistical methods, help documents and tutorials, and can be accessed online using varios
types of browsers. This research will be continuous for the development of the application
by enriching the latest statistical method, as well as the improvement of the quality of
features for data science needs.
1. Introduction
Technological developments are more and more increasing which is marked by the birth of
various sophisticated software as an aids to facilitate people in doing their works. Nowadays,
both comercial and open-source statistical softwares haves been been developed to analyze
data that is so complex. A software which has been becoming popular and used by many
statisticians and data scientist across nations is R that was first discovered by Ross Ihaka
and Robert Gentleman in 1992 from the Statistics Department, University of Auckland, New
2 SWANSTAT: A User-Friendly Web Application for Data Analysis using Shinydashboard
Zealand (R Core Team 2017). The R software is open-source software so it collects thousands
of packages made by various contributors in the world and has high quality in processing
large data. The users of R may utilize R studio and shinydashboard package (Chang and
Ribeiro 2018) to develop easy and interactive application which may assists data analyst to
handle their analytical problem by avoiding to build program script by themselves.
Based on the authors‘ experience in conducting analytics trainings, many people face a big
challenge in writing R programing due to their lack of knowledge in the language and pro-
graming structure. It is easier for them to use a click-and-drag application and not worrying
about how the application handle the computation. However, it is not easy for them to find
a user friendly software which is free (open source) and consist of various kinds of statistical
analyses.
This research aims to design an application by utilizing the sophistication of R software to be
more user friendly, namely creating statistical tool interfaces from various existing packages.
Then the developed application can be accessed online using a highly capable server with the
help of a shiny server called SWANSTAT. In addition, the SWANSTAT application is also
equipped with a help document in the form such white papers and video tutorials to facilitate
users and to provide examples of the implementation.
2. Outline of Usage
3
4
of the analysis. For example, we will use data driven, iris, for analyzing correlation between
quantitative varaibles. Then the first step is to choose the Correlation menu then select the
Bivariate submenu. Enter all quantitative variables that will be correlated in the Select Mul-
tiple Variable box. For this research we will use the Person correlation method. Visualization
of the correlation analysis can be displyed via the Plot Options tab menu. For correlation
analysis, the main package of SWAN-STAT is corrplot (Wei and Simko 2017). Figure 3 shows
the stages of using correlation analysis with SWANSTAT. To run the application, select the
RUN button then the analysis output can be seen in the Summary and Plot tab menu as
shown in Figure 4.
4 SWANSTAT: A User-Friendly Web Application for Data Analysis using Shinydashboard
1
4
3. Feature of SWANSTAT
Each application certainly has its own advantages. By combining various packages inside the
R, SWANSTAT is equipped with various features provided to facilitate users in operating
and analyzing data. The features provided by this application include Visualization, More
Statistics Methods, Help Document, and Save Output Report.
3.1. Visualization
Interesting visualization can help users easily provide interpretation of an analysis. This ap-
plication is equipped with visualization that combines various packages of Plots in R including
graphics (R Core Team 2017), ggplot2 (Wickham, Chang, Henry, Pedersen, Takahashi, Wilke,
Journal of Statistical Software 5
and Woo 2018) and plotly package (Sievert, Parmer, Hocking, Chamberlain, Ram, Corvellec,
and Despouy 2017). Figure 5 shows some graphic forms available in various statistical tools
in SWANSTAT.
Meanwhile the tutorial video about the step by step of the use of this application can be
accessed SWAN Statistics Academy channel in Youtube.
Acknowledgments
The main idea of the development of this application came from the publicâĂŹs anxiety over
the use of coding-based software. The team expressed many tahnks to the IPB Statistics
Lecturers who have supported the development of this application. Especially to Mr. Prof.
Dr. Asep Saefuddin, Prof. Dr. Khairil Anwar Notodiputro, Bapak Dr. Anang kurnia and Mr.
Dr. Bagus Sartono who has become our advisor. We also say thanks to all the contributor
packages that we have used in developing this application.
Journal of Statistical Software 7
References
Kuhn M (7). caret : Classification and Regression Training. R package version 6.0-80, URL
https://CRAN.R-project.org/package=caret.
Moon KW (2017). semMediation : Visualize the Mediation Effect for Structural Equa-
tion Modelling. R package version 0.1.1, URL https://github.com/cardiomoon/
semMediation.
R Core Team (2017). R: A Language and Environment for Statistical Computing. R Founda-
tion for Statistical Computing, Vienna, Austria. URL https://www.R-project.org/.
Rosseel Y (2018). lavaan : Latent Variable Analysis. R package version 0.6-2, URL https:
//CRAN.R-project.org/package=lavaan.
R Studio Team (2016). Shiny Server Open Source v1.4.0 Administrator’s Guide. R Studio,
Northern Ave, Boston. URL http://rstudio.github.io/shiny-server/os/latest/.
Wei T, Simko V (2017). An Introduction to corrplot Package. R package version 0.84, URL
https://CRAN.R-project.org/package=corrplot.
Wickham H, Chang W, Henry L, Pedersen TL, Takahashi K, Wilke C, Woo K (2018). ggplot2
: Create Elegant Data Visualisations Using the Grammar of Graphics. R package version
3.0.0, URL https://CRAN.R-project.org/package=ggplot2.
Wuertz D (2017). fBasics : Rmetrics - Markets and Basic Statistics. R package version
3042.89, URL https://CRAN.R-project.org/package=fBasics.
8 SWANSTAT: A User-Friendly Web Application for Data Analysis using Shinydashboard
Affiliation:
Rizal Bakri
STIEM Bongaya
SWANSTATISTICS
IPB University
90223 Makassar, South Sulawesi, Indonesia
E-mail: rizal.bakri@stiem-bongaya.ac.id
URL: https://apps.swanstatistics.com/swanstat