SupStat offers high-quality classroom and online training in statistical software use, applied statistical methods, data mining, and data visualization. Our goal is to empower students to utilize modern analytic tools in the management of data in a wide range of industry applications and research fields.
We offer training for individuals and corporate clients in New York, Beijing, and Shanghai. The following is a selection of our core data science courses being offered at the NYC Data Science Academy, the training brand of SupStat in the New York City area. Please visit for our full course listing and corporate training opportunities.
Intro to Data Science with R(workday public workshop)
Data Science with Python: Machine Learning

Data Science with R: Data Analysis

Data Science with Python: Data Analysis

Big Data with Hadoop: 5 Real World Applications

Big Data with Hadoop: Data Engineer Professionals

Training Course Modules

We have more than 10 years experience in developing data analytic solutions with R. We offer the best learning experience and introduce more people into the R world. We can provide beginner, intermediate and advanced level customized content based on participants’ feedback and domain focus. The following are some of our courses:
Intro to Data Analysis
Basic programming, data handling, plotting and analytics
Visualization with R
Basic visualization packages, such as lattice and ggplot2, as well as
advanced packages such as iPlots, googleVis and RGGobi, and rChart
R Software and Applied Statistics
Introduction to the R language, programming basics, plotting, basic statistics,
regression, classification, association rules, logistic regression, generalized
linear models, multi-layer models, PLS and SEM path models, time series,
and state space models.
Data Mining
Focusing on real applications of R in the Data Mining field, we teach
the caret package and how to write classification, regression, and clustering
Advanced Programing
The course emphasizes good practices in object-oriented programming in
R, including designing and developing clear, concise and efficient code,
understanding S3 and S4 objects, functional programming, as well as making
R packages.
Automated Reporting
We teach how to use the Knitr package to create reproducible and automated
R and Optimization
We provide an Introduction to mathematical programming and optimization
through several classical theories and models. Specific packages introduced
include lpSolve, TSP, Rsolnp, and Rglpk, all of which can be used to help
companies better optimize management.
R and Web Services
We cover deployment of R applications on the server via Rserver and OpenCPU,
allowing client connections through a web service’s API.
Quantitative Finance
We introduce quantitative investment theory and modeling through theuse
of high-efficiency R programming.
Big Data with R and RHadoop
Configuration-related knowledge of RHadoop under Ubuntu with the completion
of the data analysis process using RHadoop.
Text Mining with R
Describes the principles of HMMs and their application through the use
of Rwordseg, tm, and rmmseg4j packages.
R and Database Management
Use R inside MySQL to connect the R language with its data mining and
analytics capabilities to your database.
R and High Performance Computing
Improve computational efficiency in R by utilizing parallel computing
and compilation procedures
Download the PDF file and print it.

Our Customers

The following firms have improved the expertise of their teams through training with SupStat:
our customers