INFOSOFT IT SOLUTIONS - Business Analytics with R Training Syllabus

Business Analytics with R Training

Home
Courses

Business Analytics with R Training

Introduction to Business Analytics

Business Analytics involves using data, statistical analysis, and modeling to make better business decisions. This module introduces the fundamentals of Business Analytics, including the role of data in decision-making and the key concepts of descriptive, predictive, and prescriptive analytics.

Getting Started with R

Learn how to set up and use R, a powerful programming language for statistical analysis and data visualization. This section covers installing R and RStudio, understanding the R environment, and basic R programming concepts.

Data Manipulation and Cleaning in R

Discover how to manipulate and clean data in R using packages like dplyr and tidyr. Learn techniques for filtering, sorting, aggregating, and transforming data to prepare it for analysis.

Data Visualization with R

Explore data visualization techniques using R packages like ggplot2. Learn how to create various types of plots and charts, customize visual elements, and effectively communicate data insights through visualization.

Exploratory Data Analysis (EDA) in R

Understand the process of Exploratory Data Analysis (EDA) and how to perform it using R. Learn techniques for summarizing data, identifying patterns and trends, and uncovering hidden insights.

Statistical Analysis in R

Learn how to perform statistical analysis using R. This section covers key statistical concepts, hypothesis testing, regression analysis, and other statistical methods to derive insights from data.

Predictive Modeling with R

Discover how to build predictive models using R. Learn about various modeling techniques such as linear regression, logistic regression, decision trees, and machine learning algorithms to predict future outcomes.

Time Series Analysis in R

Explore time series analysis and forecasting using R. Learn about time series decomposition, smoothing techniques, and building models to analyze temporal data and make forecasts.

Text Mining and Sentiment Analysis in R

Learn how to perform text mining and sentiment analysis using R. This section covers techniques for extracting insights from textual data, analyzing sentiment, and applying natural language processing (NLP) methods.

Data Mining with R

Understand the process of data mining and how to apply it using R. Learn about clustering, association rule mining, and other data mining techniques to discover patterns and relationships in data.

Building Interactive Dashboards with R Shiny

Discover how to create interactive dashboards and web applications using R Shiny. Learn how to build dynamic, user-friendly interfaces to present data and analytics results effectively.

Real-World Business Analytics Case Studies

Review real-world case studies and examples of Business Analytics with R. Learn from practical scenarios to understand how analytics is applied across various industries to drive business decisions.

Best Practices and Advanced Techniques

Learn best practices for effective Business Analytics with R. Discover advanced techniques for data handling, performance optimization, and ensuring the accuracy and reliability of your analytics results.

Business Analytics with R Course Syllabus

1. Data Science Project Lifecycle

Recap of Demo
Introduction to Types of Analytics
Project Lifecycle
An Introduction to Our E-Learning Platform

2. Introduction to Basic Statistics Using R and Python

Data Types
Measure of Central Tendency
Measures of Dispersion
Graphical Techniques
Skewness & Kurtosis
Box Plot
R
R Studio
Descriptive Stats in R
Python (Installation and Basic Commands) and Libraries
Jupyter Notebook
Set up GitHub
Descriptive Stats in Python
Pandas and Matplotlib / Seaborn

3. Probability and Hypothesis Testing

Random Variable
Probability
Probability Distribution
Normal Distribution
Standard Normal Distribution (SND)
Expected Value
Sampling Funnel
Sampling Variation
Central Limit Theorem (CLT)
Confidence Interval
Assignments Session-1 (1 hr)
Introduction to Hypothesis Testing
Hypothesis Testing with Examples
Two-Proportion Test
Two-Sample T-Test
ANOVA and Chi-square Case Studies

4. Exploratory Data Analysis - 1

Visualization
Data Cleaning
Imputation Techniques
Scatter Plot
Correlation Analysis
Transformations
Normalization and Standardization

5. Linear Regression

Principles of Regression
Introduction to Simple Linear Regression
Multiple Linear Regression

6. Logistic Regression

Multiple Logistic Regression
Confusion Matrix
False Positive, False Negative
True Positive, True Negative
Sensitivity, Recall, Specificity, F1 Score
Receiver Operating Characteristics Curve (ROC Curve)

7. Deployment

R Shiny
Streamlit

8. Data Mining and Unsupervised Clustering

Supervised vs Unsupervised Learning
Data Mining Process
Hierarchical Clustering / Agglomerative Clustering
Measure of Distance
Numeric - Euclidean, Manhattan, Mahalanobis
Categorical - Binary Euclidean, Simple Matching Coefficient, Jaccard’s Coefficient
Mixed - Gower’s General Dissimilarity Coefficient
Types of Linkages: Single Linkage, Complete Linkage, Average Linkage, Centroid Linkage
Visualization of Clustering Algorithm Using Dendrogram

9. Dimension Reduction Techniques

Principal Component Analysis (PCA) and t-Distributed Stochastic Neighbor Embedding (tSNE)
Why Dimension Reduction?
Advantages of PCA
Calculation of PCA Weights
2D Visualization Using Principal Components
Basics of Matrix Algebra

10. Association Rules

What is Market Basket / Affinity Analysis?
Measure of Association
Support
Confidence
Lift Ratio
Apriori Algorithm

11. Recommender System

User-Based Collaborative Filtering
Measure of Distance / Similarity Between Users
Driver for Recommendation
Computation Reduction Techniques
Search-Based Methods / Item to Item Collaborative Filtering
Vulnerability of Recommender Systems

12. Introduction to Supervised Machine Learning

Workflow from Data to Deployment
Data Nuances
Mindsets of Modelling

13. Decision Tree

Elements of Classification Tree - Root Node, Child Node, Leaf Node, etc.
Greedy Algorithm
Measure of Entropy
Attribute Selection Using Information Gain
Implementation of Decision Tree Using C5.0 and Sklearn Libraries

14. Exploratory Data Analysis - 2

Encoding Methods
One-Hot Encoding (OHE)
Label Encoders
Outlier Detection - Isolation Forest
Predictive Power Score

15. Feature Engineering

Recursive Feature Elimination
PCA

16. Model Validation Methods

Splitting Data into Train and Test
Methods of Cross Validation
Accuracy Methods

17. Ensemble Techniques

Bagging
Boosting
Random Forest
XGBoost (XGBM)
LightGBM (LGBM)

18. K-Nearest Neighbors (KNN) and Support Vector Machines (SVM)

Deciding the K Value
Building a KNN Model by Splitting the Data
Understanding the Various Generalization and Regularization Techniques to Avoid Overfitting and Underfitting
Kernel Tricks

19. Regularization Techniques

Lasso Regression
Ridge Regression

20. Neural Networks

Artificial Neural Network (ANN)
Biological Neuron vs. Artificial Neuron
ANN Structure
Activation Function
Network Topology
Classification Hyperplanes
Best Fit “Boundary”
Gradient Descent
Stochastic Gradient Descent Intro
Back Propagation
Introduction to Concepts of Convolutional Neural Networks (CNN)

21. Text Mining

Sources of Data
Bag of Words
Pre-Processing, Corpus Document-Term Matrix (DTM) and Term-Document Matrix (TDM)
Word Clouds
Corpus Level Word Clouds
Sentiment Analysis
Positive Word Clouds
Negative Word Clouds
Unigram, Bigram, Trigram
Vector Space Modelling
Word Embedding
Document Similarity Using Cosine Similarity

22. Natural Language Processing (NLP)

Sentiment Extraction
Lexicons and Emotion Mining

23. Naive Bayes

Probability – Recap
Bayes Rule
Naive Bayes Classifier
Text Classification Using Naive Bayes

24. Forecasting

Introduction to Time Series Data
Steps of Forecasting
Components of Time Series Data
Scatter Plot and Time Plot
Lag Plot
Auto-Correlation Function (ACF) / Correlogram
Visualization Principles
Naive Forecast Methods
Errors in Forecast and its Metrics
Model-Based Approaches
Linear Model
Exponential Model
Quadratic Model
Additive Seasonality
Multiplicative Seasonality
Auto-Regressive (AR) Model for Errors
Random Walk
ARMA (Auto-Regressive Moving Average), Order p and q
ARIMA (Auto-Regressive Integrated Moving Average), Order p, d and q
Data-Driven Approach to Forecasting
Smoothing Techniques
Moving Average
Simple Exponential Smoothing
Holt's / Double Exponential Smoothing
Winters / Holt-Winters
De-Seasoning and De-Trending
Forecasting Using Python and R

Business Analytics with R Training

Business Analytics with R Training

Introduction to Business Analytics

Getting Started with R

Data Manipulation and Cleaning in R

Data Visualization with R

Exploratory Data Analysis (EDA) in R

Statistical Analysis in R

Predictive Modeling with R

Time Series Analysis in R

Text Mining and Sentiment Analysis in R

Data Mining with R

Building Interactive Dashboards with R Shiny

Real-World Business Analytics Case Studies

Best Practices and Advanced Techniques

Business Analytics with R Course Syllabus

1. Data Science Project Lifecycle

2. Introduction to Basic Statistics Using R and Python

3. Probability and Hypothesis Testing

4. Exploratory Data Analysis - 1

5. Linear Regression

6. Logistic Regression

7. Deployment

8. Data Mining and Unsupervised Clustering

9. Dimension Reduction Techniques

10. Association Rules

11. Recommender System

12. Introduction to Supervised Machine Learning

13. Decision Tree

14. Exploratory Data Analysis - 2

15. Feature Engineering

16. Model Validation Methods

17. Ensemble Techniques

18. K-Nearest Neighbors (KNN) and Support Vector Machines (SVM)

19. Regularization Techniques

20. Neural Networks

21. Text Mining

22. Natural Language Processing (NLP)

23. Naive Bayes

24. Forecasting

25. Survival Analysis

26. End-to-End Project Description with Deployment

Training

Basic Level Training

Advanced Level Training

Project Level Training

Total Training Period

Course Mode :

Course Fees :

Placement Benefit Services

Provide 100% job-oriented training

Develop multiple skill sets

Assist in project completion

Build ATS-friendly resumes

Add relevant experience to profiles

Build and enhance online profiles

Supply manpower to consultants

Supply manpower to companies

Prepare candidates for interviews

Add candidates to job groups

Send candidates to interviews

Provide job references

Assign candidates to contract jobs

Select candidates for internal projects

Note

100% Job Assurance Only

Daily online batches for employees

New course batches start every Monday