Course Description:
Data is everywhere-in businesses, schools, hospitals and research centers. This course is created to help learners make sense of the data using R. Whether you are a student, researcher or professional looking to upgrade your skills, this training introduces you to practical, real-world data analysis techniques you can apply immediately. We focus on hands-on learning, breaking down complex concepts into simple steps so that even complete beginners can follow along confidently.
Course Objectives:
By the end of this training, the learner should be able to:
- Use R and R Studio proficiently.
- Import, clean, and manipulate datasets.
- Perform exploratory data analysis (EDA).
- Apply statistical analysis techniques.
- Create compelling data visualizations.
- Develop machine learning models.
- Generate automated reports and dashboards.
- Work with real-world datasets.
Course Outline:
Module 1: Introduction to R and R Studio
- Setting up R and R Studio
- Understanding R syntax and basics
- R Data Types
- Vectors: Creation, Naming, and Manipulation
- R packages and libraries (tidy verse, ggplot2, dplyr, etc.)
Module 2: Data Preparation
- R Matrix: Creation, Printing, Adding Column, and Slicing
- Factors in R: Categorical and Continuous Variables
- R Data Frame: Creation, Appending, Selecting, and Sub setting
Module 3: Data and introduction to Functions.
- Lists in R: Creation and Selecting Elements
- Sorting a Data Frame using Order()
- Functions in R
Module 4 Data Manipulation and Transformation
- Filtering, selecting, and grouping data
- IF, ELSE, ELSE IF Statement in R
- Using pipes (%>%) for efficient data processing
Module 5: Data Importing and Cleaning
- Review of previous class assignments
- Handling missing values and data inconsistencies
- Reshaping data with tidyr
Module 6: Exploratory Data Analysis (EDA)
- Summary statistics (mean (), median (), summary ()
- Correlations in Pearson &spearman’s
Module 7: Introduction to Visualizations
- Detecting outliers and data distributions
- Data visualization with ggplot2
- Creating histograms, scatter plots, and boxplots
Module 8: Statistical Analysis in R
- Descriptive vs. inferential statistics
- Hypothesis testing (t-tests, chi-square tests)
- ANOVA and other statistical techniques
Module 9: Regression Analysis
- Simple linear regression
- Multiple linear regression
- Stepwise regression
- Generalized Linear Model (logistic regression and Poisson regression)
Module 10: R Markdown & Notebooks
- R Markdown Reports
- Introduction to R Markdown
- Formatting text using Markdown syntax
- Embedding R code using Knitr
R Notebook
- Creating a notebook
- Inserting code chunks
- Managing the execution queue
- Configuring the execution environment
Module 11: Project & Real-World Applications
- Working on a real-world dataset
- Developing insights and recommendations
- Presenting findings using R Markdown or Shiny dashboards
Module 12: Introduction to Machine Learning with R
- Understanding supervised vs. unsupervised learning
Deliverables:
- Completed R Programming Projects
- Data Analysis Reports
- Interactive Dashboards
- Machine Learning Model Implementations.
- Data Visualizations.
- Certificate of completion