Get in Touch

Course Outline

Day 1

Introduction and preliminaries

  • Enhancing the R experience: R and available GUIs
  • RStudio
  • Related software and documentation
  • The relationship between R and statistics
  • Interactive usage of R
  • An introductory session
  • Obtaining assistance with functions and features
  • R commands, case sensitivity, and other syntax rules
  • Retrieving and correcting previous commands
  • Executing commands from files or redirecting output to files
  • Managing data persistence and removing objects

Simple manipulations; numbers and vectors

  • Vectors and assignment operations
  • Vector arithmetic
  • Generating regular sequences
  • Logical vectors
  • Handling missing values
  • Character vectors
  • Index vectors: selecting and modifying data subsets
  • Other object types

Objects, their modes and attributes

  • Intrinsic attributes: mode and length
  • Adjusting the length of an object
  • Retrieving and setting attributes
  • Understanding object classes

Ordered and unordered factors

  • Specific examples
  • The tapply() function and ragged arrays
  • Ordered factors

Arrays and matrices

  • Arrays
  • Array indexing and accessing subsections
  • Index matrices
  • The array() function
    • Mixed vector and array arithmetic, including the recycling rule
  • The outer product of two arrays
  • Generalized transpose of an array
  • Matrix facilities
    • Matrix multiplication
    • Linear equations and inversion
    • Eigenvalues and eigenvectors
    • Singular value decomposition and determinants
    • Least squares fitting and the QR decomposition
  • Creating partitioned matrices using cbind() and rbind()
  • The concatenation function with arrays
  • Generating frequency tables from factors

Day 2

Lists and data frames

  • Lists
  • Constructing and modifying lists
    • Concatenating lists
  • Data frames
    • Creating data frames
    • Using attach() and detach()
    • Working with data frames
    • Attaching arbitrary lists
    • Managing the search path

Data manipulation

  • Selecting, subsetting observations and variables
  • Filtering and grouping data
  • Recoding and transformations
  • Aggregation and combining data sets
  • Character manipulation using the stringr package

Reading data

  • TXT files
  • CSV files
  • XLS and XLSX files
  • Data in SPSS, SAS, Stata, and other formats
  • Exporting data to TXT, CSV, and other formats
  • Accessing database data via SQL

Probability distributions

  • Utilizing R as a set of statistical tables
  • Examining the distribution of data sets
  • One- and two-sample tests

Grouping, loops and conditional execution

  • Grouped expressions
  • Control statements
    • Conditional execution: if statements
    • Repetitive execution: for loops, repeat, and while

Day 3

Writing your own functions

  • Simple examples
  • Defining new binary operators
  • Named arguments and default values
  • The '..' argument
  • Assignments within functions
  • More advanced examples
    • Efficiency factors in block designs
    • Removing all names in a printed array
    • Recursive numerical integration
  • Scope
  • Customizing the environment
  • Classes, generic functions, and object orientation

Statistical analysis in R

  • Linear regression models
  • Generic functions for extracting model information
  • Updating fitted models
  • Generalized linear models
    • Families
    • The glm() function
  • Classification
    • Logistic Regression
    • Linear Discriminant Analysis
  • Unsupervised learning
    • Principal Components Analysis
    • Clustering Methods (k-means, hierarchical clustering, k-medoids)
  • Survival analysis
    • Survival objects in R
    • Kaplan-Meier estimate
    • Confidence bands
    • Cox PH models with constant covariates
    • Cox PH models with time-dependent covariates

Graphical procedures

  • High-level plotting commands
    • The plot() function
    • Displaying multivariate data
    • Display graphics
    • Arguments for high-level plotting functions
  • Basic visualization graphs
  • Analyzing multivariate relations with lattice and ggplot packages
  • Using graphics parameters
  • Graphics parameters list

Automated and interactive reporting

  • Combining R output with text
  • Creating HTML and PDF documents

Requirements

A solid understanding of statistical concepts is required.

 21 Hours

Number of participants


Price per participant

Testimonials (3)

Upcoming Courses

Related Categories