Course Outline

Introduction to Programming Big Data with R (pbdR)

  • Configuring your environment to utilize pbdR
  • Overview of the scope and tools provided by pbdR
  • Packages frequently used in conjunction with Big Data and pbdR

Message Passing Interface (MPI)

  • Utilizing pbdR MPI
  • Implementing parallel processing
  • Point-to-point communication methods
  • Transmitting matrices
  • Performing matrix summation
  • Executing collective communication
  • Summing matrices using Reduce
  • Scatter and Gather operations
  • Additional MPI communication techniques

Distributed Matrices

  • Constructing a distributed diagonal matrix
  • Computing the Singular Value Decomposition (SVD) of a distributed matrix
  • Building a distributed matrix in parallel

Applications in Statistics

  • Monte Carlo Integration
  • Loading datasets
  • Accessing data across all processes
  • Broadcasting data from a single process
  • Handling partitioned data
  • Distributed Regression
  • Distributed Bootstrap

Duration: 21 hours
