Created by:

Profile Photo

Last updated:

September 25, 2023


Unlimited Duration


This course includes:

Unlimited Duration

Badge on Completion

Certificate of completion

Unlimited Duration


Divide and Recombine for the Analysis of Big Data by William S. Cleveland - Machine Learning Summer School at Purdue, 2011. Divide and Recombine (D&R) consists of the general approach of parallelizing big data, statistical methods for division and recombination, sampling and display methods for visualization of samples of subsets, computational methods, and computational environments.

In D&R, the data are broken up into structured subsets, general analysis methods are applied to each subset, and the results of the analyses recombined. The necessary steps of data division and recombination open up an exciting area of research in statistical theory and methods, and there are already a number of very useful results. The steps also open up research in computational methods and hardware-software environments, and here, too, there are important results.

By introducing the exploitable parallelization of the data, D&R succeeds in making it possible to apply to big data almost any existing analysis method from statistics, machine learning, and visualization. This enables detailed, comprehensive analysis of big data at all stages of the analysis process, starting with the raw data. This includes detailed visualization at all stages, not just to reduced data such as summary statistics, results of dimension reduction methods, fitted models, and the output of algorithms applied to the detailed data. Visualization at all stages substantially reduces the chances of losing critical information in the data.

Course Curriculum

  • Lecture 1 – D&R for the Analysis of Big Data (Part 1) Unlimited
  • Lecture 2 – D&R for the Analysis of Big Data (Part 2) Unlimited
  • Lecture 3 – D&R for the Analysis of Big Data (Part 3) Unlimited
  • Lecture 4 – D&R for the Analysis of Big Data (Part 4) Unlimited
  • Lecture 5 – D&R for the Analysis of Big Data (Part 5) Unlimited
  • Lecture 6 – D&R for the Analysis of Big Data (Part 6) Unlimited
  • Lecture 7 – D&R for the Analysis of Big Data (Part 7) Unlimited
  • Lecture 8 – D&R for the Analysis of Big Data (Part 8) Unlimited

About the instructor

5 5

Instructor Rating







Profile Photo
We are an educational and skills marketplace to accommodate the needs of skills enhancement and free equal education across the globe to the millions. We are bringing courses and trainings every single day for our users. We welcome everyone woth all ages, all background to learn. There is so much available to learn and deliver to the people.