목차


ppt

MapReduce = Programming Model + Execution Framework


Cluster Architecture

스크린샷 2023-12-20 오전 6.14.35.png


Cluster Architecture

Untitled


Simplest environment for parallel processing


MapReduce Execution Framework


Distributed File System


Distributed File System


GFS/HDFS


Distribution of the input


Execution Flow Overview

스크린샷 2023-12-20 오전 6.17.38.png


Overall schematic diagram for MapReduce framework

스크린샷 2023-12-20 오전 6.17.55.png


Master


MapReduce: Step-by-Step Execution


MapReduce: Step-by-Step Execution


MapReduce: Output


Execution Overview

(1)MapReduce splits the Input files into M “splits” then Starts many copies of program on servers

스크린샷 2023-12-20 오전 6.19.57.png


(2) One copy(the master) is special. The rest are workers. The master picks idle workers And assigns each 1 of M map tasks or 1of R reduce tasks.

스크린샷 2023-12-20 오전 6.20.17.png