Systems Design and Architecture 🔥

Back to course sections

Fundamentals of Systems Design

A Systems Design Interview Primer For New Engineers

How to Ace Systems Design Interviews: Tips and Strategies

From Byte to Gigabyte to Petabyte: Understanding Data Size

Introduction to Computer Networking and Protocols

REST, RPC, and Distributed API Design

Metrics: Latency, CPU, Memory, Error Rates

Monoliths vs. Microservices for Junior Engineers

Microservices Interview Questions

What is Caching? An Introduction to Strategies

Cookie vs. Token Authentication

Feature: Software Architectural Patterns & Design Structures

What is a Load Balancer? An Introduction

What is MapReduce and How Does It Work?

Potential Bottlenecks in Software Performance Testing

What Is a Message Queue?

Pub-Sub and Event Driven Architecture (EDA)

What Is Cloud Computing? What to Know In Interviews

How do Routers, Switches, and Hubs work?

What is a Proxy Server?

Best Cloud Platforms for Software Development

Cloud Computing Pricing Analysis and Comparisons

API Gateways and Backend-for-Frontend (BFF) Patterns

Mark As Completed Discussion

Home > Systems Design and Architecture 🔥 > Fundamentals of Systems Design > What is MapReduce and How Does It Work?

Example of MapReduce

You can better understand, how MapReduce works by taking an example where we would have a text file called example.txt whose contents are:

Deer, Bear, River, Car, Car, River, Deer, Car, Bear

Now, we can perform a word count on the sample.txt using MapReduce. So, we will be finding unique words and the number of occurrences of those unique words.

Divide the input into three splits as shown in the diagram. This will distribute the work among all the map nodes
Tokenize the words in each of the mappers and give a hardcoded value (1) to each of the tokens or words
A list of key-value pairs is created where the key is nothing but the individual words and the value is one. So, for (Deer Bear River) we have — Deer, 1; Bear, 1; River, 1
Sorting and shuffling happen so that all the tuples with the same key are sent to the corresponding reducer
After the sorting and shuffling phase, each reducer will have a unique key and a list of values corresponding to that very key. For example, Bear, [1,1]; Car, [1,1,1]...
Each Reducer counts the values which are present in that list of values, and gives the final output as — Bear, 2
All the output key/value pairs are collected and written in the output file

Example of MapReduce

Programming Categories

Basic Arrays Interview Questions

Binary Search Trees Interview Questions

Dynamic Programming Interview Questions

Easy Strings Interview Questions

Frontend Interview Questions

Graphs Interview Questions

Hard Arrays Interview Questions

Hard Strings Interview Questions

Hash Maps Interview Questions

Linked Lists Interview Questions

Medium Arrays Interview Questions

Queues Interview Questions

Recursion Interview Questions

Sorting Interview Questions

Stacks Interview Questions

Systems Design Interview Questions

Trees Interview Questions

Popular Lessons

All Courses, Lessons, and Challenges

Data Structures Cheat Sheet

Free Coding Videos

Bit Manipulation Interview Questions

Javascript Interview Questions

Python Interview Questions

Java Interview Questions

SQL Interview Questions

QA and Testing Interview Questions

Data Engineering Interview Questions

Data Science Interview Questions

Blockchain Interview Questions

Quantifiers and Alternation

Market Data Acquisition

Introduction to Code Variables and Assignment

Coding Problems

Real World Deep Neural Network Examples