CC Folder

Overview

The CC folder contains a single cloud computing script: a MapReduce-style word count.

File: mapreduce.py

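Description: Counts word frequencies across multiple documents in the style of MapReduce. The map step splits each document into lowercase words, and collections.Counter performs the reduce step by tallying them. Everything runs in a single process; no cluster or Hadoop installation is involved.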

Dependencies: collections (Python standard library, no installation required)

Code:

from collections import Counter

documents = [
    "Hello world",
    "Hello Hadoop",
    "Hello MapReduce world",
]
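# Map: split every document into lowercase words.
words = [word.lower() for doc in documents for word in doc.split()]
# Reduce: tally how often each word occurs.
word_counts = Counter(words)
# Print the counts in alphabetical order.
for word, count in sorted(word_counts.items()):
    print(f"{word}: {count}")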
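
Explicit phases (sketch):

mapreduce.py folds the map and reduce steps into a list comprehension and a Counter. For comparison, here is a minimal sketch of the same word count with the map, shuffle, and reduce phases written out separately. The function names (map_phase, shuffle_phase, reduce_phase, word_count) are hypothetical and do not appear in mapreduce.py, and the whole thing still runs in one process rather than on a cluster.

```python
from collections import defaultdict


def map_phase(doc):
    """Map: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in doc.split()]


def shuffle_phase(pairs):
    """Shuffle: group the emitted values by their key (the word)."""
    grouped = defaultdict(list)
    for word, value in pairs:
        grouped[word].append(value)
    return grouped


def reduce_phase(word, values):
    """Reduce: collapse all values for one word into a single count."""
    return word, sum(values)


def word_count(documents):
    """Run the three phases in sequence and return a word -> count dict."""
    pairs = [pair for doc in documents for pair in map_phase(doc)]
    grouped = shuffle_phase(pairs)
    return dict(reduce_phase(word, values) for word, values in grouped.items())


documents = [
    "Hello world",
    "Hello Hadoop",
    "Hello MapReduce world",
]

for word, count in sorted(word_count(documents).items()):
    print(f"{word}: {count}")
```

With the three sample documents, this should print the same counts as the Counter version: hadoop 1, hello 3, mapreduce 1, world 2. Keeping the phases as separate functions makes it easier to see which parts a real framework would distribute: map_phase calls are independent per document, and reduce_phase calls are independent per word.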