Apache Hadoop

Apache Hadoop is a collection of open-source software utilities that facilitates using a network of many computers to solve problems involving massive amounts of data and computation. It provides a software framework for distributed storage and processing of big data using the MapReduce programming model.

This tutorial shows how to configure a high-availability multi-node cluster on AWS EC2 Ubuntu instances using ZooKeeper. Learn how to set up and launch ZooKeeper on AWS.

This video tutorial shows how to set up a two-node Hadoop cluster on Amazon EC2 instances. Here’s what you will need: an Amazon AWS account, the PuTTY Windows client (to connect to the Amazon EC2 instance), and PuTTYgen (to generate the private key –…