What Is Hadoop In Big Data | Apache Hadoop Introduction | Hadoop Tutorial For Beginners

May 2, 2021

What is Big data

Big Data is a collection of large and complex data sets that cannot be processed using traditional computing techniques.
It is still data, only at a far larger volume.

Sources of Big Data

Social networking sites: Facebook, Google, and LinkedIn all generate huge amounts of data every day, as they have billions of users worldwide.
E-commerce sites: Sites like Amazon, Flipkart, and Alibaba generate huge volumes of logs from which users' buying trends can be traced.
Weather stations: Weather stations and satellites produce very large amounts of data, which are stored and processed to forecast the weather.
Telecom companies: Telecom giants like Airtel and Vodafone study user trends and design their plans accordingly, and for this they store the data of their millions of users.
Share market: Stock exchanges across the world generate huge amounts of data through their daily transactions.

Types Of Big Data

The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models.

Working Procedure

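Hadoop is a framework that allows you to first store Big Data in a distributed environment so that you can then process it in parallel. There are basically three components in Hadoop: HDFS for distributed storage, YARN for cluster resource management, and MapReduce for parallel processing.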

Example:-
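The store-then-process-in-parallel idea can be sketched in plain Python. This is only a rough illustration and does not use Hadoop itself: the function names (`split_into_blocks`, `map_block`, `reduce_counts`, `word_count`) and the sample sentences are made up for this sketch, and a thread pool on one machine stands in for the worker nodes of a real cluster.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor


def split_into_blocks(lines, block_size):
    """Split the input into fixed-size blocks, loosely like a file split across nodes."""
    return [lines[i:i + block_size] for i in range(0, len(lines), block_size)]


def map_block(block):
    """Map step: count the words inside one block, independently of the others."""
    counts = Counter()
    for line in block:
        counts.update(line.lower().split())
    return counts


def reduce_counts(partials):
    """Reduce step: merge the per-block counts into one total."""
    total = Counter()
    for partial in partials:
        total.update(partial)
    return total


def word_count(lines, block_size=2, workers=3):
    """Run the map step on every block in parallel, then reduce the results."""
    blocks = split_into_blocks(lines, block_size)
    # Hypothetical stand-in for cluster nodes: each thread handles one block at a time.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = list(pool.map(map_block, blocks))
    return reduce_counts(partials)


if __name__ == "__main__":
    sample = ["big data", "Big data is big", "hadoop stores data"]
    print(word_count(sample))
```

Each block is counted without knowing about the others, which is what lets the work spread across many machines; only the final merge needs to see all the partial results.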

Originally published at https://pywix.blogspot.com.

Written by Digital Classes