Image for post

In this article, we are going to discuss how Hadoop so Fastly store and retrieve the Data from the Bigdata. And Discuss Some Big Company like Facebook …

What is Hadoop?

Components of Hadoop: -

  1. HDFS (Hadoop Distributed File System)
  2. YARN

HDFS: It allows us to store data of various formats across a cluster.

YARN: We used for resource management in Hadoop. It allows parallel processing over the data

Hadoop Cluster: -

Image for post

Advantage of Hadoop: -

  • Scalable
  • Cost-effective
  • Flexible
  • Fast

Big Data:-

Some Facts :-

1. The New York Stock Exchange generates about one terabyte of new trade data per day.

2. The statistic shows that 500+terabytes of new data get ingested into the databases of social media site Facebook, every day. This data is mainly generated in terms of photo and video uploads, message exchanges, putting comments, etc.

Characteristics Of Big Data:-

ii. Velocity

iiiVariety

Benefits of Big Data Processing:-

ii. Early identification of risk to the product/services, if any

iii. Better operational efficiency

Data Analytics:-

Big data analytics refers to collecting, processing, cleaning, and analyzing large datasets to help organizations operationalize their big data.

Big data analytics tools and technology:-

  1. NoSQL databases
  2. MapReduce
  3. YARN
  4. Spark
  5. Tableau

The big challenges of big data:-

  1. Making big data accessible
  2. Maintaining quality data
  3. Keeping data secure
  4. Finding the right tools and platforms

Some Interesting Case study :-

  1. https://eng.uber.com/uber-big-data-platform/
  2. https://data-flair.training/blogs/data-science-at-netflix/