Big Data Resources :bar_chart:
Big data is the field concerned with analyzing, systematically extracting information from, or otherwise handling data sets that are too large or complex for traditional data-processing application software.
Basics :on:
- The Four V’s of Big Data (IBM)
- The Definition of Big Data (Oracle)
- Big Data in Science & Problem Solving
- Doug Cutting, ‘father’ of Hadoop, talks about big data tech evolution
- Big Data Facts - AnalyticsWeek
- MapReduce: Simplified Data Processing on Large Clusters - Google, Inc.
YouTube channels :computer:
Blogs & Articles :bookmark:
- What is a Lakehouse? - Databricks (January 2020)
- The Most Practical Big Data Use Cases Of 2016 - Bernard Marr
- http://timepasstechies.com/row-oriented-column-oriented-file-formats-hadoop/
- Spark performance tuning - Expedia (Medium blog)
Interview resources :question:
Hadoop resources :chart_with_downwards_trend:
- Hadoop - reliable, scalable, distributed computing.
- Hadoop Courses (Coursera)
- Hadoop - tutorialspoint
Spark
- Apache Spark - A unified analytics engine for large-scale data processing
- Apache Spark Courses (Coursera)
Hortonworks
- Hortonworks
HDInsight (Microsoft)
- HDInsight - Easy, cost-effective, enterprise-grade service for open source analytics
- Azure regions
- Microsoft Data Center Tour
Cloudera
- Cloudera
MapR
- MapR
AWS
- Big Data on AWS
Google
- Google Cloud
We hope these resources give you a roadmap to becoming proficient in big data analytics :v: