Welcome to Big Data Architecture
Topics
Big Data Architectural Patterns
- Introduction
- Big Data Architectural Principles
- Data Collection Layer
- Data Processing Layer
- Data Consumption Layer
- Putting it all together
- Design Patterns
Storage Layer
- Row-based vs Column-based File Formats
- Text-based File Formats
- Big Data File Formats
- File Compression Techniques in Big Data Systems
- Choosing a Database
HBase
- Why HBase?
- HBase Properties
- HBase Components
- HBase Architecture - Part 1
- HBase Architecture - Part 2
- CAP Theorem
- Cassandra: An introduction
Presto
- What is Presto?
Spark
- Garbage Collection in Spark
- Dynamic Coalescing in Spark
- Probabilistic Data Structures
Case Study
- Meesho Data Engineer Interview Questions
- InMobi Interview Questions
- Expedia ML Engineer Interview Question
- CTM System Design
- IHS Markit System Design Case Study (2020)
- ZS Associates Data Consultant System Design Case Study (2020)
- Uber System Design Question (2020)
- MockForce Interview Questions
Author: Kushal Luthra
Repo URL: https://github.com/kushal-luthra/big-data-architecture