New

Big Data

Spark, Hadoop, Kafka & the lakehouse

Spark internals and tuning, HDFS/YARN, Hive, Kafka, file formats, streaming and Delta Lake — with vendor flavours (Databricks, EMR, BigQuery, HDInsight).

  • Spark · Kafka
  • Lakehouse / Delta
  • Vendor flavours
  • Editorial solutions
  • Progress tracking

Sample questions

  • Transformation vs action in Spark
    Asked at Spark
    Junior
  • Why is a shuffle expensive?
    Asked at Spark
    Mid
  • Kafka consumer-group rebalancing
    Asked at Kafka
    Mid
  • How Delta Lake gives ACID on object storage
    Asked at Databricks
    Senior
Unlock full set