AWS Spark

What is Spark?

Apache Spark is an open-source distributed cluster-computing framework.
Spark provides many interfaces such as SparkCore, SparkSQL and Spark Streaming.
All of which, enable massive “in memory” parallel computing.

Spark advantages:

  1. Fast data processing
  2. In Memory

Spark use cases:

  1. Complex data pipelines
  2. streaming

Spark antipattern use cases

Operational DB

Our Spark Blogs

Architectures and meetups which includes Spark

English

Hebrew

Top Video English

Top Video Hebrew