
AWS Big Data Demystified #3 | Zeppelin + Spark SQL, JDBC + Thrift, Ganglia, R + SparkR + Livy

 

Meetup slides:

 

The video:

 

Post-lecture remarks (thanks to Eyal Trabelsi):

A small correction to lecture #3 regarding insert overwrite of partitions in Spark: according to Databricks, Spark does support overwriting individual partitions (see https://www.slideshare.net/databricks/whats-new-in-upcoming-apache-spark-23, page 54). This is available as of Spark v2.3.
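To make the correction concrete, here is a minimal PySpark sketch of a per-partition overwrite. It assumes Spark 2.3 or later and a file-based data source such as Parquet, and it relies on the `spark.sql.sources.partitionOverwriteMode` setting that was introduced in that release. The function name, `path`, and `partition_col` are hypothetical placeholders, not something shown in the lecture.

```python
# Sketch only: `spark` is an existing SparkSession (Spark 2.3+) and `df` is a
# DataFrame. `path` and `partition_col` are hypothetical placeholders.

PARTITION_OVERWRITE_KEY = "spark.sql.sources.partitionOverwriteMode"


def overwrite_touched_partitions(spark, df, path, partition_col):
    """Overwrite only the partitions that have rows in `df`."""
    # The default mode ("static") clears the existing partitions under
    # `path` before writing. "dynamic" replaces only the partitions that
    # actually receive data from `df` and leaves the others untouched.
    spark.conf.set(PARTITION_OVERWRITE_KEY, "dynamic")
    (
        df.write
        .mode("overwrite")
        .partitionBy(partition_col)
        .parquet(path)
    )
```

For example, if `df` holds rows for a single day, calling `overwrite_touched_partitions(spark, df, "s3://my-bucket/events", "dt")` (bucket and column names are made up) should rewrite only that day's `dt=...` directory. It is worth verifying this behavior on your own Spark version and table format before relying on it.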

 

Need to learn more about AWS big data (demystified)?

 

 



——————————————————————————————————————————

I put a lot of thought into these blog posts so that I can share the information in a clear and useful way. If you have any comments, thoughts, or questions, or you need someone to consult with, feel free to contact me:

https://www.linkedin.com/in/omid-vahdaty/


