Uncategorised

AWS EMR Cost & Performance Tuning

AWS EMR Cost & Performance Tuning

Lecturer: Omid Vahdaty 23.2.2026

In the session, we will provide you with some tips on tuning clusters based on monitoring and suggestions for best practices in Spark configuration.
We will explore how to optimize resource utilization, reduce job execution time, and improve overall stability of your workloads on Amazon EMR. You will learn how to identify bottlenecks using performance metrics, apply the right configuration to executors and memory, and leverage autoscaling and cost saving strategies.
By the end, you will walk away with actionable techniques to boost performance and lower cloud spend without compromising data reliability.

Language- Hebrew

Speaker: Omid Vahdaty
Omid is the Founder and CTO of Jutomate, a company that provides services and solutions in data, cloud, and AI.

He is an expert in data architecture, product innovation, and strategic engineering thinking, with over 20 years of experience across startups, global enterprises, and government organizations.

Video

Slides


——————————————————————————————————————————
I put a lot of thoughts into these blogs, so I could share the information in a clear and useful way.
If you have any comments, thoughts, questions, or you need someone to consult with,

feel free to contact me via LinkedIn – Omid Vahdaty:

architecture

Your Cloud Region Will Fail. Are You Ready?

Your Cloud Region Will Fail. Are You Ready?

Lecturer: Eran Greenbaum 18.1.2026

Cloud providers offer high availability, but regional failures still happen, and when they do, single-region architectures collapse.
In this session, we will break down why regional outages are inevitable, what they really mean for production systems, and how to design Disaster Recovery that actually works when things go wrong.
You’ll learn:

  • What Disaster Recovery really means in cloud environments
  • How to define realistic RTO and RPO targets
  • The 4 core Disaster Recovery strategy models
  • Common DR assumptions that fail in real outages
  • How to approach multi-region resilience in practice

About the lecturer:
Eran Greenbaum is a highly experienced technology professional with more than 15 years of experience across software development, DevOps practices, data architecture, and cloud engineering. Currently consulting as a freelancer, Eran’s primary focus is always on delivering practical, hands-on solutions driven by his genuine love for technology.

Video

Slides


——————————————————————————————————————————
I put a lot of thoughts into these blogs, so I could share the information in a clear and useful way.
If you have any comments, thoughts, questions, or you need someone to consult with,

feel free to contact me via LinkedIn – Omid Vahdaty:

architecture

My CDC First Architecture

My CDC First Architecture

Lecturer: Omid Vahdaty 24.10.2025

In a world where data constantly flows from operational systems, applications, and diverse cloud environments, a CDC (Change Data Capture) approach has become a fundamental pillar of any modern data architecture.

Part1
This session will explore the advantages and disadvantages of this approach, its critical role in accelerating real-time data integration, and the challenges of a “CDC-First” architecture – including load management, update ordering, and the separation of Bronze–Silver–Gold data layers.

We’ll also dive into a comparison between three key architectural paradigms:

  • Embedding Architecture – tailored for AI-driven and recommendation-based applications.
  • Analytical Architecture – focused on analytics systems and dashboards.
  • Operational Architecture – designed for mission-critical systems that require real-time consistency and freshness.

    We’ll discuss the pros and cons of each, how to combine them effectively, and how proper architectural planning can prevent future headaches, reduce costs, and accelerate business innovation.

    Speaker: Omid Vahdaty.
    Omid is the Founder and CTO of Jutomate, a company that provides services and solutions in data, cloud, and AI.

    He is an expert in data architecture, product innovation, and strategic engineering thinking, with over 20 years of experience across startups, global enterprises, and government organizations.

Video

Slides


——————————————————————————————————————————
I put a lot of thoughts into these blogs, so I could share the information in a clear and useful way.
If you have any comments, thoughts, questions, or you need someone to consult with,

feel free to contact me via LinkedIn – Omid Vahdaty:

analytics

Introduction to event based analytics

Introduction to event based analytics

Lecturer: Avi Lior 15.10.2025

Outline:
* The key differences between event based analytics (Google analytics, Amplitude, Mixpanel) and traditional data analytics
* An example of the infrastructure of a project that uses event based analytics using Amplitude (gathering events from the website, CRM, client app etc).
* Examples of dashboards, queries and reports that can be made with these tools and how they complement those of traditional data analytics
* Lastly, review how segments of users are synchronized to a marketing automation platform and how the feedback loop is maintained

Target audience:
* Product and data professionals who would like to learn about event based analytics and see if and how it can be combined with their work.

Language: English

About the lecturer:
My name is Avi Lior, I’m a veteran product consultant, specializing in data analysis and marketing. Through my work with global Fintech and Gaming B2C companies over the years, I’m always passionate about learning the unique requirements and characteristics of each company and help it find the right technological and operational solutions
https://www.linkedin.com/in/alior/

Video

Slides


——————————————————————————————————————————
I put a lot of thoughts into these blogs, so I could share the information in a clear and useful way.
If you have any comments, thoughts, questions, or you need someone to consult with,

feel free to contact me via LinkedIn – Omid Vahdaty:

NoSQL

Latency and External Database Caching

Latency and External Database Caching

Lecturer: Guy Shtub 11.6.2025

Description:
The talk covers the importance of low latency and explains how modern databases achieve low latency, different caching strategies, and using an external cache with a datastore vs. using an internal cache.

Lecturer: Guy Shtub, Head of Training at ScyllaDB
Language: English
About the lecturer:
Guy Shtub is Head of Training at ScyllaDB and holds a B.SC. degree in Software Engineering from Ben Gurion University. He co-founded two start-ups and is experienced in creating products that people love.

Video

Slides


——————————————————————————————————————————
I put a lot of thoughts into these blogs, so I could share the information in a clear and useful way.
If you have any comments, thoughts, questions, or you need someone to consult with,

feel free to contact me via LinkedIn – Omid Vahdaty: