Blog

BI

SQL Demystified

SQL Demystified - Introduction to SQL

Lecturer: Omid Vahdaty 29.6.2022

Part 1

A review of the basic SQL with live demo and practice.
SQL Topics:
1. select , where
2. group by , aggregations

Video

Slides

Part 2

A review of the basic SQL with live demo and practice.
SQL Topic- join tables.

Video

Slides


——————————————————————————————————————————
I put a lot of thoughts into these blogs, so I could share the information in a clear and useful way.
If you have any comments, thoughts, questions, or you need someone to consult with,

feel free to contact me via LinkedIn – Omid Vahdaty:

architecture

A gentle Introduction to Alluxio

A gentle Introduction to Alluxio

Lecturer: Vasista Polali 28.6.2022

An introduction to Open Source Alluxio.
The benefits of using Alluxio and problems with the current data ecosystem it can solve.
The various Data orchestration features alluxio has to offer, when and how best to use Alluxio and the use cases it is most suited for.
The API’s developed by Boolean UG that will enable users to easily configure, scale and secure an open source alluxio cluster, followed by a live Demo.
Lecturer: Vasista Polali, Founder@Boolean UG , Berlin, Germany.
Vasista is the founder of boolean UG, a start-up based in Berlin, Germany specializing in implementing Data Engineering and Data Science solutions based on Distributed computing and Cloud computing paradigms.

Video

Slides


——————————————————————————————————————————
I put a lot of thoughts into these blogs, so I could share the information in a clear and useful way.
If you have any comments, thoughts, questions, or you need someone to consult with,

feel free to contact me via LinkedIn – Omid Vahdaty:

Logo Jutomate
BI

How to connect BigQuery to Looker

Looker 101 Chapter 1

How to connect BigQuery to Looker

In this chapter, we are going to learn how to connect the Google Cloud Platform (GCP)  to our looker account. 
Let’s start!
1. Go to the Google Cloud Platform home page. From APIs & Services, Select Credentials

2. From CREATE CREDENTIALS, Select Service account

3. Fill in the Service account details
– Select a name for the Service account.
– Select Service account.
-Add description (optional).
– Press CREATE AND CONTINUE. 

4. In Grant this service account access to project -> Select a role-> BigQuery Admin. Select Continue.
5. Grant users access to this service account– optional. You will be able to add users later on if needed
Select  DONE

6. Go to the Credentials tab, under Service Accounts, and click on the account you’ve just created (in this example “Evya-1”)

7. In the keys tab, from Add Key, select Create new key.

8. Select Create. (A JSON file that contains the key will be downloaded to your PC)

9. Go to Looker homepage, choose Develop

10. Select Projects

11. Choose Admin ->Connections-> Add Connection

12. In connection Setting:
-Choose a Name.
-In the Dialect setting, Choose Google BigQuery Standard SQL.

In Project ID: 
            Get the project’s name of the BigQuery project that contains the dataset you want to work with.

            – Go to the Google Cloud Platform home page

            – From BigQuery, Select SQL workspace

            – Copy the name of the project you want  

            – Return to Looker and Enter it in the Project ID

In Dataset:            
       -Go to the Big Query homepage   and copy the dataset name you want to use.
       -Enter this name in the Dataset pane on the looker page.

In Service Account Email:
You will need the email address of the service account you created earlier on the Google Cloud Platform. 
-Go to the Google Cloud Platform homepage. 
Menu-> IAM & Admin -> Service Accounts.

-Find the account name and copy its Email address.
-Enter the email in the Service Account Email pane

In Service Account JSON/P12 File -> insert the JSON file you’ve downloaded earlier (Choose File-> select the file )
Click Add Connection

13. Let’s test it – 
Find your connection on the list and press the Test button.

Every test should become green – connect, kill and query

You are all set! 

Hope to see you in the next chapter- Creating a Project and Getting to know LookML

architecture

Alluxio – the good, the bad and the ugly

Alluxio - The good, the bad and the ugly

Lecturer: Alexander Leibzon 17.5.2022

In the lecture we learn what exactly is “Alluxio- the data orchestration layer”, go over the use cases we’re running in production in the past 2 years.
The good things (the actual pain points it solves), the bad things (and how we overcome it), and the ugly things (the actual tips and tricks people come to meetups for).

Video

Slides


——————————————————————————————————————————
I put a lot of thoughts into these blogs, so I could share the information in a clear and useful way.
If you have any comments, thoughts, questions, or you need someone to consult with,

feel free to contact me via LinkedIn – Omid Vahdaty:

ETL tools

The First No-Code ETL Python Integration is Here

The First No-Code ETL Python Integration is Here

Lecturers: Ariel Yosef & Ophir Prusak , 26.4.2022

A live demonstration of Rivery’s industry-first Python integration.
The Python integration allows you to:

  • Run custom Python code directly within a No-Code ETL Platform.
  • Easily get your data into (or out of) Python without the need to write any connectivity code in Python.
  • Transform your data on-the-fly

Lecturers:
Ophir Prusak, Product Marketing at Rivery
Ariel Yosef, Data Engineer at Jutomate

Video

Slides


——————————————————————————————————————————
I put a lot of thoughts into these blogs, so I could share the information in a clear and useful way.
If you have any comments, thoughts, questions, or you need someone to consult with,

feel free to contact me via LinkedIn – Omid Vahdaty: