Fast Analytics with Presto and Alluxio - A tale of two open source projects
Lecturer: Rongrong Zhong 26.7.2022
Presto is a popular analytics query engine that provides interactive analytics on configurable data sources.
While this architecture provides flexibility, query performance can be dramatically affected by variance in remote storage latency.
Alluxio is a virtual file system that can provide transparent caching for remote storage systems. Combining the two projects yields an analytical query engine that can query data from anywhere while caching the hot dataset locally. This architecture is used at many large companies, such as Meta, Uber, and ByteDance, and provides flexibility and speed at the same time.
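To make the idea of caching the hot dataset locally concrete, here is a minimal read-through cache sketch in Python. It is an illustration only and is not the Alluxio or Presto API: the names RemoteStore and ReadThroughCache, the file paths, and the LRU eviction policy are all assumptions made for this example.

```python
from collections import OrderedDict


class RemoteStore:
    """Hypothetical stand-in for slow remote storage; counts every read."""

    def __init__(self, blobs):
        self._blobs = dict(blobs)
        self.reads = 0

    def read(self, path):
        self.reads += 1  # each call here represents a slow remote round trip
        return self._blobs[path]


class ReadThroughCache:
    """Hypothetical local cache in front of a remote store (LRU eviction assumed)."""

    def __init__(self, remote, capacity):
        self._remote = remote
        self._capacity = capacity
        self._entries = OrderedDict()  # path -> data, least recently used first

    def read(self, path):
        if path in self._entries:
            # Cache hit: serve locally and mark the entry as recently used.
            self._entries.move_to_end(path)
            return self._entries[path]
        # Cache miss: fetch from remote storage, then keep a local copy.
        data = self._remote.read(path)
        self._entries[path] = data
        if len(self._entries) > self._capacity:
            # Evict the least recently used entry to stay within capacity.
            self._entries.popitem(last=False)
        return data


remote = RemoteStore({"/t/part-0": b"a", "/t/part-1": b"b", "/t/part-2": b"c"})
cache = ReadThroughCache(remote, capacity=2)

# "/t/part-0" is the hot file: it is read repeatedly and stays cached.
for path in ["/t/part-0", "/t/part-1", "/t/part-0",
             "/t/part-0", "/t/part-2", "/t/part-0"]:
    cache.read(path)

print(remote.reads)  # 3 remote reads for 6 requests
```

The caller only ever talks to the cache, which is what makes the caching transparent: repeated reads of the hot file never reach remote storage, so they are not exposed to its latency variance.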
Lecturer: Rongrong Zhong, a software engineer at Alluxio.
Before joining Alluxio, she was a software engineer at Meta. She has 10 years of experience working on databases, storage engines, and analytical query engines. She is a Presto committer and TSC member.
Video
Slides
——————————————————————————————————————————
I put a lot of thought into these blogs so that I can share the information in a clear and useful way.
If you have any comments, thoughts, questions, or you need someone to consult with,
feel free to contact me via LinkedIn – Omid Vahdaty: