Session Info

Session Title

Apache Drill - interactive query and analysis at scale

Session Type Tutorial

Duration 45 minutes

Session Description

Apache Drill is a distributed system for interactive analysis of large-scale datasets, inspired by Google’s Dremel technology. It aims to process petabytes of data and trillions of records in seconds.

Since its inception in mid-2012, Apache Drill has gained widespread interest in the community. This session focuses on how Apache Drill enables interactive analysis and query at scale. We first go through typical use cases and Drill's architecture, then cover the data flow, the query languages, and the supported data sources. The session closes with a live demo of Drill.


Michael Hausenblas (MapR Technologies)

Michael is Chief Data Engineer at MapR Technologies EMEA, where he helps people tap the potential of big data. His background is in large-scale data integration research and development, advocacy, and standardisation. He has experience with NoSQL databases and the Hadoop ecosystem. Michael speaks at events, blogs about big data, and writes articles and books on the topic. He is also a contributor to Apache Drill.