ablog

不器用で落着きのない技術者のメモ

Presto についてメモ

Presto is an open-source software project to develop a database query engine using the standard Structured Query Language (SQL).

Facebook commenced development efforts on Presto in 2012, and announced its release as open source for Apache Hadoop in 2013.[1][2] In 2014, Netflix disclosed they used Presto on 10 petabytes of data stored in the Amazon Simple Storage Service (S3).[3] Airbnb released the source to web interface software called Airpal for Presto in March, 2015.[4][5] In June 2015, data-warehousing company Teradata promoted its commercial support, using the Apache License for the software.[6]

Presto’s architecture is very similar to a classic database management system using cluster computing. It can be visualized as one coordinator node working in sync with multiple worker nodes. Clients submit SQL statements that get parsed and planned following which parallel tasks are scheduled to workers. Workers jointly process rows from the data sources and produce results that are returned to the client. This compares to the original Apache Hive execution model, which used the Hadoop MapReduce mechanism on each query. It is written in the Java programming language.[1]

Presto (SQL query engine) - Wikipedia

Presto currently has a rule based optimizer.

19.3. Logical Planner — Teradata Distribution of Presto 0.167-t.0.2 Documentation