
[SQL Engine] Presto

아르비스 2019. 2. 28. 11:41


Distributed SQL Query Engine for Big Data



Presto is an open source distributed SQL query engine for running interactive analytic queries against data sources of all sizes ranging from gigabytes to petabytes.

Presto was designed and written from the ground up for interactive analytics and approaches the speed of commercial data warehouses while scaling to the size of organizations like Facebook.


Presto is a distributed system that runs on a cluster of machines. A full installation includes a coordinator and multiple workers. Queries are submitted from a client such as the Presto CLI to the coordinator. The coordinator parses, analyzes and plans the query execution, then distributes the processing to the workers.
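The client-to-coordinator flow described above can be sketched in a few lines. This is a minimal illustration, not the Presto CLI's actual implementation. It assumes the coordinator's HTTP client protocol: a POST of the SQL text to /v1/statement with an X-Presto-User header, followed by GET requests to each nextUri in the response until none remains. The function names run_query and http_send, and the host name in the usage note, are hypothetical.

```python
import json
import urllib.request


def http_send(method, url, headers, body=None):
    """Send one HTTP request and return the decoded JSON response."""
    data = body.encode("utf-8") if body is not None else None
    request = urllib.request.Request(url, data=data, headers=headers, method=method)
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


def run_query(sql, coordinator_url, user, send=http_send):
    """Submit sql to the coordinator and collect every returned row.

    `send` is injectable so the paging logic can be exercised without a cluster.
    """
    headers = {"X-Presto-User": user}
    # The initial POST hands the SQL text to the coordinator, which plans the
    # query and distributes the work to the workers.
    result = send("POST", coordinator_url.rstrip("/") + "/v1/statement", headers, sql)
    rows = []
    while True:
        if "error" in result:
            raise RuntimeError(result["error"].get("message", "query failed"))
        # Each response may carry a batch of rows under "data".
        rows.extend(result.get("data", []))
        next_uri = result.get("nextUri")
        if next_uri is None:
            # No nextUri means the query has finished.
            return rows
        result = send("GET", next_uri, headers)
```

Against a running cluster, a call might look like run_query("SELECT 1", "http://coordinator.example:8080", "analyst"). The client only ever talks to the coordinator; the workers stay hidden behind it.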

Presto Installation Overview

Requirements

Presto has a few basic requirements:

  • Linux or Mac OS X
  • Java 8, 64-bit
  • Python 2.4+
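The OS and Java requirements above can be checked before installing with a small script. The sketch below is an illustration only, and its helper names are hypothetical. It assumes that `java -version` prints a line such as `java version "1.8.0_201"`, and that a 64-bit HotSpot JVM includes the text "64-Bit" in that output. It is written for Python 3 and does not check the Python requirement itself.

```python
import platform
import re
import subprocess


def is_supported_os(system_name):
    """Linux and Mac OS X (reported by Python as 'Darwin') are accepted."""
    return system_name in ("Linux", "Darwin")


def parse_java_major(version_output):
    """Extract the major Java version from `java -version` output.

    Legacy strings such as "1.8.0_201" map to 8; newer ones such as
    "11.0.2" map to 11. Returns None when no version string is found.
    """
    match = re.search(r'version "(\d+)(?:\.(\d+))?', version_output)
    if match is None:
        return None
    major = int(match.group(1))
    if major == 1 and match.group(2) is not None:
        return int(match.group(2))
    return major


def is_64bit_jvm(version_output):
    """Heuristic: 64-bit HotSpot builds mention '64-Bit' in their banner."""
    return "64-Bit" in version_output


def check_requirements(required_java_major=8):
    """Return a list of human-readable problems; empty means all checks passed."""
    problems = []
    if not is_supported_os(platform.system()):
        problems.append("unsupported OS: " + platform.system())
    try:
        completed = subprocess.run(
            ["java", "-version"],
            stdout=subprocess.PIPE,
            stderr=subprocess.PIPE,
            universal_newlines=True,
        )
        # The JVM writes its version banner to stderr.
        output = completed.stderr + completed.stdout
    except OSError:
        return problems + ["java executable not found on PATH"]
    if parse_java_major(output) != required_java_major:
        problems.append("Java %d not detected" % required_java_major)
    if not is_64bit_jvm(output):
        problems.append("JVM does not report itself as 64-bit")
    return problems
```

Calling check_requirements() on the target machine returns an empty list when the detected OS and JVM match the list above.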


Supported Connectors

  • Accumulo Connector
  • Black Hole Connector
  • Cassandra Connector
  • Hive Connector
  • Hive Security Configuration
  • JMX Connector
  • Kafka Connector
  • Kafka Connector Tutorial
  • Kudu Connector
  • Avro Schema Evolution Rules
  • Local File Connector
  • Memory Connector
  • MongoDB Connector
  • MySQL Connector
  • PostgreSQL Connector
  • Redis Connector
  • Redshift Connector
  • SQL Server Connector
  • System Connector
  • Thrift Connector
  • TPCDS Connector
  • TPCH Connector