Distributed SQL Query Engine for Big Data
- https://prestodb.io/
- 사용하는 곳 : facebook
- Document : https://prestodb.io/docs/current/
- git : https://github.com/prestodb/presto
- Licnese : Apache 2.0
- 사용 언어 : JAVA
- Java 8 Update 151 or higher (8u151+), 64-bit. Both Oracle JDK and OpenJDK are supported.
- Maven 3.3.9+ (for building)
- Python 2.4+ (for running with the launcher script)
Presto is an open source distributed SQL query engine for running interactive analytic queries against data sources of all sizes ranging from gigabytes to petabytes.
Presto was designed and written from the ground up for interactive analytics and approaches the speed of commercial data warehouses while scaling to the size of organizations like Facebook.
Presto is a distributed system that runs on a cluster of machines. A full installation includes a coordinator and multiple workers. Queries are submitted from a client such as the Presto CLI to the coordinator. The coordinator parses, analyzes and plans the query execution, then distributes the processing to the workers.
Requirements
Presto has a few basic requirements:
- Linux or Mac OS X
- Java 8, 64-bit
- Python 2.4+
지원 Connectors
- Accumulo Connector
- Black Hole Connector
- Cassandra Connector
- Hive Connector
- Hive Security Configuration
- JMX Connector
- Kafka Connector
- Kafka Connector Tutorial
- Kudu Connector
- Avro Schema Evolution Rules
- Local File Connector
- Memory Connector
- MongoDB Connector
- MySQL Connector
- PostgreSQL Connector
- Redis Connector
- Redshift Connector
- SQL Server Connector
- System Connector
- Thrift Connector
- TPCDS Connector
- TPCH Connector