Starburst Presto is now available in AWS! Learn More »

Latest Blog Post:

Starburst Enterprise Distribution of Presto 203e Now Available!

View Post »

Starburst Enterprise Distribution of Presto 203e Now Available!

Matt Fuller, Co-founder at Starburst Starburst is pleased to announce the availability of Presto 203e. The most notable features and additions include: AWS Glue Integration New Geospatial functions and improved Geospatial function performance Additional SQL subquery support Add SQL FILTER clause for aggregations Column level access control* Support for authentication with JWT access token Various…

Read More »

Querying data in S3 using Presto and Looker

Eric Whitlow, Technical Business Development With more and more companies using AWS for their many data processing and storage needs,  it’s never been easier to query this data with Starburst Presto on AWS and Looker, the quickly growing data analytics platform suite. Looker already provides a Presto connector which allows you to connect to a…

Read More »

Announcing the first-ever Presto Summit

Calling all Distributed SQL Fans! Presto Summit July 16th at Facebook Together with our friends at Facebook, we are excited to announce the first-ever Presto Summit! Since Presto’s introduction nearly five years ago, the open source community has steadily grown and the project has become the leading engine for SQL-on-Anything. We look forward to seeing…

Read More »

Starburst’s Presto on AWS up to 18x faster than EMR

Karol Sobczak & Anu Sudarsan, Co-Founders & Software Engineers at Starburst Introduction Last week, we announced the availability of Starburst’s Presto on AWS Marketplace. With this offering, one can deploy a Presto cluster and begin querying S3 data in a matter of minutes! Coupled with simple deployment, Presto is automatically configured for your AWS EC2 instances,…

Read More »

Presto Available on AWS Marketplace!

Matt Fuller, Co-founder at Starburst Today I am excited to announce the availability of Presto on AWS Marketplace by Starburst. The Presto AWS Marketplace offering is based on our 195e release which includes Presto’s state of the art Cost Based Optimizer (CBO) for superior performance. Presto on AWS Marketplace is available as both an Amazon…

Read More »

Data Lakes without Hadoop

It seems like migrating to the cloud has dominated the news and a lot of companies are shuttering their data centers and letting cloud providers handle it for them. Reasons such as elasticity, simplicity, and infrastructure agility are all great reasons but there are many companies that continue to host their own infrastructure. The reasons could be security or they believe the cloud doesn’t provide the cost benefits in their scenario.

For these companies, building a data lake usually means…

Read More »

Presto Memory Connector

There is a highly efficient connector for Presto! It works by storing all data in memory on Presto Worker nodes, which allow for extremely fast access times with high throughput while keeping CPU overhead at bare minimum.

Read More »

True Separation of Storage and Compute

For the last few years, the hot topic in any organization is the separation of storage and compute. With data volumes increasing on a daily basis as well as the types of data being stored, placing this data on a flexible storage medium such as HDFS and cloud object storage such as Amazon’s S3 and Azure’s Blob storage provides a company with great flexibility on when and where they consume this data.

Read More »

Presto Join Enumeration

Welcome back to the series of blog posts (checkout our previous post!) about Presto’s first Cost-Based Optimizer! Today let’s focus on the challenge of choosing the optimal join order. The order by which relations are joined affects performance of a query substantially. Poor join order might introduce unnecessary CPU and network overhead. To overcome that, the Starburst Presto release includes a state-of-the art join enumeration algorithm that will greatly benefit its users. Let’s first do a quick introduction how Presto join enumerator will speed up your common queries and then we will discuss the algorithm in more details.

Read More »