Presto Newsletter – April 2020

Welcome to the 21st issue of the Presto Newsletter. Tell your friends to sign up to get relevant news for the Presto community.

 

Events

2020 Presto Summit SF – May 21st, 2020

With the uncertainty that we’re all facing, we’re taking every precaution as we approach the upcoming Presto Summit SF. The safety of the Presto community is our priority. We’re excited to have presenters confirmed from Salesforce, Zuora, ARM Treasure Data, LinkedIn, Stitch Fix, Lyft, and Pinterest so far. We’ll make a decision in 2 weeks on whether this will be in-person in May, digital, or rescheduled.

Register Now to save your spot!

**We are currently accepting speaker submissions. If you’d like to present please send your request to summit@prestosql.io**

 

Webinar – Do More with Less: Force Multiply Your Analytics Infrastructure– (April 9th @ 1pm ET)

Overnight, the world changed. Budgets shrank. Priorities shifted. And perhaps now, more than ever, we need solid analytics to help guide our organizations. But how do we deliver in this climate?

Join us to discuss why Presto is the technology you should be looking at to fundamentally alter how your organization works with big data. Register Here

 

Webinar – New Release: Starburst Enterprise Presto 332-e – (April 14th @ 2pm ET)

Join Tom Nats, Director of Customer Solutions, as he introduces the latest release of Starburst Enterprise Presto: 332-e. This release is packed with new features that improve speed of data access with caching to accelerate Presto performance, and a Native Delta Lake Reader to provide fast SQL-based analytics to companies using Delta Lake, and improved security with Global Access Control for fine-grained access control across all data sources.

This 1-hour webinar will cover these features in detail and time will be provided to answer all your Presto questions. Register now, space is limited.

 

Webinar – Optimizing Data Analytics in a Hybrid Cloud World – (April 21st @ 1pm ET)

Most companies are not “born in the cloud.” They need to work with data that is scattered across operational databases, data warehouses, and data lakes, each of which may be in the corporate data center or the major public clouds. This distributed, dispersed nature of data should be embraced, as it is an integral part of analytics.

GigaOm and Starburst are partnering for this webinar to explore using a unified data consumption and access layer, like Presto, that is optimized for performance and ready for data distributed across platforms and physical locations. Register Here

 

Presto News & Knowledge

Establishing Universal Data Access At A Fortune 100 Telco Giant

Read this case study to learn how Starburst Enterprise Presto is helping a Fortune 100 telecommunications giant query federated data sources while transitioning to a cost-effective hybrid cloud data architecture.

 

Presto on Graviton 2 benchmarks

Graviton2-based Amazon EC2 M6g instances deliver up to 40 percent better price/performance compared with the current generation of M5 instance types. In this blog, Treasure Data explores the question: Can you achieve that price/performance for real-world applications? To test this, they turned to Presto.

 

Fighting COVID with Presto:

In a note from Starburst CEO & Co-founder, Justin Borgman, Starburst announced that they are offering a free license of Starburst Enterprise Presto for universities, hospitals, and healthcare research organizations, including Government and Non-Government Organizations (NGOs) who have teams dedicated to analyzing COVID-19 data. Read the note in its entirety in this post.

 

A Presto Data Pipeline with S3 on Kubernetes

This post presents a modern data warehouse implemented with Presto and FlashBlade S3; using Presto to ingest data and then transform it to a queryable data warehouse.

 

Starburst agrees, Kubernetes is a great place to run Presto… Now, which Kubernetes to choose when you go to the cloud?

StackRox took a wide-ranging look at the current features and limitations of the managed Kubernetes services from the three largest cloud service providers: Amazon’s Elastic Kubernetes Service (EKS), Microsoft’s Azure Kubernetes Service (AKS), and Google Kubernetes Engine (GKE). If you’re looking for information on running Presto on Kubernetes, read more here.

 

…Now if you are on premises though, try RedHat OpenShift

With Kubernetes at the core of OpenShift, this post explores what value-added features Red Hat brings to the table to make OpenShift a successful Enterprise Platform-as-a-Service (PaaS) everyone knows about from a developer perspective, but also the very reliable Container-as-a-Service from a production standpoint.

 

Podcast: Justin Borgman Interviews with The VentureFizz Podcast

Starburst CEO & Co-Founder, Justin Borgman, was recently interviewed for the  The VentureFizz Podcast. Listen as host, Keith Cline, and Justin discuss the significance of Presto and why Starburst exists.

 

Presto Releases & New Features 

Interested in querying your Teradata very fast?
Try our parallel direct connector

 

Did You Know?

  1. You can print out extra information using the presto-cli utility with the –progress flag:
    presto-cli –server 172.56.0.40:8080 –progress –e ‘select * from sometable limit 10’
    Query 20200330_110205_00001_sztvi, FINISHED, 1 node
    Splits: 34 total, 34 done (100.00%)
    0:05 [15 rows, 7.23MB] [2 rows/s, 1.43MB/s]
  2. When inserting into a table, Presto first writes into a temporary table then finally inserts into the target table. The reason is since multiple writers are writing in parallel, if a failure occurs, the results could be partial data written to the target.
  3. In Presto, double values are printed in scientific notation:

> select double ‘12312312123123.123’;

_col0         

———————–

 1.2312312123123123E13

 

These can be cast to decimal if needed:

select cast(double ‘12312312123123.123’ as decimal(20, 5));

        _col0         

———————-

 12312312123123.12300