Presto Newsletter – January 2020
Welcome to the 18th issue of the Presto Newsletter. Please sign up to get future issues delivered to your mailbox. We aim to track the relevant news in the Presto community and keep you updated every month.
Hands-on Presto: Fast SQL on Anything Free Training (Jan 9 @9am – Tel Aviv)
Join Big Data Boutique & other industry giants to learn everything you need to know about Presto SQL to get started querying and analyzing data on S3, HDFS and pretty much anywhere. Register to attend at the link above; space is limited.
Recap & Slides – Presto Summit NYC
The Presto Summit was hosted on December 11th, 2019 at The New York Academy of Sciences. The event was co-hosted by the Presto Software Foundation, Starburst, and supporters of the Presto project, Red Hat.
Read a recap of the day and view presentation slides from some of the largest Presto users.
Be sure to check back for presentation videos next week!
Presto News & Knowledge
As many know, Presto can be deployed in the cloud or on-premises; the technology is truly platform agnostic. Combining Presto with the cloud computing services provided by GCP means you are never short of resources and can keep gaining useful insights from your data. Walmart Labs describes setting up a 3-node, automatically autoscaling Presto cluster on Google Cloud Platform using Google Compute Engine, Instance Group and Autoscaler.
This post summarizes which Hive 3 features Presto already supports, covering all the work that went into Presto to achieve that. We also outline the next steps that lie ahead.
Google recently announced that table format projects Delta Lake and Apache Iceberg (Incubating) are now available in the latest version of Cloud Dataproc (version 1.5 Preview). This blog walks through what table formats are, why they are useful, and how to use them on Dataproc with some examples.
AWS announced that new instance types powered by the Arm-based AWS Graviton2 processor are designed to deliver up to 40% better price/performance. This article reports on what is needed to run Presto on the Arm-based platform and how much benefit can potentially be obtained from the Graviton processor.
Presto Releases & New Features
That includes Japan, Korea, Hong Kong, Australia, India, Singapore and more!
Did you know?
1 The UNNEST function can easily break out an array of data. For example, if you had a column named “b” that contains array data, such as [“5555555555”, “6666666666”], you can add the following to the FROM clause of your query:
CROSS JOIN UNNEST(b) AS t (b_split);
The results would be:
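As a self-contained sketch of the full query and its output, the example below uses an inline VALUES row in place of a real table. The names t1 and a, and the value 1, are illustrative assumptions rather than part of the original tip.

```sql
-- Build a one-row relation with an id column "a" and an array column "b",
-- then expand the array so each element becomes its own row.
SELECT a, b_split
FROM (
    VALUES (1, ARRAY['5555555555', '6666666666'])
) AS t1 (a, b)
CROSS JOIN UNNEST(b) AS t (b_split);

--  a |  b_split
-- ---+------------
--  1 | 5555555555
--  1 | 6666666666
```

Each array element is paired with the other columns of its source row, so a row holding a two-element array yields two output rows.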
2 When creating a table using the Hive connector, Presto creates one file per worker by default. The number of writers per worker can be adjusted using the “task_writer_count” session property.
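A minimal sketch of how this might look in practice is shown below. The catalog, schema, and table names are hypothetical, and the value 4 is only an illustration (the property is expected to be a power of two).

```sql
-- Assumption: raise the number of writers per worker for this session only.
SET SESSION task_writer_count = 4;

-- Hypothetical names: hive.default.orders and hive.default.orders_copy.
-- With more writers per worker, the new table is typically written
-- as more (and smaller) files.
CREATE TABLE hive.default.orders_copy AS
SELECT *
FROM hive.default.orders;
```

Because it is a session property, the setting applies only to queries run in the same session and does not change the cluster-wide default.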
3 You can easily convert unix time into UTC using:
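One way to sketch this, assuming the built-in from_unixtime function combined with AT TIME ZONE, is shown below. The sample epoch value is illustrative and not taken from the original newsletter.

```sql
-- 1577836800 seconds since the Unix epoch is midnight on 1 January 2020 (UTC).
SELECT from_unixtime(1577836800) AT TIME ZONE 'UTC' AS utc_time;

-- utc_time
-- ---------------------------
-- 2020-01-01 00:00:00.000 UTC
```

Pinning the zone with AT TIME ZONE keeps the displayed value independent of the session time zone.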