2019 NYC Presto Summit

December 11th, 2019

The New York Academy of Sciences Conference Center
New York City, NY

Hosted By

Starburst Data

The Presto Summit is coming to the East Coast!

 

Join the Presto community on December 11th at The New York Academy of Sciences for an all-day event focused on the world’s fastest distributed SQL query engine.
 
The Presto Summit continues to bring together the best developers, engineers, data scientists, and executives from the Presto community to share how some of the largest and most innovative companies are using this technology to power their analytics platforms.
 
This year’s New York Summit is hosted by The Presto Software Foundation, Starburst Data, and Red Hat.  

Speakers

This year’s NYC Summit will feature speakers from…

Martin Traverso
Co-Creator, Presto
Presto Software Foundation
Dain Sundstrom
Co-Creator, Presto
Presto Software Foundation
David Phillips
Co-Creator, Presto
Presto Software Foundation
Michael St. Jean
Principal Marketing Manager (Red Hat Storage)
Red Hat
Ashish Singh
Tech Lead, Data Compute Platform
Pinterest
Vinay Narayana
Associate Director, Big Data, Messaging & Data Warehousing
Wayfair
Krzysztof Antończak
Senior Data Engineer
OLX
Jakub Orłowski
Data Engineering Manager
OLX
Facundo Guerrero
Site Reliability Engineers Manager
OLX
Ivan Black
Director, Cloud Systems
FINRA
Haoyuan (H.Y.) Li
Founder, Chairman, & CTO
Alluxio
Sajumon Joseph
Principal Architect
Comcast
Cheolsoo Park
Data Engineer
Slack
Ajay Bhonsule
Sr. Engineering Mgr.
Slack
Kaycee Lai
CEO & Founder
Promethium
Ken Seier
Chief Architect and US Practice Lead for Data & AI
Insight’s Digital Innovation Team
Justin Borgman
CEO & Co-Founder
Starburst Data
Matt Fuller
VP of Engineering & Co-Founder
Starburst Data
Kamil Bajda-Pawlikowski
CTO & Co-Founder
Starburst Data

Agenda

9:00am - 10:00am Registration & Breakfast

Complimentary continental breakfast. Enjoy!

10:00am Opening Remarks

Join Red Hat for opening remarks to kick off the day!

 


Michael St-Jean

Red Hat

10:00am Keynote with the Creators of Presto

Join the creators of Presto and the founders of The Presto Software Foundation for an opening keynote where they’ll share recent developments and discuss the future direction of the project.

 

 


Martin Traverso, David Phillips, Dain Sundstrom

Presto Creators & Presto Software Foundation Founders

10:45am Presto at Wayfair

Presto has become the essential tool for data scientists and analysts at Wayfair, even though it is relatively new to the company. It was implemented about a year ago and has been actively used for the past 10 months. Attend this session to understand why Wayfair decided to implement Presto, how we architected our cluster, the configuration choices we made, some common issues we faced, and where we are heading next. Breaking down our problems and our approach with Presto will help describe some of the challenges we face and provide color on the decisions we’ve made.

 

 


Vinay Narayana

Wayfair

11:15am Query ANYTHING: Data Source Connectivity

Justin Borgman, CEO and Co-Founder of Starburst Data, will discuss the growing list of data sources that Presto can connect to and how this impacts data consumers. More details coming soon…

 

 


Justin Borgman

Starburst Data

11:45am Big Fast Queries with Presto on OpenShift

Next-generation data platforms are embracing the proliferation of technologies that help organizations discover, catalog, process, and derive insight from their data. OpenShift and OpenShift Container Storage are at the forefront of this transition and provide a foundation for building a self-service environment for developers, data engineers, and data scientists. In this demo, we’ll share how Starburst Presto on OpenShift can power your interactive and ad hoc data discovery. SQL on anything means fast, secure access to data in OpenShift Container Storage and federated access to data anywhere. With Starburst on OpenShift, you have access to the world’s fastest open source SQL query engine, enterprise-ready, across public and private clouds.

Presto, an open source distributed SQL engine, is widely recognized for its low-latency queries, high concurrency, and native ability to query multiple data sources. It has been proven at scale in a variety of use cases at Airbnb, Comcast, GrubHub, Facebook, FINRA, LinkedIn, Lyft, Netflix, Twitter, and Uber. In the last few years, Presto has experienced unprecedented growth in popularity in both on-premises and cloud deployments over object stores, HDFS, NoSQL, and RDBMS data stores.

 

 


Michael St-Jean

Red Hat

Kamil Bajda-Pawlikowski

Starburst Data

 

12:15pm Journey to the Cloud: FINRA, Starburst, and Presto

An interactive discussion on FINRA’s migration to the cloud and its partnership with Starburst to enhance Presto for enterprise data science and analytics at scale.

 

 


Ivan Black

FINRA

12:45pm Lunch

Join us for a complimentary lunch and networking hour.

1:00pm Workshop: Starburst Presto Live Demo

Details coming soon…

 

 


Matt Fuller

Starburst Data

1:40pm Presto to Presto (P2P) – Leverage remote Presto clusters for distributed queries

At Comcast, we have data that lives both in our on-premises data centers and in the cloud. Due to network connectivity and other access challenges, it was not practical for a single Presto cluster to have access to datasets in both the on-premises and cloud environments.

Join Sajumon Joseph as he discusses how Comcast implemented a Presto-to-Presto connector that gives end users access to remote datasets through a remote Presto cluster. The team also implemented secure access so that the user’s authentication is honored at the remote cluster.

 

 


Sajumon Joseph

Comcast

2:10pm Automating Data Discovery and Data Prep with Presto

Presto is great for fast queries and for federated queries across different data sources. But before data can be queried by Presto, it must first be discovered. This can be a lengthy process, as the data can reside in multiple locations across various data sources. Even after the data is discovered, it must be prepped to ensure that the proper data is combined and properly joined so it can be queried. This becomes an even more challenging manual process if the data consists of different tables and files across multiple data sources. Promethium aims to accelerate the data discovery and prep process with an AI-based approach so that it can be driven via NLP, reducing a potentially lengthy process from months to a matter of minutes. Attend this session to understand how users can instantly publish a virtual view in Presto to query using Promethium.

 

 


Kaycee Lai

Promethium

2:40pm Presto at Slack

Join Cheolsoo Park & Ajay Bhonsule as they highlight why Slack turned to Presto, and ultimately Starburst, to build its data platform. They’ll also look at where Slack aims to take its Presto use next.

 

 


Ajay Bhonsule

Slack

Cheolsoo Park

Slack

3:10pm Enabling Presto in the Cloud with Alluxio

Details coming soon…

 

 


H.Y. (Haoyuan) Li

Alluxio

3:40pm Refreshment Break

Enjoy complimentary refreshments and fuel up!

4:00pm Presto at Pinterest

As a data-driven company, Pinterest makes many critical business decisions based on insights from data. Presto has played a key role in enabling interactive querying at Pinterest. Operating Presto at Pinterest’s scale has involved resolving quite a few challenges. In this talk, Ashish Singh will share Pinterest’s journey of adopting, using, and enhancing Presto to meet Pinterest’s interactive querying needs.

 

 


Ashish Singh

Pinterest

4:30pm Operational Data Hub for OLX Markets

As a heavily data-driven company, we are constantly looking for solutions that help our teams provide a better customer experience.
We started an initiative to address some of the most important technical problems and limitations of the Data Organisation of OLX Markets.
A few months of research and development resulted in a solution based on S3, Presto, and Airflow that is used daily by most of our analysts, engineers, and data scientists, serving more than 300 million monthly active users of OLX.

In this presentation, we would like to share our motivation, requirements and our experiences gathered during our journey from a traditional Data Warehouse approach to the Presto-based solution.

 

 


Krzysztof Antończak

OLX

Jakub Orłowski

OLX

Facundo Guerrero

OLX

5:00pm Presto Drives Faster Business Results

Presto is fast becoming a centerpiece of the modern data estate. Join us as we review how Insight’s Digital Innovation team has accelerated innovation and time to value for AI development, reporting and platform migrations by leveraging Presto and how we see Presto continuing to shape the data landscape.

 

 


Ken Seier

Insight’s Digital Innovation Team

5:30pm Closing Remarks

Thank you for joining us!

6:30pm - 8:30pm Happy Hour

The day’s not over yet! Enjoy food and an open bar as you meet the hosts, speakers, and other attendees.

 

Details:

Yves

385 Greenwich St,

New York, NY 10013

Venue Details

Location

The New York Academy of Sciences Conference Center

250 Greenwich Street

New York, NY 10007

Sponsors

The Presto Software Foundation

The Presto Software Foundation is a non-profit organization with the singular mission of supporting a community of passionate users and developers devoted to the advancement of the Presto project for years to come.

Starburst Data 

Starburst Data is the enterprise Presto software company. Founded by some of the earliest contributors to the open source Presto project, Starburst provides enhanced Presto query performance, security, connectivity, and ease of use, while continuing to contribute back to the project.

Red Hat

Alluxio

Alluxio is an open source data orchestration platform for the cloud. Presto’s open source distributed SQL query engine coupled with Alluxio gives you better performance and multi-cloud capabilities for interactive analytic workloads.