Petabyte Scale Datalake Table Management with Ray, Arrow, Parquet, and S3

Thanks! Share it with your friends!

You disliked this video. Thanks for the feedback!

Published Nov 18, 2021

(Patrick Ames, Amazon)

Managing a data lake that your business depends on to continuously deliver critical insights can be a daunting task. From applying table upserts/deletes during log compaction to managing structural changes through schema evolution or repartitioning, there's a lot that can go wrong and countless trade-offs to weigh. Moreover, as the volume of data in individual tables grow to petabytes and beyond, the jobs that fulfill these tasks grow increasingly expensive, fail to complete on time, and entrench teams in operational burden. Scalability limits are reached and yesterday's corner cases become everyday realities. In this talk, we will discuss Amazon's progress toward resolving these issues in its S3-based data lake by leveraging Ray, Arrow, and Parquet. We will also review past approaches, subsequent lessons learned, goals met/missed, and anticipated future work.

Category: Management

Be the first to comment

08:58

How to make the Best Time Table? || Time Management for Students || Time Table kaise banaye

Download Video
28:55

How To Survive Getting Sh*t In The Face With An Arrow - Medieval Surgery Was A NIGHTMARE!!

Download Video
01:05

Best Structural Engineer Residential | Arrow Engineering

Download Video
00:15

Time Table || Student Time Table Management Skills

Download Video
00:11

Boiler room with a bow and arrow | KILLSHOT

Download Video
06:10

Shri AmarnathJi Yatra || Worst management Neelgarth (Baltal) || Arrow management statement on

Download Video
10:55

STARTER PACK POWER SPEC! BLAST ARROW!!

Download Video
00:12

Filling the freezer with a bow and arrow | KILLSHOT

Download Video
02:10

ISRAELI Arrow-3 has KILL Iranian ballistic missile

Download Video
00:48

Joe Rogan Shoots Arrow at Elon Musk's Cybertruck!

Download Video
03:29

Deca Roleplay Video - Lily Papadakis (Marshall) (Business Management)

Download Video
00:20

motivational video for libasna ll inspiration for upsc aspirants ll #short #upse #amitabhbachchan

Download Video
17:46

FreeStyle Libre 3 vs Dexcom G6 | Full Test & Review

Download Video
09:59

Learn Communication Skills | English Speaking | Management Insight #7

Download Video
1:43:58

How to Trade In Stocks By: Jesse Livermore. Complete Audiobook.

Download Video
06:52

Three Questions to Ask Yourself To Be a Great Leader

Download Video
07:35

find your unique style | Style Roots QUIZ + body types

Download Video

Sign in

Petabyte Scale Datalake Table Management with Ray, Arrow, Parquet, and S3