Jiatao Tao presentations

Deep Dive into Project Tungsten: Bringing Spark Closer to Bare Metal-(Josh Rosen, Databricks)

Spark Summit • 8 years ago

Coral & Transport UDFs: Building Blocks of a Postmodern Data Warehouse

Walaa Eldin Moustafa • 4 years ago

Cost-based Query Optimization in Apache Phoenix using Apache Calcite

Julian Hyde • 7 years ago

The Volcano/Cascades Optimizer

宇傅 • 5 years ago

Data profiling with Apache Calcite

Julian Hyde • 6 years ago

Accelerating query processing with materialized views in Apache Hive

DataWorks Summit • 6 years ago

ORC File - Optimizing Your Big Data

DataWorks Summit • 6 years ago

ORC 2015: Faster, Better, Smaller

DataWorks Summit • 8 years ago

Using Apache Arrow, Calcite, and Parquet to Build a Relational Cache

Dremio Corporation • 6 years ago

Hive Bucketing in Apache Spark with Tejas Patil

Databricks • 6 years ago

An Adaptive Execution Engine for Apache Spark with Carson Wang and Yucai Yu

Databricks • 6 years ago

Spark 2.x Troubleshooting Guide

IBM • 8 years ago

Apache Spark Data Source V2 with Wenchen Fan and Gengliang Wang

Databricks • 5 years ago

Lessons from the Field: Applying Best Practices to Your Apache Spark Applications with Silvio Fiorito

Databricks • 6 years ago

Parquet performance tuning: the missing guide

Ryan Blue • 7 years ago

Efficient Data Storage for Analytics with Parquet 2.0 - Hadoop Summit 2014

Julien Le Dem • 9 years ago

Data Source API in Spark

Databricks • 9 years ago

Why you should care about data layout in the file system with Cheng Lian and Vida Ha

Databricks • 6 years ago

Deep Dive: Memory Management in Apache Spark

Databricks • 7 years ago

Apache Spark in Depth: Core Concepts, Architecture & Internals

Anton Kirillov • 8 years ago