SlideShare une entreprise Scribd logo
1  sur  72
Télécharger pour lire hors ligne
Hive	Bucketing	in
Apache	Spark
Tejas	Patil
Facebook
Agenda
• Why	bucketing	?
• Why	is	shuffle	bad	?
• How	to	avoid	shuffle	?
• When	to	use	bucketing	?
• Spark’s	bucketing	support
• Bucketing	semantics	of	Spark	vs	Hive
• Hive	bucketing	support	in	Spark
• SQL	Planner	improvements
Why	bucketing	?
Table	A Table	B
.	.	.	.	.	.	 .	.	.	.	.	.	JOIN
Table	A Table	B
.	.	.	.	.	.	 .	.	.	.	.	.	JOIN
Broadcast	hash	join
- Ship	smaller	table	to	all	nodes
- Stream	the	other	table
Table	A Table	B
.	.	.	.	.	.	 .	.	.	.	.	.	JOIN
Broadcast	hash	join Shuffle	hash	join
- Ship	smaller	table	to	all	nodes
- Stream	the	other	table
- Shuffle	both	tables,
- Hash	smaller	one,	stream	the	
bigger	one
Table	A Table	B
.	.	.	.	.	.	 .	.	.	.	.	.	JOIN
Broadcast	hash	join Shuffle	hash	join
- Ship	smaller	table	to	all	nodes
- Stream	the	other	table
- Shuffle	both	tables,
- Hash	smaller	one,	stream	the	
bigger	one
Sort	merge	join
- Shuffle	both	tables,
- Sort	both	tables,	buffer	one,	
stream	the	bigger	one
Table	A Table	B
.	.	.	.	.	.	 .	.	.	.	.	.	JOIN
Broadcast	hash	join Shuffle	hash	join Sort	merge	join
- Ship	smaller	table	to	all	nodes
- Stream	the	other	table
- Shuffle	both	tables,
- Hash	smaller	one,	stream	the	
bigger	one
- Shuffle	both	tables,
- Sort	both	tables,	buffer	one,	
stream	the	bigger	one
Table	A Table	B
.	.	.	.	.	.	 .	.	.	.	.	.	
Sort	Merge	Join
Table	A Table	B
.	.	.	.	.	.	 .	.	.	.	.	.	
Shuffle ShuffleShuffleShuffle
Sort	Merge	Join
Table	A Table	B
.	.	.	.	.	.	 .	.	.	.	.	.	
Sort Sort Sort Sort
Shuffle ShuffleShuffleShuffle
Sort	Merge	Join
Table	A Table	B
.	.	.	.	.	.	 .	.	.	.	.	.	
Sort
Join
Sort
Join
Sort
Join
Sort
Join
Shuffle ShuffleShuffleShuffle
Sort	Merge	Join
Sort	Merge	Join
Why	is	shuffle	bad	?
Table	A Table	B
.	.	.	.	.	.	 .	.	.	.	.	.	
Disk Disk
Disk	IO	for	shuffle	outputs
Why	is	shuffle	bad	?
Table	A Table	B
.	.	.	.	.	.	 .	.	.	.	.	.	
Shuffle Shuffle Shuffle Shuffle
M	*	R	network	connections
Why	is	shuffle	bad	?
How	to	avoid	shuffle	?
student	s JOIN	attendance	a	
ON	s.id =	a.student_id
student	s	JOIN	results	r	
ON	s.id =	r.student_id
student	s	JOIN	course_registrationc
ON	s.id =	c.student_id
……..
……..
student	s	JOIN	attendance	a	
ON	s.id =	a.student_id
student	s	JOIN	results	r	
ON	s.id =	r.student_id
student	s	JOIN	attendance	a	
ON	s.id =	a.student_id
student	s	JOIN	results	r	
ON	s.id =	r.student_id
Pre-compute	at	
table	creation
Bucketing	=
pre-(shuffle	+	sort)	inputs
on	join	keys
Without	
bucketing
Job(s)	populating	input	table
Without	
bucketing
With
bucketing
Job(s)	populating	input	table
Performance	comparison
0
100
200
300
400
500
600
700
800
900
Before After	Bucketing
Reserved	CPU	time	(days)
0
1
2
3
4
5
6
7
8
Before After	Bucketing
Latency	(hours)
When	to	use	bucketing	?
• Tables	used	frequently	in	JOINs	with	same	key
• Student	->	student_id
• Employee	->	employee_id
• Users	->	user_id
When	to	use	bucketing	?
• Tables	used	frequently	in	JOINs	with	same	key
• Student	->	student_id
• Employee	->	employee_id
• Users	->	user_id
….	=>	Dimension	tables
When	to	use	bucketing	?
• Tables	used	frequently	in	JOINs	with	same	key
• Student	->	student_id
• Employee	->	employee_id
• Users	->	user_id
• Loading	daily	cumulative	tables
• Both	base	and	delta	tables	could	be	bucketed	on	a	common	column
When	to	use	bucketing	?
• Tables	used	frequently	in	JOINs	with	same	key
• Student	->	student_id
• Employee	->	employee_id
• Users	->	user_id
• Loading	daily	cumulative	tables
• Both	base	and	delta	tables	could	be	bucketed	on	a	common	column
• Indexing	capability
Spark’s	bucketing	support
Creation	of	bucketed	tables
df.write
.bucketBy(numBuckets,	 "col1",	...)
.sortBy("col1",	...)
.saveAsTable("bucketed_table")
via	Dataframe API
Creation	of	bucketed	tables
CREATE	TABLE	bucketed_table(
column1	INT,
...
)	USING	parquet
CLUSTERED	BY	(column1,	…)
SORTED	BY	(column1,	…)
INTO	`n`	BUCKETS
via	SQL	statement
Check	bucketing	spec
scala>	sparkContext.sql("DESC	FORMATTED	student").collect.foreach(println)
[#	col_name,data_type,comment]
[student_id,int,null]
[name,int,null]
[#	Detailed	Table	Information,,]
[Database,default,]
[Table,table1,]
[Owner,tejas,]
[Created,Fri May	12	08:06:33	PDT	2017,]
[Type,MANAGED,]
[Num Buckets,64,]
[Bucket	Columns,[`student_id`],]
[Sort	Columns,[`student_id`],]
[Properties,[serialization.format=1],]
[Serde Library,org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe,]
[InputFormat,org.apache.hadoop.mapred.SequenceFileInputFormat,]
Bucketing	config
SET	spark.sql.sources.bucketing.enabled=true
[SPARK-15453]	Extract	bucketing	info	in	
FileSourceScanExec
SELECT	*	FROM	tableA JOIN	tableB ON	tableA.i=	tableB.i AND	tableA.j=	tableB.j
[SPARK-15453]	Extract	bucketing	info	in	
FileSourceScanExec
SELECT	*	FROM	tableA JOIN	tableB ON	tableA.i=	tableB.i AND	tableA.j=	tableB.j
Sort	Merge	Join
Sort
Exchange
TableScan
Sort
Exchange
TableScan Already	bucketed	on	columns	(i,	j)
Shuffle	on	columns	(i,	j)
Sort	on	columns	(i,	j)
Join	on	[i,j],	[i,j]	respectively	of	relations
[SPARK-15453]	Extract	bucketing	info	in	
FileSourceScanExec
SELECT	*	FROM	tableA JOIN	tableB ON	tableA.i=	tableB.i AND	tableA.j=	tableB.j
Sort	Merge	Join
Sort
Exchange
TableScan
Sort
Exchange
TableScan Already	bucketed	on	columns	(i,	j)
Shuffle	on	columns	(i,	j)
Sort	on	columns	(i,	j)
Join	on	[i,j],	[i,j]	respectively	of	relations
[SPARK-15453]	Extract	bucketing	info	in	
FileSourceScanExec
SELECT	*	FROM	tableA JOIN	tableB ON	tableA.i=	tableB.i AND	tableA.j=	tableB.j
Sort	Merge	Join
TableScan TableScan
Sort	Merge	Join
TableScan TableScanSort
Exchange
Sort
Exchange
Bucketing	semantics	of
Spark	vs	Hive
Bucketing	semantics	of	Spark	vs	Hive
bucket	0 bucket	1
bucket
(n-1)
…..
Hive
my_bucketed_table
Bucketing	semantics	of	Spark	vs	Hive
bucket	0 bucket	1
bucket
(n-1)
…..
Hive Spark
bucket	0 bucket	1
bucket
(n-1)
…..
bucket	0
bucket	0
bucket	1
bucket
(n-1)bucket
(n-1)bucket
(n-1)
my_bucketed_tablemy_bucketed_table
Bucketing	semantics	of	Spark	vs	Hive
bucket	0 bucket	1
bucket
(n-1)
…..
Hive
M M M M M
R R R
…..
Bucketing	semantics	of	Spark	vs	Hive
bucket	0 bucket	1
bucket
(n-1)
…..
Hive Spark
M M M M M
R R R
…..
bucket	0 bucket	1
bucket
(n-1)
…..
M M M M M…..
bucket	0
bucket	0
bucket	1
bucket
(n-1)bucket
(n-1)bucket
(n-1)
Bucketing	semantics	of	Spark	vs	Hive
Hive Spark
Model Optimizes	 reads,	writes	are	costly Writes	are	cheaper,	 reads	are	
costlier
Bucketing	semantics	of	Spark	vs	Hive
Hive Spark
Model Optimizes	 reads,	writes	are	costly Writes	are	cheaper,	 reads	are	
costlier
Unit	of	bucketing A	single	file	represents	 a	 single	
bucket
A	collection	 of	files	together	
comprise	 a	single	bucket
Bucketing	semantics	of	Spark	vs	Hive
Hive Spark
Model Optimizes	 reads,	writes	are	costly Writes	are	cheaper,	 reads	are	
costlier
Unit	of	bucketing A	single	file	represents	 a	 single	
bucket
A	collection	 of	files	together	
comprise	 a	single	bucket
Sort	ordering Each	bucket	is	sorted	globally.	 This	
makes	it	easy	to	optimize	 queries	
reading	this	data
Each	file	is	individually	 sorted	 but	
the	“bucket”	as	a	whole	is	not	
globally	 sorted
Bucketing	semantics	of	Spark	vs	Hive
Hive Spark
Model Optimizes	 reads,	writes	are	costly Writes	are	cheaper,	 reads	are	
costlier
Unit	of	bucketing A	single	file	represents	 a	 single	
bucket
A	collection	 of	files	together	
comprise	 a	single	bucket
Sort	ordering Each	bucket	is	sorted	globally.	 This	
makes	it	easy	to	optimize	 queries	
reading	this	data
Each	file	is	individually	 sorted	 but	
the	“bucket”	as	a	whole	is	not	
globally	 sorted
Hashing	function Hive’s	 inbuilt	 hash Murmur3Hash
[SPARK-19256] Hive	bucketing	support
• Introduce	Hive’s	hashing	function	[SPARK-17495]
• Enable	creating	hive	bucketed	tables	[SPARK-17729]
• Support	Hive’s	`Bucketed	file	format`
• Propagate	Hive	bucketing	information	to	planner	[SPARK-17654]
• Expressing	outputPartitioning and	requiredChildDistribution
• Creating	empty	bucket	files	when	no	data	for	a	bucket
• Allow	Sort	Merge	join	over	tables	with	number	of	buckets	
multiple	of	each	other
• Support	N-way	Sort	Merge	Join
[SPARK-19256]	Hive	bucketing	support
• Introduce	Hive’s	hashing	function	[SPARK-17495]
• Enable	creating	hive	bucketed	tables	[SPARK-17729]
• Support	Hive’s	`Bucketed	file	format`
• Propagate	Hive	bucketing	information	to	planner	[SPARK-17654]
• Expressing	outputPartitioning and	requiredChildDistribution
• Creating	empty	bucket	files	when	no	data	for	a	bucket
• Allow	Sort	Merge	join	over	tables	with	number	of	buckets	
multiple	of	each	other
• Support	N-way	Sort	Merge	Join
Merged	in
upstream
[SPARK-19256]	Hive	bucketing	support
• Introduce	Hive’s	hashing	function	[SPARK-17495]
• Enable	creating	hive	bucketed	tables	[SPARK-17729]
• Support	Hive’s	`Bucketed	file	format`
• Propagate	Hive	bucketing	information	to	planner	[SPARK-17654]
• Expressing	outputPartitioning and	requiredChildDistribution
• Creating	empty	bucket	files	when	no	data	for	a	bucket
• Allow	Sort	Merge	join	over	tables	with	number	of	buckets	
multiple	of	each	other
• Support	N-way	Sort	Merge	Join
FB-prod	(6	months)
SQL	Planner	improvements
[SPARK-19122]	Unnecessary	shuffle+sort added	if	
join	predicates	ordering	differ	from	bucketing	and	
sorting	order
SELECT	*	FROM	a	JOIN	b	ON	a.x=b.xAND	a.y=b.y
[SPARK-19122]	Unnecessary	shuffle+sort added	if	
join	predicates	ordering	differ	from	bucketing	and	
sorting	order
Sort	Merge	Join
TableScan TableScan
SELECT	*	FROM	a	JOIN	b	ON	a.x=b.xAND	a.y=b.y
[SPARK-19122]	Unnecessary	shuffle+sort added	if	
join	predicates	ordering	differ	from	bucketing	and	
sorting	order
SELECT	*	FROM	a	JOIN	b	ON	a.x=b.xAND	a.y=b.y
Sort	Merge	Join
TableScan TableScan
SELECT	*	FROM	a	JOIN	b	ON	a.y=b.y AND	a.x=b.x
[SPARK-19122]	Unnecessary	shuffle+sort added	if	
join	predicates	ordering	differ	from	bucketing	and	
sorting	order
Sort	Merge	Join
Exchange
TableScan
Sort
Exchange
TableScan
Sort
Sort	Merge	Join
TableScan TableScan
SELECT	*	FROM	a	JOIN	b	ON	a.x=b.xAND	a.y=b.y SELECT	*	FROM	a	JOIN	b	ON	a.y=b.y AND	a.x=b.x
[SPARK-19122]	Unnecessary	shuffle+sort added	if	
join	predicates	ordering	differ	from	bucketing	and	
sorting	order
Sort	Merge	Join
Exchange
TableScan
Sort
Exchange
TableScan
Sort
Sort	Merge	Join
TableScan TableScan
SELECT	*	FROM	a	JOIN	b	ON	a.x=b.xAND	a.y=b.y SELECT	*	FROM	a	JOIN	b	ON	a.y=b.y AND	a.x=b.x
[SPARK-19122]	Unnecessary	shuffle+sort added	if	
join	predicates	ordering	differ	from	bucketing	and	
sorting	order
Sort	Merge	Join
TableScan TableScan
SELECT	*	FROM	a	JOIN	b	ON	a.x=b.xAND	a.y=b.y SELECT	*	FROM	a	JOIN	b	ON	a.y=b.y AND	a.x=b.x
Sort	Merge	Join
TableScan TableScan
[SPARK-17271]	Planner	adds	un-necessary	Sort	
even	if	child	ordering	is	semantically	same	as	
required	ordering
SELECT	*	FROM	table1	a	JOIN	table2	b	ON	a.col1=b.col1
[SPARK-17271]	Planner	adds	un-necessary	Sort	
even	if	child	ordering	is	semantically	same	as	
required	ordering
SELECT	*	FROM	table1	a	JOIN	table2	b	ON	a.col1=b.col1
Sort	Merge	Join
Sort
TableScan
Sort
TableScan Already	bucketed	on	column	`col1`
Sort	on	columns	(a.col1	and	b.col1)
Join	on	[col1],	[col1]	respectively	of	relations
[SPARK-17271]	Planner	adds	un-necessary	Sort	
even	if	child	ordering	is	semantically	same	as	
required	ordering
SELECT	*	FROM	table1	a	JOIN	table2	b	ON	a.col1=b.col1
Sort	Merge	Join
TableScan TableScan Already	bucketed	on	column	`col1`
Sort	on	columns	(a.col1	and	b.col1)Sort Sort
Join	on	[col1],	[col1]	respectively	of	relations
[SPARK-17271]	Planner	adds	un-necessary	Sort	
even	if	child	ordering	is	semantically	same	as	
required	ordering
SELECT	*	FROM	table1	a	JOIN	table2	b	ON	a.col1=b.col1
Sort	Merge	Join
TableScan TableScan
Sort Sort
Sort	Merge	Join
TableScan TableScan
[SPARK-17698]	Join	predicates	should	not	
contain	filter	clauses
SELECT	a.id,	b.id FROM	table1	a	FULL	OUTER	JOIN	table2	b	ON	a.id=	b.id AND	a.id='1'	AND	b.id='1'
[SPARK-17698]	Join	predicates	should	not	
contain	filter	clauses
SELECT	a.id,	b.id FROM	table1	a	FULL	OUTER	JOIN	table2	b	ON	a.id=	b.id AND	a.id='1'	AND	b.id='1'
Sort	Merge	Join
TableScan TableScan
Sort
Exchange
Sort
Exchange
Already	bucketed	on	column	[id]
Shuffle	on	columns	[id,	1,	id]	and	[id,	id,	1]
Sort	on	columns	[id,	id,	1]	and	[id,	1,	id]	
Join	on	[id,	id,	1],	[id,	1,	id]	respectively	of	relations
[SPARK-17698]	Join	predicates	should	not	
contain	filter	clauses
SELECT	a.id,	b.id FROM	table1	a	FULL	OUTER	JOIN	table2	b	ON	a.id =	b.id AND a.id='1'	AND b.id='1'
Sort	Merge	Join
TableScan TableScan
Sort
Exchange
Sort
Exchange
Already	bucketed	on	column	[id]
Shuffle	on	columns	[id,	1,	id]	and	[id,	id,	1]
Sort	on	columns	[id,	id,	1]	and	[id,	1,	id]	
Join	on	[id,	id,	1],	[id,	1,	id]	respectively	of	relations
[SPARK-17698]	Join	predicates	should	not	
contain	filter	clauses
SELECT	a.id,	b.id FROM	table1	a	FULL	OUTER	JOIN	table2	b	ON	a.id =	b.id AND a.id='1'	AND b.id='1'
Sort	Merge	Join
TableScan TableScan
Sort
Exchange
Sort
Exchange
Already	bucketed	on	column	[id]
Shuffle	on	columns	[id,	1,	id]	and	[id,	id,	1]
Sort	on	columns	[id,	id,	1]	and	[id,	1,	id]	
Join	on	[id,	id,	1],	[id,	1,	id]	respectively	of	relations
[SPARK-17698]	Join	predicates	should	not	
contain	filter	clauses
SELECT	a.id,	b.id FROM	table1	a	FULL	OUTER	JOIN	table2	b	ON	a.id =	b.id AND a.id='1'	AND b.id='1'
Sort	Merge	Join
TableScan TableScan
Sort	Merge	Join
TableScan TableScanSort
Exchange
Sort
Exchange
[SPARK-13450]	Introduce	
ExternalAppendOnlyUnsafeRowArray
[SPARK-13450]	Introduce	
ExternalAppendOnlyUnsafeRowArray
Buffered	
relation
Streamed	
relation
If	there	are	a	lot	of	rows	to	be	
buffered	(eg.	Skew),	
the	query	would	OOM
row	with	
key=`foo`
rows	with	
key=`foo`
[SPARK-13450]	Introduce	
ExternalAppendOnlyUnsafeRowArray
Buffered	
relation
Streamed	
relation
row	with	
key=`foo`
rows	with	
key=`foo`
Disk
buffer	would	spill	to	disk	
after	reaching	a	threshold
spill
Summary
• Shuffle	and	sort	is	costly	due	to	disk	and	network	IO
• Bucketing	will	pre-(shuffle	and	sort)	the	inputs
• Expect	at	least	2-5x	gains	after	bucketing	input	tables	for	joins
• Candidates	for	bucketing:
• Tables	used	frequently	in	JOINs	with	same	key
• Loading	of	data	in	cumulative	tables
• Spark’s	bucketing	is	not	compatible	with	Hive’s	bucketing
Questions	?

Contenu connexe

Tendances

Understanding Query Plans and Spark UIs
Understanding Query Plans and Spark UIsUnderstanding Query Plans and Spark UIs
Understanding Query Plans and Spark UIsDatabricks
 
Dynamic Partition Pruning in Apache Spark
Dynamic Partition Pruning in Apache SparkDynamic Partition Pruning in Apache Spark
Dynamic Partition Pruning in Apache SparkDatabricks
 
Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San J...
Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San J...Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San J...
Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San J...Databricks
 
Apache Hudi: The Path Forward
Apache Hudi: The Path ForwardApache Hudi: The Path Forward
Apache Hudi: The Path ForwardAlluxio, Inc.
 
Amazon S3 Best Practice and Tuning for Hadoop/Spark in the Cloud
Amazon S3 Best Practice and Tuning for Hadoop/Spark in the CloudAmazon S3 Best Practice and Tuning for Hadoop/Spark in the Cloud
Amazon S3 Best Practice and Tuning for Hadoop/Spark in the CloudNoritaka Sekiyama
 
How to build a streaming Lakehouse with Flink, Kafka, and Hudi
How to build a streaming Lakehouse with Flink, Kafka, and HudiHow to build a streaming Lakehouse with Flink, Kafka, and Hudi
How to build a streaming Lakehouse with Flink, Kafka, and HudiFlink Forward
 
Apache Spark Data Source V2 with Wenchen Fan and Gengliang Wang
Apache Spark Data Source V2 with Wenchen Fan and Gengliang WangApache Spark Data Source V2 with Wenchen Fan and Gengliang Wang
Apache Spark Data Source V2 with Wenchen Fan and Gengliang WangDatabricks
 
Productizing Structured Streaming Jobs
Productizing Structured Streaming JobsProductizing Structured Streaming Jobs
Productizing Structured Streaming JobsDatabricks
 
Hudi architecture, fundamentals and capabilities
Hudi architecture, fundamentals and capabilitiesHudi architecture, fundamentals and capabilities
Hudi architecture, fundamentals and capabilitiesNishith Agarwal
 
A Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and HudiA Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and HudiDatabricks
 
Common Strategies for Improving Performance on Your Delta Lakehouse
Common Strategies for Improving Performance on Your Delta LakehouseCommon Strategies for Improving Performance on Your Delta Lakehouse
Common Strategies for Improving Performance on Your Delta LakehouseDatabricks
 
Designing ETL Pipelines with Structured Streaming and Delta Lake—How to Archi...
Designing ETL Pipelines with Structured Streaming and Delta Lake—How to Archi...Designing ETL Pipelines with Structured Streaming and Delta Lake—How to Archi...
Designing ETL Pipelines with Structured Streaming and Delta Lake—How to Archi...Databricks
 
Apache Spark Core—Deep Dive—Proper Optimization
Apache Spark Core—Deep Dive—Proper OptimizationApache Spark Core—Deep Dive—Proper Optimization
Apache Spark Core—Deep Dive—Proper OptimizationDatabricks
 
Native Support of Prometheus Monitoring in Apache Spark 3.0
Native Support of Prometheus Monitoring in Apache Spark 3.0Native Support of Prometheus Monitoring in Apache Spark 3.0
Native Support of Prometheus Monitoring in Apache Spark 3.0Databricks
 
Bucketing 2.0: Improve Spark SQL Performance by Removing Shuffle
Bucketing 2.0: Improve Spark SQL Performance by Removing ShuffleBucketing 2.0: Improve Spark SQL Performance by Removing Shuffle
Bucketing 2.0: Improve Spark SQL Performance by Removing ShuffleDatabricks
 
InfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in...
InfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in...InfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in...
InfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in...InfluxData
 
Spark + Parquet In Depth: Spark Summit East Talk by Emily Curtin and Robbie S...
Spark + Parquet In Depth: Spark Summit East Talk by Emily Curtin and Robbie S...Spark + Parquet In Depth: Spark Summit East Talk by Emily Curtin and Robbie S...
Spark + Parquet In Depth: Spark Summit East Talk by Emily Curtin and Robbie S...Spark Summit
 
Making Apache Spark Better with Delta Lake
Making Apache Spark Better with Delta LakeMaking Apache Spark Better with Delta Lake
Making Apache Spark Better with Delta LakeDatabricks
 
Spark Shuffle Deep Dive (Explained In Depth) - How Shuffle Works in Spark
Spark Shuffle Deep Dive (Explained In Depth) - How Shuffle Works in SparkSpark Shuffle Deep Dive (Explained In Depth) - How Shuffle Works in Spark
Spark Shuffle Deep Dive (Explained In Depth) - How Shuffle Works in SparkBo Yang
 
Speed Up Uber's Presto with Alluxio
Speed Up Uber's Presto with AlluxioSpeed Up Uber's Presto with Alluxio
Speed Up Uber's Presto with AlluxioAlluxio, Inc.
 

Tendances (20)

Understanding Query Plans and Spark UIs
Understanding Query Plans and Spark UIsUnderstanding Query Plans and Spark UIs
Understanding Query Plans and Spark UIs
 
Dynamic Partition Pruning in Apache Spark
Dynamic Partition Pruning in Apache SparkDynamic Partition Pruning in Apache Spark
Dynamic Partition Pruning in Apache Spark
 
Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San J...
Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San J...Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San J...
Everyday I'm Shuffling - Tips for Writing Better Spark Programs, Strata San J...
 
Apache Hudi: The Path Forward
Apache Hudi: The Path ForwardApache Hudi: The Path Forward
Apache Hudi: The Path Forward
 
Amazon S3 Best Practice and Tuning for Hadoop/Spark in the Cloud
Amazon S3 Best Practice and Tuning for Hadoop/Spark in the CloudAmazon S3 Best Practice and Tuning for Hadoop/Spark in the Cloud
Amazon S3 Best Practice and Tuning for Hadoop/Spark in the Cloud
 
How to build a streaming Lakehouse with Flink, Kafka, and Hudi
How to build a streaming Lakehouse with Flink, Kafka, and HudiHow to build a streaming Lakehouse with Flink, Kafka, and Hudi
How to build a streaming Lakehouse with Flink, Kafka, and Hudi
 
Apache Spark Data Source V2 with Wenchen Fan and Gengliang Wang
Apache Spark Data Source V2 with Wenchen Fan and Gengliang WangApache Spark Data Source V2 with Wenchen Fan and Gengliang Wang
Apache Spark Data Source V2 with Wenchen Fan and Gengliang Wang
 
Productizing Structured Streaming Jobs
Productizing Structured Streaming JobsProductizing Structured Streaming Jobs
Productizing Structured Streaming Jobs
 
Hudi architecture, fundamentals and capabilities
Hudi architecture, fundamentals and capabilitiesHudi architecture, fundamentals and capabilities
Hudi architecture, fundamentals and capabilities
 
A Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and HudiA Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and Hudi
 
Common Strategies for Improving Performance on Your Delta Lakehouse
Common Strategies for Improving Performance on Your Delta LakehouseCommon Strategies for Improving Performance on Your Delta Lakehouse
Common Strategies for Improving Performance on Your Delta Lakehouse
 
Designing ETL Pipelines with Structured Streaming and Delta Lake—How to Archi...
Designing ETL Pipelines with Structured Streaming and Delta Lake—How to Archi...Designing ETL Pipelines with Structured Streaming and Delta Lake—How to Archi...
Designing ETL Pipelines with Structured Streaming and Delta Lake—How to Archi...
 
Apache Spark Core—Deep Dive—Proper Optimization
Apache Spark Core—Deep Dive—Proper OptimizationApache Spark Core—Deep Dive—Proper Optimization
Apache Spark Core—Deep Dive—Proper Optimization
 
Native Support of Prometheus Monitoring in Apache Spark 3.0
Native Support of Prometheus Monitoring in Apache Spark 3.0Native Support of Prometheus Monitoring in Apache Spark 3.0
Native Support of Prometheus Monitoring in Apache Spark 3.0
 
Bucketing 2.0: Improve Spark SQL Performance by Removing Shuffle
Bucketing 2.0: Improve Spark SQL Performance by Removing ShuffleBucketing 2.0: Improve Spark SQL Performance by Removing Shuffle
Bucketing 2.0: Improve Spark SQL Performance by Removing Shuffle
 
InfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in...
InfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in...InfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in...
InfluxDB IOx Tech Talks: Query Engine Design and the Rust-Based DataFusion in...
 
Spark + Parquet In Depth: Spark Summit East Talk by Emily Curtin and Robbie S...
Spark + Parquet In Depth: Spark Summit East Talk by Emily Curtin and Robbie S...Spark + Parquet In Depth: Spark Summit East Talk by Emily Curtin and Robbie S...
Spark + Parquet In Depth: Spark Summit East Talk by Emily Curtin and Robbie S...
 
Making Apache Spark Better with Delta Lake
Making Apache Spark Better with Delta LakeMaking Apache Spark Better with Delta Lake
Making Apache Spark Better with Delta Lake
 
Spark Shuffle Deep Dive (Explained In Depth) - How Shuffle Works in Spark
Spark Shuffle Deep Dive (Explained In Depth) - How Shuffle Works in SparkSpark Shuffle Deep Dive (Explained In Depth) - How Shuffle Works in Spark
Spark Shuffle Deep Dive (Explained In Depth) - How Shuffle Works in Spark
 
Speed Up Uber's Presto with Alluxio
Speed Up Uber's Presto with AlluxioSpeed Up Uber's Presto with Alluxio
Speed Up Uber's Presto with Alluxio
 

Plus de Databricks

DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDatabricks
 
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Databricks
 
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Databricks
 
Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Databricks
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Databricks
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of HadoopDatabricks
 
Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDatabricks
 
Learn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceLearn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceDatabricks
 
Why APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringWhy APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringDatabricks
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixThe Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixDatabricks
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationDatabricks
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchDatabricks
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesScaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesDatabricks
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesScaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesDatabricks
 
Sawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsSawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsDatabricks
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkRedis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkDatabricks
 
Re-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkRe-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkDatabricks
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesDatabricks
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkDatabricks
 
Massive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeMassive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeDatabricks
 

Plus de Databricks (20)

DW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptxDW Migration Webinar-March 2022.pptx
DW Migration Webinar-March 2022.pptx
 
Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1Data Lakehouse Symposium | Day 1 | Part 1
Data Lakehouse Symposium | Day 1 | Part 1
 
Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2Data Lakehouse Symposium | Day 1 | Part 2
Data Lakehouse Symposium | Day 1 | Part 2
 
Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2Data Lakehouse Symposium | Day 2
Data Lakehouse Symposium | Day 2
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop
 
Democratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized PlatformDemocratizing Data Quality Through a Centralized Platform
Democratizing Data Quality Through a Centralized Platform
 
Learn to Use Databricks for Data Science
Learn to Use Databricks for Data ScienceLearn to Use Databricks for Data Science
Learn to Use Databricks for Data Science
 
Why APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML MonitoringWhy APM Is Not the Same As ML Monitoring
Why APM Is Not the Same As ML Monitoring
 
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch FixThe Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix
 
Stage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI IntegrationStage Level Scheduling Improving Big Data and AI Integration
Stage Level Scheduling Improving Big Data and AI Integration
 
Simplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorchSimplify Data Conversion from Spark to TensorFlow and PyTorch
Simplify Data Conversion from Spark to TensorFlow and PyTorch
 
Scaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on KubernetesScaling your Data Pipelines with Apache Spark on Kubernetes
Scaling your Data Pipelines with Apache Spark on Kubernetes
 
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark PipelinesScaling and Unifying SciKit Learn and Apache Spark Pipelines
Scaling and Unifying SciKit Learn and Apache Spark Pipelines
 
Sawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature AggregationsSawtooth Windows for Feature Aggregations
Sawtooth Windows for Feature Aggregations
 
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkRedis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink
 
Re-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and SparkRe-imagine Data Monitoring with whylogs and Spark
Re-imagine Data Monitoring with whylogs and Spark
 
Raven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction QueriesRaven: End-to-end Optimization of ML Prediction Queries
Raven: End-to-end Optimization of ML Prediction Queries
 
Processing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache SparkProcessing Large Datasets for ADAS Applications using Apache Spark
Processing Large Datasets for ADAS Applications using Apache Spark
 
Massive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta LakeMassive Data Processing in Adobe Using Delta Lake
Massive Data Processing in Adobe Using Delta Lake
 

Dernier

GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样vhwb25kk
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Jack DiGiovanna
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceSapana Sha
 
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSINGmarianagonzalez07
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 

Dernier (20)

GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
1:1定制(UQ毕业证)昆士兰大学毕业证成绩单修改留信学历认证原版一模一样
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
Building on a FAIRly Strong Foundation to Connect Academic Research to Transl...
 
Call Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts ServiceCall Girls In Dwarka 9654467111 Escorts Service
Call Girls In Dwarka 9654467111 Escorts Service
 
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
2006_GasProcessing_HB (1).pdf HYDROCARBON PROCESSING
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Defence Colony Delhi 💯Call Us 🔝8264348440🔝
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 

Hive Bucketing in Apache Spark with Tejas Patil