SlideShare une entreprise Scribd logo
1  sur  45
Télécharger pour lire hors ligne
https://portal.futuregrid.org 
Big Data in the Cloud: 
Research and Education
September 9 2013
PPAM 2013 Warsaw
Geoffrey Fox
gcf@indiana.edu
http://www.infomall.org http://www.futuregrid.org
School of Informatics and Computing
Community Grids Laboratory
Indiana University Bloomington
https://portal.futuregrid.org 
Some Issues to Discuss Today
• Economic Imperative: There are a lot of data and a lot of 
jobs
• Computing Model: Industry adopted clouds which are 
attractive for data analytics. HPC also useful in some cases
• Progress in scalable robust Algorithms: new data need 
different algorithms than before
• Progress in Data Intensive Programming Models
• Progress in Data Science Education: opportunities at 
universities
2
https://portal.futuregrid.org 
Data Deluge
3
https://portal.futuregrid.org  4Meeker/Wu May 29 2013 Internet Trends D11 Conference 
IP Traffic per year ~ 12% Total Created
https://portal.futuregrid.org  5Meeker/Wu May 29 2013 Internet Trends D11 Conference 
https://portal.futuregrid.org 
Some Data sizes
~40 109 Web pages at ~300 kilobytes each = 10
Petabytes
LHC 15 petabytes per year
Radiology 69 petabytes per year
Square Kilometer Array Telescope will be 100
terabits/second; LSST Survey >20TB per day
Earth Observation becoming ~4 petabytes per year
Earthquake Science – few terabytes total today
PolarGrid – 100’s terabytes/year becoming petabytes
Exascale simulation data dumps – terabytes/second
Deep Learning to train self driving car; 100 million
megapixel images ~ 100 terabytes
6
https://portal.futuregrid.org  7
4
8
3
10
51 NIST Big Data Use Cases
http://bigdatawg.nist.gov/usecases.php
https://portal.futuregrid.org  8
6
4
5
10
1
51 NIST Big Data Use Cases
http://bigdatawg.nist.gov/usecases.php
https://portal.futuregrid.org 
Jobs
9
https://portal.futuregrid.org 
Jobs v. Countries
10
http://www.microsoft.com/en‐us/news/features/2012/mar12/03‐05CloudComputingJobs.aspx
https://portal.futuregrid.org 
McKinsey Institute on Big Data Jobs
• There will be a shortage of talent necessary for organizations to take 
advantage of big data. By 2018, the United States alone could face a 
shortage of 140,000 to 190,000 people with deep analytical skills as well as 
1.5 million managers and analysts with the know‐how to use the analysis of 
big data to make effective decisions.
• At IU, Informatics aimed at 1.5 million jobs. Computer Science covers the 
140,000 to 190,000
11
http://www.mckinsey.com/mgi/publications/big_data/index.asp.
https://portal.futuregrid.org  12Meeker/Wu May 29 2013 Internet Trends D11 Conference 
https://portal.futuregrid.org  13Meeker/Wu May 29 2013 Internet Trends D11 Conference 
https://portal.futuregrid.org 
Computing Model
Industry adopted clouds which are 
attractive for data analytics
14
https://portal.futuregrid.org 
5 years Cloud Computing
2 years Big Data Transformational
https://portal.futuregrid.org 
Amazon making money
• It took Amazon Web Services (AWS) eight 
years to hit $650 million in revenue, according 
to Citigroup in 2010. 
• Just three years later, Macquarie Capital 
analyst Ben Schachter estimates that AWS will 
top $3.8 billion in 2013 revenue, up from $2.1 
billion in 2012 (estimated), valuing the AWS 
business at $19 billion.  
https://portal.futuregrid.org 
Physically Clouds are Clear
• A bunch of computers in an efficient data center with an 
excellent Internet connection
• They were produced to meet need of public‐facing Web 
2.0 e‐Commerce/Social Networking sites
• They can be considered as “optimal giant data center” 
plus internet connection
• Note enterprises use private clouds that are giant data 
centers but not optimized for Internet access
• Exascale build‐out of commercial cloud infrastructure: for 
2014‐15 expect 10,000,000 new servers and 10 Exabytes
of storage in major commercial cloud data centers 
worldwide. 
https://portal.futuregrid.org 
Data Intensive Applications and 
Programming Models
18
https://portal.futuregrid.org 
Clouds & Data Intensive Applications
• Applications tend to be new and so can consider emerging 
technologies such as clouds
• Do not have lots of small messages but rather large reduction (aka 
Collective) operations
– New optimizations e.g. for huge messages
• “Large Scale Optimization”: Deep Learning, Social Image 
Organization, Clustering and Multidimensional Scaling which are 
variants of EM 
• EM (expectation maximization) tends to be good for clouds and 
Iterative MapReduce
– Quite complicated computations (so compute largish compared to 
communicate)
– Communication is Reduction operations (global sums or linear) or Broadcast
• Machine Learning has FULL Matrix kernels
19
https://portal.futuregrid.org 
Some (NIST)Large Data mining Problems I
• Find W’s by iteration 
(Steepest Descent method)
• Find 11 Billion W’s from 10 
million images = 9 layer NN
• “Pure” Full Matrix 
Multiplication 
MPI+GPU gets near 
optimal performance
• GPU+MPI 100 times 
previous Google work
• Note Datamining
often gives full matrices 
• http://salsahpc.indiana.edu/summerworkshop2013/index.html
• Deep Learning: (Google/Stanford) Recognize features such as bikes 
or faces with a learning network
45
1
2
4
8
16
32
64
1 4 9 16 36 64
Factor Speedup
# GPUs
11.2B
6.9B
3.0B
1.9B
680M
185M
Linear
https://portal.futuregrid.org 
Protein Universe Browser for COG Sequences with a 
few illustrative biologically identified clusters
• Dimension reduction 
MDS for visualization 
and clustering in non 
metric spaces
• O(N2) algorithms 
with full matrices
• Important Online 
(interpolation) 
methods
• Expectation 
Maximization 
(Iterative AllReduce) 
and Levenberg
Marquardt with 
Conjugate Gradient 
21
https://portal.futuregrid.org 
Some (NIST)Large Data mining Problems II
• Determine optimal geo and angle representation of “all” images by giant 
least squares fit to 6‐D Camera pose of each image and 3D position of points 
in scene
• Levenberg‐Marquardt using Conjugate Gradient to estimate leading 
eigenvector and solve equations
• Note such Newton approaches fail for learning networks as too many 
parameters
• Need Hadoop and HDFS with “trivial problem” of just 15,000 images and 
75,000 points giving 1 TB messages per iteration
• Over 500 million images uploaded each day (1 in 1000 Eiffel tower) …..
22
https://portal.futuregrid.org 
Alternative Approach to Image Classification
• Instead of learning networks one can (always) use 
clustering to divide spaces into compact nearby 
regions
• Characterize images by a feature vector in 512‐2048 
dimensional spaces (HOG or Histograms of Oriented 
Gradients)
• Cluster (K‐means) 100 million vectors (100,000 
images) into 10 million clusters
• Giant Broadcast and AllReduce Operations that 
stress most MPI implementations
• Note Kmeans (Mahout) dreadful with Hadoop
23
https://portal.futuregrid.org 
Clusters v. Regions
• In Lymphocytes clusters are distinct
• In Pathology (NIST Big Data Use Case), clusters divide space 
into regions and sophisticated methods like deterministic 
annealing are probably unnecessary
24
Pathology 54D
Lymphocytes 4D
https://portal.futuregrid.org 
Map Collective Model (Judy Qiu)
• Combine MPI and MapReduce ideas
• Implement collectives optimally on Infiniband, 
Azure, Amazon ……
25
Input
map
Generalized Reduce
Initial Collective Step
Final Collective Step
Iterate
https://portal.futuregrid.org 
4 Forms of MapReduce
26
(a) Map Only
(d) Loosely
Synchronous
(c) Iterative
MapReduce
(b) Classic
MapReduce
Input
map
reduce
Input
map
reduce
Iterations
Input
Output
map
Pij
BLAST Analysis
Parametric sweep
Pleasingly Parallel
High Energy Physics
(HEP) Histograms
Distributed search
Classic MPI
PDE Solvers and
particle dynamics
Domain of MapReduce and Iterative Extensions
Science Clouds
MPI
Exascale
Expectation maximization
Clustering e.g. Kmeans
Linear Algebra, Page Rank
(a) Map Only
(d) Loosely
Synchronous
(c) Iterative
MapReduce
(b) Classic
MapReduce
InputInput
mapmap
reducereduce
InputInput
mapmap
reducereduce
IterationsIterations
InputInput
OutputOutput
mapmap
Pij
BLAST Analysis
Parametric sweep
Pleasingly Parallel
High Energy Physics
(HEP) Histograms
Distributed search
Classic MPI
PDE Solvers and
particle dynamics
Domain of MapReduce and Iterative Extensions
Science Clouds
MPI
Exascale
Expectation maximization
Clustering e.g. Kmeans
Linear Algebra, Page Rank
MPI is Map followed by Point to Point Communication – as in style d)
https://portal.futuregrid.org 
Twister for Data Intensive 
Iterative Applications
• (Iterative) MapReduce structure with Map‐Collective is 
framework
• Twister runs on Linux or Azure
• Twister4Azure is built on top of Azure tables, queues, 
storage
Compute Communication Reduce/ barrier
New Iteration
Larger Loop‐
Invariant Data
Generalize to Generalize to 
arbitrary 
Collective 
Broadcast
Smaller Loop‐
Variant Data
Qiu, Gunarathne
https://portal.futuregrid.org 
Kmeans Clustering on Azure
Number of tasks running as function of  time
0
50
100
150
200
250
300
0 25 50 75 100 125 150 175 200 225 250
Number of Executing Map Tasks
Elapsed Time (s)
This shows that the communication and synchronization overheads between iterations are very small 
(less than one second, which is the lowest measured unit for this graph). 
128 Million data points(19GB), 500 centroids (78KB), 20 dimensions
10 iterations, 256 cores, 256 map tasks per iteration
https://portal.futuregrid.org 
Kmeans Clustering
Execution Time per task
128 Million data points(19GB), 500 centroids (78KB), 20 dimensions
10 iterations, 256 cores, 256 map tasks per iteration
0
10
20
30
40
50
60
70
0 256 512 768 1024 1280 1536 1792 2048 2304
Task Execution Time (s)
Map Task ID
https://portal.futuregrid.org 
• Shaded areas are computing only where Hadoop on HPC cluster 
fastest
• Areas above shading are overheads where T4A smallest and T4A with 
AllReduce collective has lowest overhead
• Note even on Azure Java (Orange) faster than T4A C# 30
0
200
400
600
800
1000
1200
1400
32 x 32 M 64 x 64 M 128 x 128 M 256 x 256 M
Time (s)
Num. Cores X Num. Data Points
Hadoop AllReduce
Hadoop MapReduce
Twister4Azure AllReduce
Twister4Azure Broadcast
Twister4Azure
HDInsight
(AzureHadoop)
Kmeans and (Iterative) MapReduce
https://portal.futuregrid.org 
Details of K‐means Linux Hadoop and 
Hadoop with AllReduce Collective
31
https://portal.futuregrid.org 
Data Science Education
Opportunities at universities
see recent New York Times articles
http://datascience101.wordpress.com/2013/04/13/new‐york‐times‐data‐science‐articles/
32
https://portal.futuregrid.org 
Data Science Education
• Broad Range of Topics from Policy to curation to 
applications and algorithms, programming models, data 
systems, statistics, and broad range of CS subjects such as 
Clouds, Programming, HCI,
• Plenty of Jobs and broader range of possibilities than 
computational science but similar cosmic issues
– What type of degree (Certificate, minor, track, “real” degree)
– What implementation (department, interdisciplinary group 
supporting education and research program)
• NIST Big Data initiative identifies Big Data, Data Science, 
Data Scientist as core concepts
• There are over 40 Data Science Curricula (4 Undergraduate, 
31 Masters,  5 Certificate, 3 PhD)
33
https://portal.futuregrid.org 
Computational Science
• Interdisciplinary field between computer science 
and applications with primary focus on simulation 
areas
• Very successful as a research area
– XSEDE and Exascale systems enable
• Several academic programs but these have been 
less successful than computational science research 
as
– No consensus as to curricula and jobs (don’t appoint 
faculty in computational science; do appoint to DoE labs)
– Field relatively small 
• Started around 1990
34
https://portal.futuregrid.org 
Data Science at Indiana University
• Link Statistics & School of Informatics and Computing 
(Computer Science, Informatics, Information & Library 
Science)
• Broader than most offerings
• Ought IMHO to involve application faculty
• Areas Data Analysis and Statistics, Data Lifecycle, 
Infrastructure (Clouds, Security), Applications
– How broad should requirements be
• Offer online Masters in MOOC format in full scale Fall 2014 
and as certificate on January 2014.
– Also allow residential students in flipped mode
• Free trial run of my MOOC on Big Data Mid October 2013
35
https://portal.futuregrid.org 
MOOC’s
36
https://portal.futuregrid.org  37Meeker/Wu May 29 2013 Internet Trends D11 Conference 
https://portal.futuregrid.org 
Massive Open Online Courses (MOOC)
• MOOC’s are very “hot” these days with Udacity and 
Coursera as start‐ups; perhaps over 100,000 participants 
• Relevant to Data Science (where IU is preparing a MOOC) 
as this is a new field with few courses at most universities
• Typical model is collection of short prerecorded segments 
(talking head over PowerPoint) of length 3‐15 minutes
• These “lesson objects” can be viewed as “songs”
• Google Course Builder (python open source) builds 
customizable  MOOC’s as “playlists” of “songs”
• Tells you to capture all material as “lesson objects”
• We are aiming to build a repository of many “songs”; used 
in many ways – tutorials, classes …
38
https://portal.futuregrid.org  39Meeker/Wu May 29 2013 Internet Trends D11 Conference 
https://portal.futuregrid.org  40
• Twelve 
~10 
minutes 
lesson 
objects in 
this 
lecture
• IU wants 
us to close 
caption if 
use in real 
course
https://portal.futuregrid.org 
Customizable MOOC’s 
• We could  teach one class to 100,000 students or 2,000 classes to 50 
students
• The 2,000 class choice has 2 useful features
– One can use the usual (electronic) mentoring/grading technology
– One can customize each of 2,000 classes for a particular audience given their 
level and interests
– One can even allow student to customize – that’s what one does in making 
play lists in iTunes
– Flipped Classroom
• Both models can be supported by a repository of lesson objects (3‐
15 minute video segments) in the cloud
• The teacher can choose from existing lesson objects and add their 
own to produce a new customized course with new lessons 
contributed back to repository
41
https://portal.futuregrid.org 
Key MOOC areas costing money/effort
• Make content including content, quizzes, homework
• Record video 
• Make web site
• Social Networking Interaction for mentoring student‐
Teaching assistants and student‐student
• Defining how to support computing labs with FutureGrid 
or appliances + Virtual Box
– Appliances scale as download to student’s client
– Virtual machines essential 
• Analyse/Evaluate interactions
42
https://portal.futuregrid.org  43
FutureGrid hosts many classes per semester
How to use FutureGrid is shared MOOC
https://portal.futuregrid.org 
Conclusions
44
https://portal.futuregrid.org 
Conclusions• Data Intensive programs are not like simulations as they have large 
“reductions” (“collectives”) and do not have many small messages
– Clouds suitable and in fact HPC sometimes optimal
• Iterative MapReduce an interesting approach; need to optimize collectives 
for new applications (Data analytics) and resources (clouds, GPU’s …)
• Need an initiative to build scalable high performance data analytics library
on top of interoperable cloud‐HPC platform
– Full matrices important
• More employment opportunities in clouds than HPC and Grids and in data 
than simulation; so cloud and data related activities popular with students
• Community activity to discuss data science education
– Agree on curricula; is such a degree attractive?
• Role of MOOC’s for either
– Disseminating new curricula 
– Managing course fragments that can be assembled into custom courses 
for particular interdisciplinary students
45

Contenu connexe

Tendances

Web search-metrics-tutorial-www2010-section-1of7-introduction
Web search-metrics-tutorial-www2010-section-1of7-introductionWeb search-metrics-tutorial-www2010-section-1of7-introduction
Web search-metrics-tutorial-www2010-section-1of7-introductionAli Dasdan
 
Frictionless Supercomputing - MEW25
Frictionless Supercomputing - MEW25Frictionless Supercomputing - MEW25
Frictionless Supercomputing - MEW25Martin Hamilton
 
Web search-metrics-tutorial-www2010-section-7of7-presentation
Web search-metrics-tutorial-www2010-section-7of7-presentationWeb search-metrics-tutorial-www2010-section-7of7-presentation
Web search-metrics-tutorial-www2010-section-7of7-presentationAli Dasdan
 
Digital Transformation of Civil Engineering and Construction
Digital Transformation of Civil Engineering and ConstructionDigital Transformation of Civil Engineering and Construction
Digital Transformation of Civil Engineering and Constructionpdemian
 
Elements of AI Luxembourg - session 5
Elements of AI Luxembourg - session 5Elements of AI Luxembourg - session 5
Elements of AI Luxembourg - session 5Jeremie Dauphin
 
Digital Transformation of Civil Engineering and Construction
Digital Transformation of Civil Engineering and ConstructionDigital Transformation of Civil Engineering and Construction
Digital Transformation of Civil Engineering and Constructionpdemian
 
Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Alexandru Iosup
 
Big Data in the Cloud: Enabling the Fourth Paradigm by Matching SMEs with Dat...
Big Data in the Cloud: Enabling the Fourth Paradigm by Matching SMEs with Dat...Big Data in the Cloud: Enabling the Fourth Paradigm by Matching SMEs with Dat...
Big Data in the Cloud: Enabling the Fourth Paradigm by Matching SMEs with Dat...Alexandru Iosup
 
How Does Data Science Impact the Semantic Web?
How Does Data Science Impact the Semantic Web?How Does Data Science Impact the Semantic Web?
How Does Data Science Impact the Semantic Web?Philip Bourne
 
The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?Anna Fensel
 
Jisc - Rebooting a National Innovation Agency (EUNIS 2014)
Jisc - Rebooting a National Innovation Agency (EUNIS 2014)Jisc - Rebooting a National Innovation Agency (EUNIS 2014)
Jisc - Rebooting a National Innovation Agency (EUNIS 2014)Martin Hamilton
 
Research data spring: DataVault
Research data spring: DataVaultResearch data spring: DataVault
Research data spring: DataVaultJisc RDM
 
Makers Go To College - Your Digital Future 2016
Makers Go To College - Your Digital Future 2016Makers Go To College - Your Digital Future 2016
Makers Go To College - Your Digital Future 2016Martin Hamilton
 
Evolving the Web into a Global Dataspace – Advances and Applications
Evolving the Web into a Global Dataspace – Advances and ApplicationsEvolving the Web into a Global Dataspace – Advances and Applications
Evolving the Web into a Global Dataspace – Advances and ApplicationsChris Bizer
 
GrenchMark at CCGrid, May 2006.
GrenchMark at CCGrid, May 2006.GrenchMark at CCGrid, May 2006.
GrenchMark at CCGrid, May 2006.Alexandru Iosup
 
Putting Data to Work: Moving science forward together beyond where we thought...
Putting Data to Work: Moving science forward together beyond where we thought...Putting Data to Work: Moving science forward together beyond where we thought...
Putting Data to Work: Moving science forward together beyond where we thought...Erin Robinson
 
The future of cloud computing - Jisc Digifest 2016
The future of cloud computing - Jisc Digifest 2016The future of cloud computing - Jisc Digifest 2016
The future of cloud computing - Jisc Digifest 2016Jisc
 
SafeShare - Networkshop44
SafeShare - Networkshop44SafeShare - Networkshop44
SafeShare - Networkshop44Jisc
 
Global Research Data Initiatives
Global Research Data InitiativesGlobal Research Data Initiatives
Global Research Data InitiativesSarah Jones
 
Designing a second generation of open data platforms
Designing a second generation of open data platformsDesigning a second generation of open data platforms
Designing a second generation of open data platformsYannis Charalabidis
 

Tendances (20)

Web search-metrics-tutorial-www2010-section-1of7-introduction
Web search-metrics-tutorial-www2010-section-1of7-introductionWeb search-metrics-tutorial-www2010-section-1of7-introduction
Web search-metrics-tutorial-www2010-section-1of7-introduction
 
Frictionless Supercomputing - MEW25
Frictionless Supercomputing - MEW25Frictionless Supercomputing - MEW25
Frictionless Supercomputing - MEW25
 
Web search-metrics-tutorial-www2010-section-7of7-presentation
Web search-metrics-tutorial-www2010-section-7of7-presentationWeb search-metrics-tutorial-www2010-section-7of7-presentation
Web search-metrics-tutorial-www2010-section-7of7-presentation
 
Digital Transformation of Civil Engineering and Construction
Digital Transformation of Civil Engineering and ConstructionDigital Transformation of Civil Engineering and Construction
Digital Transformation of Civil Engineering and Construction
 
Elements of AI Luxembourg - session 5
Elements of AI Luxembourg - session 5Elements of AI Luxembourg - session 5
Elements of AI Luxembourg - session 5
 
Digital Transformation of Civil Engineering and Construction
Digital Transformation of Civil Engineering and ConstructionDigital Transformation of Civil Engineering and Construction
Digital Transformation of Civil Engineering and Construction
 
Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.
 
Big Data in the Cloud: Enabling the Fourth Paradigm by Matching SMEs with Dat...
Big Data in the Cloud: Enabling the Fourth Paradigm by Matching SMEs with Dat...Big Data in the Cloud: Enabling the Fourth Paradigm by Matching SMEs with Dat...
Big Data in the Cloud: Enabling the Fourth Paradigm by Matching SMEs with Dat...
 
How Does Data Science Impact the Semantic Web?
How Does Data Science Impact the Semantic Web?How Does Data Science Impact the Semantic Web?
How Does Data Science Impact the Semantic Web?
 
The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?
 
Jisc - Rebooting a National Innovation Agency (EUNIS 2014)
Jisc - Rebooting a National Innovation Agency (EUNIS 2014)Jisc - Rebooting a National Innovation Agency (EUNIS 2014)
Jisc - Rebooting a National Innovation Agency (EUNIS 2014)
 
Research data spring: DataVault
Research data spring: DataVaultResearch data spring: DataVault
Research data spring: DataVault
 
Makers Go To College - Your Digital Future 2016
Makers Go To College - Your Digital Future 2016Makers Go To College - Your Digital Future 2016
Makers Go To College - Your Digital Future 2016
 
Evolving the Web into a Global Dataspace – Advances and Applications
Evolving the Web into a Global Dataspace – Advances and ApplicationsEvolving the Web into a Global Dataspace – Advances and Applications
Evolving the Web into a Global Dataspace – Advances and Applications
 
GrenchMark at CCGrid, May 2006.
GrenchMark at CCGrid, May 2006.GrenchMark at CCGrid, May 2006.
GrenchMark at CCGrid, May 2006.
 
Putting Data to Work: Moving science forward together beyond where we thought...
Putting Data to Work: Moving science forward together beyond where we thought...Putting Data to Work: Moving science forward together beyond where we thought...
Putting Data to Work: Moving science forward together beyond where we thought...
 
The future of cloud computing - Jisc Digifest 2016
The future of cloud computing - Jisc Digifest 2016The future of cloud computing - Jisc Digifest 2016
The future of cloud computing - Jisc Digifest 2016
 
SafeShare - Networkshop44
SafeShare - Networkshop44SafeShare - Networkshop44
SafeShare - Networkshop44
 
Global Research Data Initiatives
Global Research Data InitiativesGlobal Research Data Initiatives
Global Research Data Initiatives
 
Designing a second generation of open data platforms
Designing a second generation of open data platformsDesigning a second generation of open data platforms
Designing a second generation of open data platforms
 

Similaire à Big Data and Clouds: Research and Education

Eecs6893 big dataanalytics-lecture1
Eecs6893 big dataanalytics-lecture1Eecs6893 big dataanalytics-lecture1
Eecs6893 big dataanalytics-lecture1Aravindharamanan S
 
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...Geoffrey Fox
 
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...Geoffrey Fox
 
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...LIBER Europe
 
AI-Driven Science and Engineering with the Global AI and Modeling Supercomput...
AI-Driven Science and Engineering with the Global AI and Modeling Supercomput...AI-Driven Science and Engineering with the Global AI and Modeling Supercomput...
AI-Driven Science and Engineering with the Global AI and Modeling Supercomput...Geoffrey Fox
 
Cloud for Research and Innovation - UK USA HPC workshop, Oxford, July 205
Cloud for Research and Innovation - UK USA HPC workshop, Oxford, July 205Cloud for Research and Innovation - UK USA HPC workshop, Oxford, July 205
Cloud for Research and Innovation - UK USA HPC workshop, Oxford, July 205Martin Hamilton
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best PracticesDATAVERSITY
 
Future of Data Platform in Cloud Native world
Future of Data Platform in Cloud Native worldFuture of Data Platform in Cloud Native world
Future of Data Platform in Cloud Native worldSrivatsan Srinivasan
 
IDC Perspectives on Big Data Outside of HPC
IDC Perspectives on Big Data Outside of HPCIDC Perspectives on Big Data Outside of HPC
IDC Perspectives on Big Data Outside of HPCinside-BigData.com
 
Cyberinfrastructure and its Role in Science
Cyberinfrastructure and its Role in ScienceCyberinfrastructure and its Role in Science
Cyberinfrastructure and its Role in ScienceCameron Kiddle
 
IoT to Cloud: Middle Layer (e.g Gateway, Hubs, Fog, Edge Computing)
IoT to Cloud: Middle Layer (e.g Gateway, Hubs, Fog, Edge Computing)IoT to Cloud: Middle Layer (e.g Gateway, Hubs, Fog, Edge Computing)
IoT to Cloud: Middle Layer (e.g Gateway, Hubs, Fog, Edge Computing)Bob Marcus
 
Lesson 2: Cloud data storage and retrieval
Lesson 2: Cloud data storage and retrievalLesson 2: Cloud data storage and retrieval
Lesson 2: Cloud data storage and retrievalOriol Borrás Gené
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...Projeto RCAAP
 
What makes it worth becoming a Data Engineer?
What makes it worth becoming a Data Engineer?What makes it worth becoming a Data Engineer?
What makes it worth becoming a Data Engineer?Hadi Fadlallah
 
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016Chris Jang
 
Research, the Cloud, and the IRB
Research, the Cloud, and the IRBResearch, the Cloud, and the IRB
Research, the Cloud, and the IRBMichael Zimmer
 

Similaire à Big Data and Clouds: Research and Education (20)

Eecs6893 big dataanalytics-lecture1
Eecs6893 big dataanalytics-lecture1Eecs6893 big dataanalytics-lecture1
Eecs6893 big dataanalytics-lecture1
 
Bigdata-Intro.pptx
Bigdata-Intro.pptxBigdata-Intro.pptx
Bigdata-Intro.pptx
 
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Centerp...
 
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...
Big Data Applications & Analytics Motivation: Big Data and the Cloud; Center...
 
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
 
AI-Driven Science and Engineering with the Global AI and Modeling Supercomput...
AI-Driven Science and Engineering with the Global AI and Modeling Supercomput...AI-Driven Science and Engineering with the Global AI and Modeling Supercomput...
AI-Driven Science and Engineering with the Global AI and Modeling Supercomput...
 
Cloud for Research and Innovation - UK USA HPC workshop, Oxford, July 205
Cloud for Research and Innovation - UK USA HPC workshop, Oxford, July 205Cloud for Research and Innovation - UK USA HPC workshop, Oxford, July 205
Cloud for Research and Innovation - UK USA HPC workshop, Oxford, July 205
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Future of Data Platform in Cloud Native world
Future of Data Platform in Cloud Native worldFuture of Data Platform in Cloud Native world
Future of Data Platform in Cloud Native world
 
IDC Perspectives on Big Data Outside of HPC
IDC Perspectives on Big Data Outside of HPCIDC Perspectives on Big Data Outside of HPC
IDC Perspectives on Big Data Outside of HPC
 
Cyberinfrastructure and its Role in Science
Cyberinfrastructure and its Role in ScienceCyberinfrastructure and its Role in Science
Cyberinfrastructure and its Role in Science
 
Exploring the Cloud
Exploring the CloudExploring the Cloud
Exploring the Cloud
 
IoT to Cloud: Middle Layer (e.g Gateway, Hubs, Fog, Edge Computing)
IoT to Cloud: Middle Layer (e.g Gateway, Hubs, Fog, Edge Computing)IoT to Cloud: Middle Layer (e.g Gateway, Hubs, Fog, Edge Computing)
IoT to Cloud: Middle Layer (e.g Gateway, Hubs, Fog, Edge Computing)
 
Lesson 2: Cloud data storage and retrieval
Lesson 2: Cloud data storage and retrievalLesson 2: Cloud data storage and retrieval
Lesson 2: Cloud data storage and retrieval
 
NREN 3.0
NREN 3.0NREN 3.0
NREN 3.0
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
What makes it worth becoming a Data Engineer?
What makes it worth becoming a Data Engineer?What makes it worth becoming a Data Engineer?
What makes it worth becoming a Data Engineer?
 
QuSandbox+NVIDIA Rapids
QuSandbox+NVIDIA RapidsQuSandbox+NVIDIA Rapids
QuSandbox+NVIDIA Rapids
 
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
Google Cloud Platform & rockPlace Big Data Event-Mar.31.2016
 
Research, the Cloud, and the IRB
Research, the Cloud, and the IRBResearch, the Cloud, and the IRB
Research, the Cloud, and the IRB
 

Plus de Geoffrey Fox

Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...
Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...
Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...Geoffrey Fox
 
High Performance Computing and Big Data
High Performance Computing and Big Data High Performance Computing and Big Data
High Performance Computing and Big Data Geoffrey Fox
 
Spidal Java: High Performance Data Analytics with Java on Large Multicore HPC...
Spidal Java: High Performance Data Analytics with Java on Large Multicore HPC...Spidal Java: High Performance Data Analytics with Java on Large Multicore HPC...
Spidal Java: High Performance Data Analytics with Java on Large Multicore HPC...Geoffrey Fox
 
Big Data HPC Convergence
Big Data HPC ConvergenceBig Data HPC Convergence
Big Data HPC ConvergenceGeoffrey Fox
 
Data Science and Online Education
Data Science and Online EducationData Science and Online Education
Data Science and Online EducationGeoffrey Fox
 
High Performance Processing of Streaming Data
High Performance Processing of Streaming DataHigh Performance Processing of Streaming Data
High Performance Processing of Streaming DataGeoffrey Fox
 
Classifying Simulation and Data Intensive Applications and the HPC-Big Data C...
Classifying Simulation and Data Intensive Applications and the HPC-Big Data C...Classifying Simulation and Data Intensive Applications and the HPC-Big Data C...
Classifying Simulation and Data Intensive Applications and the HPC-Big Data C...Geoffrey Fox
 
Visualizing and Clustering Life Science Applications in Parallel 
Visualizing and Clustering Life Science Applications in Parallel Visualizing and Clustering Life Science Applications in Parallel 
Visualizing and Clustering Life Science Applications in Parallel Geoffrey Fox
 
Lessons from Data Science Program at Indiana University: Curriculum, Students...
Lessons from Data Science Program at Indiana University: Curriculum, Students...Lessons from Data Science Program at Indiana University: Curriculum, Students...
Lessons from Data Science Program at Indiana University: Curriculum, Students...Geoffrey Fox
 
HPC-ABDS High Performance Computing Enhanced Apache Big Data Stack (with a ...
HPC-ABDS High Performance Computing Enhanced Apache Big Data Stack (with a ...HPC-ABDS High Performance Computing Enhanced Apache Big Data Stack (with a ...
HPC-ABDS High Performance Computing Enhanced Apache Big Data Stack (with a ...Geoffrey Fox
 
Data Science Curriculum at Indiana University
Data Science Curriculum at Indiana UniversityData Science Curriculum at Indiana University
Data Science Curriculum at Indiana UniversityGeoffrey Fox
 
What is the "Big Data" version of the Linpack Benchmark? ; What is “Big Data...
What is the "Big Data" version of the Linpack Benchmark?; What is “Big Data...What is the "Big Data" version of the Linpack Benchmark?; What is “Big Data...
What is the "Big Data" version of the Linpack Benchmark? ; What is “Big Data...Geoffrey Fox
 
Experience with Online Teaching with Open Source MOOC Technology
Experience with Online Teaching with Open Source MOOC TechnologyExperience with Online Teaching with Open Source MOOC Technology
Experience with Online Teaching with Open Source MOOC TechnologyGeoffrey Fox
 
Cloud Services for Big Data Analytics
Cloud Services for Big Data AnalyticsCloud Services for Big Data Analytics
Cloud Services for Big Data AnalyticsGeoffrey Fox
 
Matching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software ArchitecturesMatching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software ArchitecturesGeoffrey Fox
 
Comparing Big Data and Simulation Applications and Implications for Software ...
Comparing Big Data and Simulation Applications and Implications for Software ...Comparing Big Data and Simulation Applications and Implications for Software ...
Comparing Big Data and Simulation Applications and Implications for Software ...Geoffrey Fox
 
High Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeHigh Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeGeoffrey Fox
 
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...Geoffrey Fox
 
HPC-ABDS: The Case for an Integrating Apache Big Data Stack with HPC
HPC-ABDS: The Case for an Integrating Apache Big Data Stack with HPC HPC-ABDS: The Case for an Integrating Apache Big Data Stack with HPC
HPC-ABDS: The Case for an Integrating Apache Big Data Stack with HPC Geoffrey Fox
 
Classification of Big Data Use Cases by different Facets
Classification of Big Data Use Cases by different FacetsClassification of Big Data Use Cases by different Facets
Classification of Big Data Use Cases by different FacetsGeoffrey Fox
 

Plus de Geoffrey Fox (20)

Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...
Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...
Next Generation Grid: Integrating Parallel and Distributed Computing Runtimes...
 
High Performance Computing and Big Data
High Performance Computing and Big Data High Performance Computing and Big Data
High Performance Computing and Big Data
 
Spidal Java: High Performance Data Analytics with Java on Large Multicore HPC...
Spidal Java: High Performance Data Analytics with Java on Large Multicore HPC...Spidal Java: High Performance Data Analytics with Java on Large Multicore HPC...
Spidal Java: High Performance Data Analytics with Java on Large Multicore HPC...
 
Big Data HPC Convergence
Big Data HPC ConvergenceBig Data HPC Convergence
Big Data HPC Convergence
 
Data Science and Online Education
Data Science and Online EducationData Science and Online Education
Data Science and Online Education
 
High Performance Processing of Streaming Data
High Performance Processing of Streaming DataHigh Performance Processing of Streaming Data
High Performance Processing of Streaming Data
 
Classifying Simulation and Data Intensive Applications and the HPC-Big Data C...
Classifying Simulation and Data Intensive Applications and the HPC-Big Data C...Classifying Simulation and Data Intensive Applications and the HPC-Big Data C...
Classifying Simulation and Data Intensive Applications and the HPC-Big Data C...
 
Visualizing and Clustering Life Science Applications in Parallel 
Visualizing and Clustering Life Science Applications in Parallel Visualizing and Clustering Life Science Applications in Parallel 
Visualizing and Clustering Life Science Applications in Parallel 
 
Lessons from Data Science Program at Indiana University: Curriculum, Students...
Lessons from Data Science Program at Indiana University: Curriculum, Students...Lessons from Data Science Program at Indiana University: Curriculum, Students...
Lessons from Data Science Program at Indiana University: Curriculum, Students...
 
HPC-ABDS High Performance Computing Enhanced Apache Big Data Stack (with a ...
HPC-ABDS High Performance Computing Enhanced Apache Big Data Stack (with a ...HPC-ABDS High Performance Computing Enhanced Apache Big Data Stack (with a ...
HPC-ABDS High Performance Computing Enhanced Apache Big Data Stack (with a ...
 
Data Science Curriculum at Indiana University
Data Science Curriculum at Indiana UniversityData Science Curriculum at Indiana University
Data Science Curriculum at Indiana University
 
What is the "Big Data" version of the Linpack Benchmark? ; What is “Big Data...
What is the "Big Data" version of the Linpack Benchmark?; What is “Big Data...What is the "Big Data" version of the Linpack Benchmark?; What is “Big Data...
What is the "Big Data" version of the Linpack Benchmark? ; What is “Big Data...
 
Experience with Online Teaching with Open Source MOOC Technology
Experience with Online Teaching with Open Source MOOC TechnologyExperience with Online Teaching with Open Source MOOC Technology
Experience with Online Teaching with Open Source MOOC Technology
 
Cloud Services for Big Data Analytics
Cloud Services for Big Data AnalyticsCloud Services for Big Data Analytics
Cloud Services for Big Data Analytics
 
Matching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software ArchitecturesMatching Data Intensive Applications and Hardware/Software Architectures
Matching Data Intensive Applications and Hardware/Software Architectures
 
Comparing Big Data and Simulation Applications and Implications for Software ...
Comparing Big Data and Simulation Applications and Implications for Software ...Comparing Big Data and Simulation Applications and Implications for Software ...
Comparing Big Data and Simulation Applications and Implications for Software ...
 
High Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeHigh Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run Time
 
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
 
HPC-ABDS: The Case for an Integrating Apache Big Data Stack with HPC
HPC-ABDS: The Case for an Integrating Apache Big Data Stack with HPC HPC-ABDS: The Case for an Integrating Apache Big Data Stack with HPC
HPC-ABDS: The Case for an Integrating Apache Big Data Stack with HPC
 
Classification of Big Data Use Cases by different Facets
Classification of Big Data Use Cases by different FacetsClassification of Big Data Use Cases by different Facets
Classification of Big Data Use Cases by different Facets
 

Dernier

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 

Dernier (20)

Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 

Big Data and Clouds: Research and Education