SlideShare une entreprise Scribd logo
1  sur  43
Télécharger pour lire hors ligne
Graph Realities
Paco Nathan @pacoid
Personal Background
• applied math, machine learning, distributed systems
• R&D for neural networks, incl. hardware (1986-1997)
• “guinea pig” for early cloud (2005-ff)
• led data teams in industry
• assisted popular open source projects: 

Spark, Jupyter, etc.
• development focus on natural language plus adjacent
knowledge graph use cases
• since 2018, increasingly working at the intersection 

of public sector + enterprise + open source
Motivations
Not all that long ago, graph applications were considered
exotic and expensive.
Until recently, few software engineers had much experience
putting graphs to work; however, those use cases have now
become much more commonplace.
This talk explores a practical use case, one that addresses
key issues of data governance and reproducible research,
and depends on sophisticated use of graph technology.
First, some perspectives and industry analysis…
part 1:

graph perspectives
Perspectives
• the ubiquity of linked data
• the tyranny of “thinking relational”
• the primacy of working with graphs 

(and their math analog, tensors)
• nouns vs. verbs vs. adjectives

(extreme nominalization)
• evolution of hardware, cloud, 

and cluster topologies
• the power of graph embeddings
Historical Context
“Data Science: Past and Future”

Rev 2 (2019-05-24) slides
“What is Data Science?”

IBM Data Science Community (2019-03-04)
Just Enough Math

O’Reilly Media (2014)
• John Tukey: data analytics as an intrinsically empirical 

and interdisciplinary field (1962)
• most popular data frameworks leveraged some graph 

processing, albeit obscured, ad-hoc, clumsy…
• they did well, given the hardware available at the time
Beauty in sparsity…
SuiteSparse Matrix Collection: 

a widely used set of sparse matrix
benchmarks collected from a wide 

range of applications
sparse.tamu.edu/
…for when you really, really, need
some interesting graph data
Theme 1: Stuffing graphs into matrices
algebraic graph theory allowed reuse of linear algebra impl
v
u
w
x
u v w x
u 0 1 0 1
v 1 0 1 1
w 0 1 0 1
x 1 1 1 0
• e.g., transform graph to an adjacency matrix
• most will be relatively sparse
• use LINPACK, BLAS, or libraries built atop
• much to leverage: SVD, power method, QR decomp, etc.
Theme 1: Stuffing graphs into matrices
for many real-world problems, the data are essentially graphs
1. real-world data
2. graph theory for representation
3. convert to sparse matrix for production
4. cost-effective parallel processing at scale
ergo, leverage low dimensional structure in high dimensional data
N Dims good, 2 Dims baa-d
However, complex graphs cannot be represented
as 2D matrices without serious information loss.
Ideally, tensors would be a better representation 

to use for linear algebra libraries.
While tensor decomposition is a hard problem, 

the general class of problems became much 

more interesting after 2012…
N Dims good, 2 Dims baa-d
However, complex graphs cannot be represented
as 2D matrices without serious information loss.
Ideally, tensors would be a better representation 

to use for linear algebra libraries.
While tensor decomposition is a hard problem, 

the general class of problems became much 

more interesting after 2012…
“The real problem is that programmers have spent far too

much time worrying about efficiency in the wrong place

and at the wrong times; premature optimization is the

root of all evil (or at least most of it) in programming.”
Don Knuth
Theme 2: Nouns, Verbs, Adjectives
Tracing back to the origins of relational databases, 

Edgar Codd was furious about how badly SQL and RDBMS
had misinterpreted his mathematical modeling of relations.
Years of EDW reinforced a sense of an extreme nominalization
with so much of the data representation being reduced into
dimensions, facts, indexes
Theme 2: Nouns, Verbs, Adjectives
a carry-over of extreme nominalization into graph DBs also

over-emphasizes the role of nodes and centrality for adjusting
the granularity of graph representations:
• discounts the importance of relations
• “mostly nouns, a few verbs, some adjectives”
• serious information loss
IMO: graph DB frameworks tend to err in this aspect, 

both in terms of representation and algorithm support.
Part of a long-term narrative arc in IT…
• arguably, circa 2001 was the heyday of DW+BI – later 

acting as an “embedded institution” w.r.t. data science
• Agile Manifesto became another “embedded institution”
• a generation of developers equated “database” with “relational”,

with a belief that legibility of systems == legibility of the data
• even so, first-movers collectively made a sudden turn 

toward NoSQL, partly in reaction to RDBMS pricing
• see also: 

“Statistical Modeling: The Two Cultures”

Leo Breiman UC Berkeley (2001)
Adjusting data resolution in graphs
In contrast, consider:
“Extracting the multiscale backbone of complex weighted networks”
M. Ángeles Serrano, Marián Boguña, Alessandro Vespignani

PNAS (2009-04-21)
Filtering large noisy graphs based on both 

nodes and edges can be useful for automated 

approaches in knowledge graph construction, 

see: github.com/DerwenAI/disparity_filter
An emerging trend disrupts the past 15-20 years 

of software engineering practice:
hardware > software > process
Hardware is now evolving more rapidly than software,
which is evolving more rapidly than effective process
Moore’s Law is all but dead, although ironically 

many inefficiencies had been based on it
See also: Pete Warden (2018) regarding

TensorFlow.js on low-power devices
Theme 3: Hardware in perspective
Theme 3: Evolution of cloud patterns
UC Berkeley published a 2009 report
about early use cases for cloud
computing, which foresaw the shape of
industry deployments over much of the
next decade, and led directly to Apache
Mesos and Apache Spark
It’s fascinating to study the contrasts
between that 2009 report and its 2019
follow-up.
(minor footnote: vimeo.com/3616394)
2009
Theme 3: Evolution of cloud patterns
Early cloud was intentionally “dumbed down”
to resemble popular virtualization software –
recognizable by IT staff – to support migration.
That approach is no longer needed.
Also, the physics + economics of cloud use
tend to imply less “framework” layers.
More contemporary patterns will force a
restructuring – for efficiency and security – 

i.e., decoupling computation and storage.
2019
Theme 3: Cluster topologies, by generation
Opinion: one problem with software/hardware interface for distributed
systems is that it’s taken decades to prioritize the need for handling
graphs/tensors directly within popular, accessible open source libraries,
without having some commercial database vendor intermediate.
1990s mid-2000s current
Theme 3: Cluster topologies, by generation
1990s mid-2000s current
see also: Jeff Dean (2013)
youtu.be/S9twUcX1Zp0
NB: graph
part 2:

industry analysis
“Two Cultures” for AI
A more useful distinction:
• ML is about the tools and technologies
• AI is about use case impact on social systems
Industry surveys for AI and Cloud adoption
• “Three surveys of AI adoption reveal key advice 

from more mature practices”

Ben Lorica, Paco Nathan O’Reilly Media (2019-02-20)
• Episode 7, Domino: surveying “ABC” adoption in enterprise

(2019-03-03)
Trends: Knowledge Graphs
We found that close to one
quarter of respondents were
using knowledge graphs.

Trends: Knowledge Graphs
Healthcare is adopting use 

of knowledge graphs more 

so than in Finance. 

Trends: Knowledge Graphs
Mature practices show more
interest in use of knowledge
graphs than firms which are 

still evaluating ML use cases.
Trends: an accelerating gap in AI funding
Note: firms with early advantage
are investing more, moving still
further away from the pack.

Overview of Data Governance
Paco Nathan @pacoid
Overview of Data Governance
derwen.ai/s/6fqt
cloud
3
cloud
2
cloud
1
security
security
security
compliance
db
mobile
devices
mobile
devices
mobile
devices
edge
cache
web
servers
dw
business
analytics
dat
govdurable
store
logs
models
other
data sci
workflows
cluster
compute
external
data
external
data
external
data
edge
inference
edge
inference
edge
inference
models
streaming data
dat
gov
dat
gov
dat
gov
dat
gov
dat
gov
dat
gov
dat
gov
we noted a resurgence in data 

governance – this report examines

key themes, vendors, issues, etc.
Unpacking AutoML
derwen.ai/s/yvkg
we noted an uptick in adoption for
a third aspect, co-evolving along
with DG and MLOps
meta-learning feature
selection
hyperparameter
optimization
model
selection
auto
scaling
feature
engineering
train
models
evaluate
results
integrate,
deploy
data
prep
data platform
usecases
Data Gov dovetails with MLOps and AutoML
meta-learning feature
selection
hyperparameter
optimization
model
selection
auto
scaling
feature
engineering
train
models
evaluate
results
integrate,
deploy
data
prep
data platform
usecases
data gov
trends
augment
AutoML
data gov
practices
follow
MLOps
Emerging category: watch the “AI Natives”
Projects (mostly OSS) that leverage knowledge graph 

of metadata about datasets and their usage:
• Amundsen @ Lyft

data discovery and metadata
• Databook @ Uber

manage metadata about datasets (pending OSS)
• Marquez @ WeWork, Stitch Fix

collect, aggregate, visualize metadata
• Data Hub @ LinkedIn

data discovery and lineage
• Metcat @ Netflix

data discovery, metadata service
• Dataportal @ Airbnb

integrated data-space (not OSS)
part 3:

case study – rich context
Administrative Data Research Facility
Coleridge Initiative

Julia Lane, et al. NYU Wagner
• FedRAMP-compliant ADRF framework on AWS GovCloud: 

“public agency capacity to accelerate the effective use of 

new datasets”
• for research projects using cross-agency sensitive data, 

in US and EU (and UK) – now in use by 15+ agencies
• cited as the first federal example of Secure Access to
Confidential Data in the final report of the Commission 

on Evidence-Based Policymaking
• augments Data Stewardship practices; collaboration 

with Project Jupyter on the related data gov features
• funding by Schmidt Futures, Sloan, Overdeck
ADRF and Rich Context
Coleridge Initiative

Julia Lane, et al. NYU Wagner
• Rich Context: knowledge graph of metadata about datasets,
used for entity linking, link prediction, recommendations, etc.
• benefits: agencies, researchers, publishers, data stewards,
data providers – see white paper
• ongoing ML competition for linking research publications
with dataset attribution (first comp. won by Allen AI)
• see “Human-in-the-loop AI for scholarly infrastructure”
• upcoming book:

Rich Search and Discovery for Research Datasets: Building
the next generation of scholarly infrastructure
AI for Scholarly Infrastructure
Rich Context
overall scope
leaderboard
competition
publisher
use cases
HITL:
RePEC, etc.
authors
accept/reject
links
models
infer links
corpus
research
pubs
leaderboard
evals results
inferred
linked data
1
2
3
• collaboration with SAGE Pub, Digital Science,

RePEc, etc.; partnering with Bundesbank (EU)
• knowledge graph vocabulary integrates W3C
metadata standards: DCAT, PAV, DCMI, CITO,
FaBiO, FOAF, etc.
• data as a strategic asset: knowledge graph 

produces an open corpus for the leaderboard
competition
• human-in-the-loop AI used to infer metadata 

then confirm with authors via RePEC, etc.
• adjacent work: graph embedding, meta-learning,
persistent identifiers, reproducible research
Related work at Project Jupyter
Make datasets and projects top-level constructs,
support metadata exchange and privacy-preserving
telemetry from notebook usage (due Oct 2019):
• JupyterLab Commenting and real-time collab 

similar to Google Docs
• JupyterLab Data Explorer: register datasets 

within research projects
• JupyterLab Metadata Explorer: browse metadata
descriptions, get recommendations through
knowledge graph inference (via extension)
• Data Registry (original proposal)
• Telemetry (privacy-preserving, reports usage)
Related work at Project Jupyter
Active Learning as a data strategy
Experts decide
about edge cases,
providing examples
Experts learn through
Customer interactions
Customers request
Sales, Marketing,
Service, Training
Experts gain insights
via Model explanations
ML
Models
Models focus Experts
(e.g., weak supervision)
Organizational
Learning
Human
Experts
Examples,
Actions
Customers
Models act on decisions
when possible
Customer
Use Cases
Models explore
uncertainty when needed
derwen.ai/s/d8b7
teams of people + machines,

leveraging the complementary

strengths of both
Parting thought
In many ways, we’re at a point in the industry with
graph data – particularly for use of knowledge graph
of metadata about dataset usage – which resembles
conditions immediately before “Web 2.0” became
big news.
The emerging category of “AI natives” projects
mentioned earlier could be parlayed into data utilities
more flexible than the AI services which the current
hyperscalers are fielding.
Watch this space.
Just Enough Math Rich Context Hylbert-Speys Themes + Confs
per Pacoid
publicaXons, interviews, conference summaries…
https://derwen.ai/paco

@pacoid
Rev conf

Contenu connexe

Tendances

Knowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data ScienceKnowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data ScienceCambridge Semantics
 
Knowledge graphs and graph databases
Knowledge graphs and graph databasesKnowledge graphs and graph databases
Knowledge graphs and graph databasesUche Ogbuji
 
Knowledge graphs, meet Deep Learning
Knowledge graphs, meet Deep LearningKnowledge graphs, meet Deep Learning
Knowledge graphs, meet Deep LearningConnected Data World
 
An Overview of the Emerging Graph Landscape (Oct 2013)
An Overview of the Emerging Graph Landscape (Oct 2013)An Overview of the Emerging Graph Landscape (Oct 2013)
An Overview of the Emerging Graph Landscape (Oct 2013)Emil Eifrem
 
NoSQL Technology and Real-time, Accurate Predictive Analytics
NoSQL Technology and Real-time, Accurate Predictive AnalyticsNoSQL Technology and Real-time, Accurate Predictive Analytics
NoSQL Technology and Real-time, Accurate Predictive AnalyticsInfiniteGraph
 
Big Graph : Tools, Techniques, Issues, Challenges and Future Directions
Big Graph : Tools, Techniques, Issues, Challenges and Future Directions Big Graph : Tools, Techniques, Issues, Challenges and Future Directions
Big Graph : Tools, Techniques, Issues, Challenges and Future Directions csandit
 
Graphs in Telecommunications - Jesus Barrasa, Neo4j
Graphs in Telecommunications - Jesus Barrasa, Neo4jGraphs in Telecommunications - Jesus Barrasa, Neo4j
Graphs in Telecommunications - Jesus Barrasa, Neo4jNeo4j
 
The Future is Big Graphs: A Community View on Graph Processing Systems
The Future is Big Graphs: A Community View on Graph Processing SystemsThe Future is Big Graphs: A Community View on Graph Processing Systems
The Future is Big Graphs: A Community View on Graph Processing SystemsNeo4j
 
Scaling up business value with real-time operational graph analytics
Scaling up business value with real-time operational graph analyticsScaling up business value with real-time operational graph analytics
Scaling up business value with real-time operational graph analyticsConnected Data World
 
Leveraging Knowledge Graphs in your Enterprise Knowledge Management System
Leveraging Knowledge Graphs in your Enterprise Knowledge Management SystemLeveraging Knowledge Graphs in your Enterprise Knowledge Management System
Leveraging Knowledge Graphs in your Enterprise Knowledge Management SystemSemantic Web Company
 
How Semantics Solves Big Data Challenges
How Semantics Solves Big Data ChallengesHow Semantics Solves Big Data Challenges
How Semantics Solves Big Data ChallengesDATAVERSITY
 
Using the Semantic Web Stack to Make Big Data Smarter
Using the Semantic Web Stack to Make  Big Data SmarterUsing the Semantic Web Stack to Make  Big Data Smarter
Using the Semantic Web Stack to Make Big Data SmarterMatheus Mota
 
Combining a Knowledge Graph and Graph Algorithms to Find Hidden Skills at NASA
Combining a Knowledge Graph and Graph Algorithms to Find Hidden Skills at NASACombining a Knowledge Graph and Graph Algorithms to Find Hidden Skills at NASA
Combining a Knowledge Graph and Graph Algorithms to Find Hidden Skills at NASANeo4j
 
Using A Distributed Graph Database To Make Sense Of Disparate Data Stores
Using A Distributed Graph Database To Make Sense Of Disparate Data StoresUsing A Distributed Graph Database To Make Sense Of Disparate Data Stores
Using A Distributed Graph Database To Make Sense Of Disparate Data StoresInfiniteGraph
 
Big Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and OpportunitiesBig Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and OpportunitiesSrinath Srinivasa
 
Using Knowledge Graphs to Predict Customer Needs and Improve Quality
Using Knowledge Graphs to Predict Customer Needs and Improve QualityUsing Knowledge Graphs to Predict Customer Needs and Improve Quality
Using Knowledge Graphs to Predict Customer Needs and Improve QualityNeo4j
 
Big Data Landscape 2016
Big Data Landscape 2016 Big Data Landscape 2016
Big Data Landscape 2016 Matt Turck
 
Risk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep LearningRisk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep LearningCambridge Semantics
 

Tendances (20)

Knowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data ScienceKnowledge Graph for Machine Learning and Data Science
Knowledge Graph for Machine Learning and Data Science
 
Knowledge graphs and graph databases
Knowledge graphs and graph databasesKnowledge graphs and graph databases
Knowledge graphs and graph databases
 
Knowledge graphs, meet Deep Learning
Knowledge graphs, meet Deep LearningKnowledge graphs, meet Deep Learning
Knowledge graphs, meet Deep Learning
 
An Overview of the Emerging Graph Landscape (Oct 2013)
An Overview of the Emerging Graph Landscape (Oct 2013)An Overview of the Emerging Graph Landscape (Oct 2013)
An Overview of the Emerging Graph Landscape (Oct 2013)
 
NoSQL Technology and Real-time, Accurate Predictive Analytics
NoSQL Technology and Real-time, Accurate Predictive AnalyticsNoSQL Technology and Real-time, Accurate Predictive Analytics
NoSQL Technology and Real-time, Accurate Predictive Analytics
 
Big Graph : Tools, Techniques, Issues, Challenges and Future Directions
Big Graph : Tools, Techniques, Issues, Challenges and Future Directions Big Graph : Tools, Techniques, Issues, Challenges and Future Directions
Big Graph : Tools, Techniques, Issues, Challenges and Future Directions
 
Graphs in Telecommunications - Jesus Barrasa, Neo4j
Graphs in Telecommunications - Jesus Barrasa, Neo4jGraphs in Telecommunications - Jesus Barrasa, Neo4j
Graphs in Telecommunications - Jesus Barrasa, Neo4j
 
The Future is Big Graphs: A Community View on Graph Processing Systems
The Future is Big Graphs: A Community View on Graph Processing SystemsThe Future is Big Graphs: A Community View on Graph Processing Systems
The Future is Big Graphs: A Community View on Graph Processing Systems
 
Scaling up business value with real-time operational graph analytics
Scaling up business value with real-time operational graph analyticsScaling up business value with real-time operational graph analytics
Scaling up business value with real-time operational graph analytics
 
Leveraging Knowledge Graphs in your Enterprise Knowledge Management System
Leveraging Knowledge Graphs in your Enterprise Knowledge Management SystemLeveraging Knowledge Graphs in your Enterprise Knowledge Management System
Leveraging Knowledge Graphs in your Enterprise Knowledge Management System
 
How Semantics Solves Big Data Challenges
How Semantics Solves Big Data ChallengesHow Semantics Solves Big Data Challenges
How Semantics Solves Big Data Challenges
 
Using the Semantic Web Stack to Make Big Data Smarter
Using the Semantic Web Stack to Make  Big Data SmarterUsing the Semantic Web Stack to Make  Big Data Smarter
Using the Semantic Web Stack to Make Big Data Smarter
 
BrightTALK - Semantic AI
BrightTALK - Semantic AI BrightTALK - Semantic AI
BrightTALK - Semantic AI
 
Combining a Knowledge Graph and Graph Algorithms to Find Hidden Skills at NASA
Combining a Knowledge Graph and Graph Algorithms to Find Hidden Skills at NASACombining a Knowledge Graph and Graph Algorithms to Find Hidden Skills at NASA
Combining a Knowledge Graph and Graph Algorithms to Find Hidden Skills at NASA
 
Graph db
Graph dbGraph db
Graph db
 
Using A Distributed Graph Database To Make Sense Of Disparate Data Stores
Using A Distributed Graph Database To Make Sense Of Disparate Data StoresUsing A Distributed Graph Database To Make Sense Of Disparate Data Stores
Using A Distributed Graph Database To Make Sense Of Disparate Data Stores
 
Big Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and OpportunitiesBig Data and the Semantic Web: Challenges and Opportunities
Big Data and the Semantic Web: Challenges and Opportunities
 
Using Knowledge Graphs to Predict Customer Needs and Improve Quality
Using Knowledge Graphs to Predict Customer Needs and Improve QualityUsing Knowledge Graphs to Predict Customer Needs and Improve Quality
Using Knowledge Graphs to Predict Customer Needs and Improve Quality
 
Big Data Landscape 2016
Big Data Landscape 2016 Big Data Landscape 2016
Big Data Landscape 2016
 
Risk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep LearningRisk Analytics Using Knowledge Graphs / FIBO with Deep Learning
Risk Analytics Using Knowledge Graphs / FIBO with Deep Learning
 

Similaire à Graph Realities

Big Data is changing abruptly, and where it is likely heading
Big Data is changing abruptly, and where it is likely headingBig Data is changing abruptly, and where it is likely heading
Big Data is changing abruptly, and where it is likely headingPaco Nathan
 
Data Science in Future Tense
Data Science in Future TenseData Science in Future Tense
Data Science in Future TensePaco Nathan
 
Apache Spark and the Emerging Technology Landscape for Big Data
Apache Spark and the Emerging Technology Landscape for Big DataApache Spark and the Emerging Technology Landscape for Big Data
Apache Spark and the Emerging Technology Landscape for Big DataPaco Nathan
 
Python's Role in the Future of Data Analysis
Python's Role in the Future of Data AnalysisPython's Role in the Future of Data Analysis
Python's Role in the Future of Data AnalysisPeter Wang
 
NoSQL Object DB & NewSQL Columnar DB, A Tale of Two Databases
NoSQL Object DB & NewSQL Columnar DB, A Tale of Two DatabasesNoSQL Object DB & NewSQL Columnar DB, A Tale of Two Databases
NoSQL Object DB & NewSQL Columnar DB, A Tale of Two Databases✔ Eric David Benari, PMP
 
Claremont Report on Database Research: Research Directions (Le Gruenwald)
Claremont Report on Database Research: Research Directions (Le Gruenwald)Claremont Report on Database Research: Research Directions (Le Gruenwald)
Claremont Report on Database Research: Research Directions (Le Gruenwald)infoblog
 
Evaluation of graph databases
Evaluation of graph databasesEvaluation of graph databases
Evaluation of graph databasesijaia
 
Defining the true cloud (SugarCRM Webinar from 2012)
Defining the true cloud (SugarCRM Webinar from 2012)Defining the true cloud (SugarCRM Webinar from 2012)
Defining the true cloud (SugarCRM Webinar from 2012)Esteban Kolsky
 
Cloudera Breakfast: Advanced Analytics Part II: Do More With Your Data
Cloudera Breakfast: Advanced Analytics Part II: Do More With Your DataCloudera Breakfast: Advanced Analytics Part II: Do More With Your Data
Cloudera Breakfast: Advanced Analytics Part II: Do More With Your DataCloudera, Inc.
 
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data ScienceAI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data ScienceOptum
 
How Graph Databases used in Police Department?
How Graph Databases used in Police Department?How Graph Databases used in Police Department?
How Graph Databases used in Police Department?Samet KILICTAS
 
The Evolving Landscape of Data Engineering
The Evolving Landscape of Data EngineeringThe Evolving Landscape of Data Engineering
The Evolving Landscape of Data EngineeringAndrei Savu
 
Big data - what, why, where, when and how
Big data - what, why, where, when and howBig data - what, why, where, when and how
Big data - what, why, where, when and howbobosenthil
 
Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Alexandru Iosup
 
Future of big data nick kabra speaker compendium march 2013
Future of big data nick kabra speaker compendium march 2013Future of big data nick kabra speaker compendium march 2013
Future of big data nick kabra speaker compendium march 2013nkabra
 
Data centric business and knowledge graph trends
Data centric business and knowledge graph trendsData centric business and knowledge graph trends
Data centric business and knowledge graph trendsAlan Morrison
 
Mapping objects to_relational_databases
Mapping objects to_relational_databasesMapping objects to_relational_databases
Mapping objects to_relational_databasesIvan Paredes
 
Parallel Computing 2007: Overview
Parallel Computing 2007: OverviewParallel Computing 2007: Overview
Parallel Computing 2007: OverviewGeoffrey Fox
 

Similaire à Graph Realities (20)

Big Data is changing abruptly, and where it is likely heading
Big Data is changing abruptly, and where it is likely headingBig Data is changing abruptly, and where it is likely heading
Big Data is changing abruptly, and where it is likely heading
 
Data Science in Future Tense
Data Science in Future TenseData Science in Future Tense
Data Science in Future Tense
 
Apache Spark and the Emerging Technology Landscape for Big Data
Apache Spark and the Emerging Technology Landscape for Big DataApache Spark and the Emerging Technology Landscape for Big Data
Apache Spark and the Emerging Technology Landscape for Big Data
 
Python's Role in the Future of Data Analysis
Python's Role in the Future of Data AnalysisPython's Role in the Future of Data Analysis
Python's Role in the Future of Data Analysis
 
Big data business case
Big data   business caseBig data   business case
Big data business case
 
NoSQL Object DB & NewSQL Columnar DB, A Tale of Two Databases
NoSQL Object DB & NewSQL Columnar DB, A Tale of Two DatabasesNoSQL Object DB & NewSQL Columnar DB, A Tale of Two Databases
NoSQL Object DB & NewSQL Columnar DB, A Tale of Two Databases
 
Claremont Report on Database Research: Research Directions (Le Gruenwald)
Claremont Report on Database Research: Research Directions (Le Gruenwald)Claremont Report on Database Research: Research Directions (Le Gruenwald)
Claremont Report on Database Research: Research Directions (Le Gruenwald)
 
Evaluation of graph databases
Evaluation of graph databasesEvaluation of graph databases
Evaluation of graph databases
 
Defining the true cloud (SugarCRM Webinar from 2012)
Defining the true cloud (SugarCRM Webinar from 2012)Defining the true cloud (SugarCRM Webinar from 2012)
Defining the true cloud (SugarCRM Webinar from 2012)
 
Cloudera Breakfast: Advanced Analytics Part II: Do More With Your Data
Cloudera Breakfast: Advanced Analytics Part II: Do More With Your DataCloudera Breakfast: Advanced Analytics Part II: Do More With Your Data
Cloudera Breakfast: Advanced Analytics Part II: Do More With Your Data
 
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data ScienceAI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
AI, Knowledge Representation and Graph Databases -
 Key Trends in Data Science
 
How Graph Databases used in Police Department?
How Graph Databases used in Police Department?How Graph Databases used in Police Department?
How Graph Databases used in Police Department?
 
The Evolving Landscape of Data Engineering
The Evolving Landscape of Data EngineeringThe Evolving Landscape of Data Engineering
The Evolving Landscape of Data Engineering
 
Big data - what, why, where, when and how
Big data - what, why, where, when and howBig data - what, why, where, when and how
Big data - what, why, where, when and how
 
Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.Cloud Programming Models: eScience, Big Data, etc.
Cloud Programming Models: eScience, Big Data, etc.
 
MODEL_FOR_SEMANTICALLY_RICH_POINT_CLOUD.pdf
MODEL_FOR_SEMANTICALLY_RICH_POINT_CLOUD.pdfMODEL_FOR_SEMANTICALLY_RICH_POINT_CLOUD.pdf
MODEL_FOR_SEMANTICALLY_RICH_POINT_CLOUD.pdf
 
Future of big data nick kabra speaker compendium march 2013
Future of big data nick kabra speaker compendium march 2013Future of big data nick kabra speaker compendium march 2013
Future of big data nick kabra speaker compendium march 2013
 
Data centric business and knowledge graph trends
Data centric business and knowledge graph trendsData centric business and knowledge graph trends
Data centric business and knowledge graph trends
 
Mapping objects to_relational_databases
Mapping objects to_relational_databasesMapping objects to_relational_databases
Mapping objects to_relational_databases
 
Parallel Computing 2007: Overview
Parallel Computing 2007: OverviewParallel Computing 2007: Overview
Parallel Computing 2007: Overview
 

Plus de Connected Data World

Systems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van HarmelenSystems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van HarmelenConnected Data World
 
Graph Abstractions Matter by Ora Lassila
Graph Abstractions Matter by Ora LassilaGraph Abstractions Matter by Ora Lassila
Graph Abstractions Matter by Ora LassilaConnected Data World
 
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Connected Data World
 
How to get started with Graph Machine Learning
How to get started with Graph Machine LearningHow to get started with Graph Machine Learning
How to get started with Graph Machine LearningConnected Data World
 
From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3Connected Data World
 
In Search of the Universal Data Model
In Search of the Universal Data ModelIn Search of the Universal Data Model
In Search of the Universal Data ModelConnected Data World
 
Graph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph DatabaseGraph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph DatabaseConnected Data World
 
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...Connected Data World
 
Semantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scaleSemantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scaleConnected Data World
 
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...Connected Data World
 
Schema, Google & The Future of the Web
Schema, Google & The Future of the WebSchema, Google & The Future of the Web
Schema, Google & The Future of the WebConnected Data World
 
Elegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property GraphsElegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property GraphsConnected Data World
 
Graph for Good: Empowering your NGO
Graph for Good: Empowering your NGOGraph for Good: Empowering your NGO
Graph for Good: Empowering your NGOConnected Data World
 
What are we Talking About, When we Talk About Ontology?
What are we Talking About, When we Talk About Ontology?What are we Talking About, When we Talk About Ontology?
What are we Talking About, When we Talk About Ontology?Connected Data World
 
Ontology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesOntology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesConnected Data World
 
Develop A Basic Recommendation System using Cypher
Develop A Basic Recommendation System using CypherDevelop A Basic Recommendation System using Cypher
Develop A Basic Recommendation System using CypherConnected Data World
 
A Semi-Automatic Tool for Linked Data Integration
A Semi-Automatic Tool for Linked Data IntegrationA Semi-Automatic Tool for Linked Data Integration
A Semi-Automatic Tool for Linked Data IntegrationConnected Data World
 
One Ontology, One Data Set, Multiple Shapes with SHACL
One Ontology, One Data Set, Multiple Shapes with SHACLOne Ontology, One Data Set, Multiple Shapes with SHACL
One Ontology, One Data Set, Multiple Shapes with SHACLConnected Data World
 
Dow Jones: Reimagining the News as a Knowledge Graph
Dow Jones: Reimagining the News as a Knowledge GraphDow Jones: Reimagining the News as a Knowledge Graph
Dow Jones: Reimagining the News as a Knowledge GraphConnected Data World
 

Plus de Connected Data World (20)

Systems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van HarmelenSystems that learn and reason | Frank Van Harmelen
Systems that learn and reason | Frank Van Harmelen
 
Graph Abstractions Matter by Ora Lassila
Graph Abstractions Matter by Ora LassilaGraph Abstractions Matter by Ora Lassila
Graph Abstractions Matter by Ora Lassila
 
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
Κnowledge Architecture: Combining Strategy, Data Science and Information Arch...
 
How to get started with Graph Machine Learning
How to get started with Graph Machine LearningHow to get started with Graph Machine Learning
How to get started with Graph Machine Learning
 
Graphs in sustainable finance
Graphs in sustainable financeGraphs in sustainable finance
Graphs in sustainable finance
 
From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3From Taxonomies and Schemas to Knowledge Graphs: Part 3
From Taxonomies and Schemas to Knowledge Graphs: Part 3
 
In Search of the Universal Data Model
In Search of the Universal Data ModelIn Search of the Universal Data Model
In Search of the Universal Data Model
 
Graph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph DatabaseGraph in Apache Cassandra. The World’s Most Scalable Graph Database
Graph in Apache Cassandra. The World’s Most Scalable Graph Database
 
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
Powering Question-Driven Problem Solving to Improve the Chances of Finding Ne...
 
Semantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scaleSemantic similarity for faster Knowledge Graph delivery at scale
Semantic similarity for faster Knowledge Graph delivery at scale
 
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
Knowledge Graphs and AI to Hyper-Personalise the Fashion Retail Experience at...
 
Schema, Google & The Future of the Web
Schema, Google & The Future of the WebSchema, Google & The Future of the Web
Schema, Google & The Future of the Web
 
Elegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property GraphsElegant and Scalable Code Querying with Code Property Graphs
Elegant and Scalable Code Querying with Code Property Graphs
 
Graph for Good: Empowering your NGO
Graph for Good: Empowering your NGOGraph for Good: Empowering your NGO
Graph for Good: Empowering your NGO
 
What are we Talking About, When we Talk About Ontology?
What are we Talking About, When we Talk About Ontology?What are we Talking About, When we Talk About Ontology?
What are we Talking About, When we Talk About Ontology?
 
Ontology Services for the Biomedical Sciences
Ontology Services for the Biomedical SciencesOntology Services for the Biomedical Sciences
Ontology Services for the Biomedical Sciences
 
Develop A Basic Recommendation System using Cypher
Develop A Basic Recommendation System using CypherDevelop A Basic Recommendation System using Cypher
Develop A Basic Recommendation System using Cypher
 
A Semi-Automatic Tool for Linked Data Integration
A Semi-Automatic Tool for Linked Data IntegrationA Semi-Automatic Tool for Linked Data Integration
A Semi-Automatic Tool for Linked Data Integration
 
One Ontology, One Data Set, Multiple Shapes with SHACL
One Ontology, One Data Set, Multiple Shapes with SHACLOne Ontology, One Data Set, Multiple Shapes with SHACL
One Ontology, One Data Set, Multiple Shapes with SHACL
 
Dow Jones: Reimagining the News as a Knowledge Graph
Dow Jones: Reimagining the News as a Knowledge GraphDow Jones: Reimagining the News as a Knowledge Graph
Dow Jones: Reimagining the News as a Knowledge Graph
 

Dernier

办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一F sss
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDRafezzaman
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxUnduhUnggah1
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degreeyuu sss
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...ssuserf63bd7
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Business Analytics using Microsoft Excel
Business Analytics using Microsoft ExcelBusiness Analytics using Microsoft Excel
Business Analytics using Microsoft Excelysmaelreyes
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfBoston Institute of Analytics
 

Dernier (20)

办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
 
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTDINTERNSHIP ON PURBASHA COMPOSITE TEX LTD
INTERNSHIP ON PURBASHA COMPOSITE TEX LTD
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docx
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
办美国阿肯色大学小石城分校毕业证成绩单pdf电子版制作修改#真实留信入库#永久存档#真实可查#diploma#degree
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Business Analytics using Microsoft Excel
Business Analytics using Microsoft ExcelBusiness Analytics using Microsoft Excel
Business Analytics using Microsoft Excel
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdfPredicting Salary Using Data Science: A Comprehensive Analysis.pdf
Predicting Salary Using Data Science: A Comprehensive Analysis.pdf
 

Graph Realities

  • 2.
  • 3. Personal Background • applied math, machine learning, distributed systems • R&D for neural networks, incl. hardware (1986-1997) • “guinea pig” for early cloud (2005-ff) • led data teams in industry • assisted popular open source projects: 
 Spark, Jupyter, etc. • development focus on natural language plus adjacent knowledge graph use cases • since 2018, increasingly working at the intersection 
 of public sector + enterprise + open source
  • 4. Motivations Not all that long ago, graph applications were considered exotic and expensive. Until recently, few software engineers had much experience putting graphs to work; however, those use cases have now become much more commonplace. This talk explores a practical use case, one that addresses key issues of data governance and reproducible research, and depends on sophisticated use of graph technology. First, some perspectives and industry analysis…
  • 6. Perspectives • the ubiquity of linked data • the tyranny of “thinking relational” • the primacy of working with graphs 
 (and their math analog, tensors) • nouns vs. verbs vs. adjectives
 (extreme nominalization) • evolution of hardware, cloud, 
 and cluster topologies • the power of graph embeddings
  • 7. Historical Context “Data Science: Past and Future”
 Rev 2 (2019-05-24) slides “What is Data Science?”
 IBM Data Science Community (2019-03-04) Just Enough Math
 O’Reilly Media (2014) • John Tukey: data analytics as an intrinsically empirical 
 and interdisciplinary field (1962) • most popular data frameworks leveraged some graph 
 processing, albeit obscured, ad-hoc, clumsy… • they did well, given the hardware available at the time
  • 8. Beauty in sparsity… SuiteSparse Matrix Collection: 
 a widely used set of sparse matrix benchmarks collected from a wide 
 range of applications sparse.tamu.edu/ …for when you really, really, need some interesting graph data
  • 9. Theme 1: Stuffing graphs into matrices algebraic graph theory allowed reuse of linear algebra impl v u w x u v w x u 0 1 0 1 v 1 0 1 1 w 0 1 0 1 x 1 1 1 0 • e.g., transform graph to an adjacency matrix • most will be relatively sparse • use LINPACK, BLAS, or libraries built atop • much to leverage: SVD, power method, QR decomp, etc.
  • 10. Theme 1: Stuffing graphs into matrices for many real-world problems, the data are essentially graphs 1. real-world data 2. graph theory for representation 3. convert to sparse matrix for production 4. cost-effective parallel processing at scale ergo, leverage low dimensional structure in high dimensional data
  • 11. N Dims good, 2 Dims baa-d However, complex graphs cannot be represented as 2D matrices without serious information loss. Ideally, tensors would be a better representation 
 to use for linear algebra libraries. While tensor decomposition is a hard problem, 
 the general class of problems became much 
 more interesting after 2012…
  • 12. N Dims good, 2 Dims baa-d However, complex graphs cannot be represented as 2D matrices without serious information loss. Ideally, tensors would be a better representation 
 to use for linear algebra libraries. While tensor decomposition is a hard problem, 
 the general class of problems became much 
 more interesting after 2012… “The real problem is that programmers have spent far too
 much time worrying about efficiency in the wrong place
 and at the wrong times; premature optimization is the
 root of all evil (or at least most of it) in programming.” Don Knuth
  • 13. Theme 2: Nouns, Verbs, Adjectives Tracing back to the origins of relational databases, 
 Edgar Codd was furious about how badly SQL and RDBMS had misinterpreted his mathematical modeling of relations. Years of EDW reinforced a sense of an extreme nominalization with so much of the data representation being reduced into dimensions, facts, indexes
  • 14. Theme 2: Nouns, Verbs, Adjectives a carry-over of extreme nominalization into graph DBs also
 over-emphasizes the role of nodes and centrality for adjusting the granularity of graph representations: • discounts the importance of relations • “mostly nouns, a few verbs, some adjectives” • serious information loss IMO: graph DB frameworks tend to err in this aspect, 
 both in terms of representation and algorithm support.
  • 15. Part of a long-term narrative arc in IT… • arguably, circa 2001 was the heyday of DW+BI – later 
 acting as an “embedded institution” w.r.t. data science • Agile Manifesto became another “embedded institution” • a generation of developers equated “database” with “relational”,
 with a belief that legibility of systems == legibility of the data • even so, first-movers collectively made a sudden turn 
 toward NoSQL, partly in reaction to RDBMS pricing • see also: 
 “Statistical Modeling: The Two Cultures”
 Leo Breiman UC Berkeley (2001)
  • 16. Adjusting data resolution in graphs In contrast, consider: “Extracting the multiscale backbone of complex weighted networks” M. Ángeles Serrano, Marián Boguña, Alessandro Vespignani
 PNAS (2009-04-21) Filtering large noisy graphs based on both 
 nodes and edges can be useful for automated 
 approaches in knowledge graph construction, 
 see: github.com/DerwenAI/disparity_filter
  • 17. An emerging trend disrupts the past 15-20 years 
 of software engineering practice: hardware > software > process Hardware is now evolving more rapidly than software, which is evolving more rapidly than effective process Moore’s Law is all but dead, although ironically 
 many inefficiencies had been based on it See also: Pete Warden (2018) regarding
 TensorFlow.js on low-power devices Theme 3: Hardware in perspective
  • 18. Theme 3: Evolution of cloud patterns UC Berkeley published a 2009 report about early use cases for cloud computing, which foresaw the shape of industry deployments over much of the next decade, and led directly to Apache Mesos and Apache Spark It’s fascinating to study the contrasts between that 2009 report and its 2019 follow-up. (minor footnote: vimeo.com/3616394) 2009
  • 19. Theme 3: Evolution of cloud patterns Early cloud was intentionally “dumbed down” to resemble popular virtualization software – recognizable by IT staff – to support migration. That approach is no longer needed. Also, the physics + economics of cloud use tend to imply less “framework” layers. More contemporary patterns will force a restructuring – for efficiency and security – 
 i.e., decoupling computation and storage. 2019
  • 20. Theme 3: Cluster topologies, by generation Opinion: one problem with software/hardware interface for distributed systems is that it’s taken decades to prioritize the need for handling graphs/tensors directly within popular, accessible open source libraries, without having some commercial database vendor intermediate. 1990s mid-2000s current
  • 21. Theme 3: Cluster topologies, by generation 1990s mid-2000s current see also: Jeff Dean (2013) youtu.be/S9twUcX1Zp0 NB: graph
  • 23. “Two Cultures” for AI A more useful distinction: • ML is about the tools and technologies • AI is about use case impact on social systems
  • 24. Industry surveys for AI and Cloud adoption • “Three surveys of AI adoption reveal key advice 
 from more mature practices”
 Ben Lorica, Paco Nathan O’Reilly Media (2019-02-20) • Episode 7, Domino: surveying “ABC” adoption in enterprise
 (2019-03-03)
  • 25. Trends: Knowledge Graphs We found that close to one quarter of respondents were using knowledge graphs.

  • 26. Trends: Knowledge Graphs Healthcare is adopting use 
 of knowledge graphs more 
 so than in Finance. 

  • 27. Trends: Knowledge Graphs Mature practices show more interest in use of knowledge graphs than firms which are 
 still evaluating ML use cases.
  • 28. Trends: an accelerating gap in AI funding Note: firms with early advantage are investing more, moving still further away from the pack.

  • 29. Overview of Data Governance Paco Nathan @pacoid Overview of Data Governance derwen.ai/s/6fqt cloud 3 cloud 2 cloud 1 security security security compliance db mobile devices mobile devices mobile devices edge cache web servers dw business analytics dat govdurable store logs models other data sci workflows cluster compute external data external data external data edge inference edge inference edge inference models streaming data dat gov dat gov dat gov dat gov dat gov dat gov dat gov we noted a resurgence in data 
 governance – this report examines
 key themes, vendors, issues, etc.
  • 30. Unpacking AutoML derwen.ai/s/yvkg we noted an uptick in adoption for a third aspect, co-evolving along with DG and MLOps meta-learning feature selection hyperparameter optimization model selection auto scaling feature engineering train models evaluate results integrate, deploy data prep data platform usecases
  • 31. Data Gov dovetails with MLOps and AutoML meta-learning feature selection hyperparameter optimization model selection auto scaling feature engineering train models evaluate results integrate, deploy data prep data platform usecases data gov trends augment AutoML data gov practices follow MLOps
  • 32. Emerging category: watch the “AI Natives” Projects (mostly OSS) that leverage knowledge graph 
 of metadata about datasets and their usage: • Amundsen @ Lyft
 data discovery and metadata • Databook @ Uber
 manage metadata about datasets (pending OSS) • Marquez @ WeWork, Stitch Fix
 collect, aggregate, visualize metadata • Data Hub @ LinkedIn
 data discovery and lineage • Metcat @ Netflix
 data discovery, metadata service • Dataportal @ Airbnb
 integrated data-space (not OSS)
  • 33. part 3:
 case study – rich context
  • 34. Administrative Data Research Facility Coleridge Initiative
 Julia Lane, et al. NYU Wagner • FedRAMP-compliant ADRF framework on AWS GovCloud: 
 “public agency capacity to accelerate the effective use of 
 new datasets” • for research projects using cross-agency sensitive data, 
 in US and EU (and UK) – now in use by 15+ agencies • cited as the first federal example of Secure Access to Confidential Data in the final report of the Commission 
 on Evidence-Based Policymaking • augments Data Stewardship practices; collaboration 
 with Project Jupyter on the related data gov features • funding by Schmidt Futures, Sloan, Overdeck
  • 35. ADRF and Rich Context Coleridge Initiative
 Julia Lane, et al. NYU Wagner • Rich Context: knowledge graph of metadata about datasets, used for entity linking, link prediction, recommendations, etc. • benefits: agencies, researchers, publishers, data stewards, data providers – see white paper • ongoing ML competition for linking research publications with dataset attribution (first comp. won by Allen AI) • see “Human-in-the-loop AI for scholarly infrastructure” • upcoming book:
 Rich Search and Discovery for Research Datasets: Building the next generation of scholarly infrastructure
  • 36. AI for Scholarly Infrastructure Rich Context overall scope leaderboard competition publisher use cases HITL: RePEC, etc. authors accept/reject links models infer links corpus research pubs leaderboard evals results inferred linked data 1 2 3 • collaboration with SAGE Pub, Digital Science,
 RePEc, etc.; partnering with Bundesbank (EU) • knowledge graph vocabulary integrates W3C metadata standards: DCAT, PAV, DCMI, CITO, FaBiO, FOAF, etc. • data as a strategic asset: knowledge graph 
 produces an open corpus for the leaderboard competition • human-in-the-loop AI used to infer metadata 
 then confirm with authors via RePEC, etc. • adjacent work: graph embedding, meta-learning, persistent identifiers, reproducible research
  • 37.
  • 38.
  • 39. Related work at Project Jupyter Make datasets and projects top-level constructs, support metadata exchange and privacy-preserving telemetry from notebook usage (due Oct 2019): • JupyterLab Commenting and real-time collab 
 similar to Google Docs • JupyterLab Data Explorer: register datasets 
 within research projects • JupyterLab Metadata Explorer: browse metadata descriptions, get recommendations through knowledge graph inference (via extension) • Data Registry (original proposal) • Telemetry (privacy-preserving, reports usage)
  • 40. Related work at Project Jupyter
  • 41. Active Learning as a data strategy Experts decide about edge cases, providing examples Experts learn through Customer interactions Customers request Sales, Marketing, Service, Training Experts gain insights via Model explanations ML Models Models focus Experts (e.g., weak supervision) Organizational Learning Human Experts Examples, Actions Customers Models act on decisions when possible Customer Use Cases Models explore uncertainty when needed derwen.ai/s/d8b7 teams of people + machines,
 leveraging the complementary
 strengths of both
  • 42. Parting thought In many ways, we’re at a point in the industry with graph data – particularly for use of knowledge graph of metadata about dataset usage – which resembles conditions immediately before “Web 2.0” became big news. The emerging category of “AI natives” projects mentioned earlier could be parlayed into data utilities more flexible than the AI services which the current hyperscalers are fielding. Watch this space.
  • 43. Just Enough Math Rich Context Hylbert-Speys Themes + Confs per Pacoid publicaXons, interviews, conference summaries… https://derwen.ai/paco
 @pacoid Rev conf