Industry Models – Visualisation & Graph DB
May 2017
Michal Miklas
2
Reasons for interest in Graph DB: Exploration of new content delivery formats and tools for
content consumption
 Use of a graph DB as the store for all glossaries, for the metadata of the organisation's data stores, and for their relationships
 Common independent interface for all services, apps and users
 Flexible schema (does not need to be predefined), easily extensible
 Consumable – easy access to the data via common API/interface
 Visualisation – a business user friendly tool for data exploration, navigation,
search and analysis
 Easy integration of Industry Models content (or content derived from it) with the customer's collection of glossaries, vocabularies, ontologies and other models
 Not seen as a replacement for the main formats & content authoring/management tools, but rather as a complementary "read-only platform", mainly beneficial for business users and analysts
 Current and future work will stay flexible & compatible with the main software tools: Titan/JanusGraph with TinkerPop, IBM Graph, and Neo4j
3
Industry Model Common Components
Industry Models
 Vocabulary
• Business Terms: industry concepts in plain business language, with no modeling. Business Terms are organized by Business Categories. Their mapping to the data models allows requirements to be transformed into IT data structures.
• Analytical Requirements: high-level groups of business information that express business Measures along axes of analysis, which are named Dimensions.
• Supportive Content: groupings of terms incorporating any terminology originating from an internal or external source. It is used to support data structures such as regulatory reports, industry standards, business architecture standards, vendor interfaces, or legacy source systems.
 Data Models
• Business Data Model: a highly normalized conceptual data model that is an enterprise-wide, generic, and flexible data representation of informational systems.
• Atomic Warehouse Model: a normalized design-level data model representing the repository of atomic data used for informational processing.
• Dimensional Warehouse Model: a design-level dimensional model representing the repository of analytical data. It contains star schemas supporting the Analytical Requirements.
4
Tools Currently Used & Supported
 Data Modelling tools:
• InfoSphere Data Architect
• Erwin Data Modeler
 Data Governance tools:
• InfoSphere Information Governance Catalog (IGC): Business Glossaries, Models and other metadata
5
Graph Databases & Graph Data Visualisation tools
 Graph Databases
• Neo4j Community Edition (GPL v3 license)
• Neo4j Enterprise Edition (Evaluation & Commercial License)
• Titan (Apache License 2.0)
• JanusGraph (Apache License 2.0 & Creative Commons Attribution 4.0 International)
• IBM Graph managed service on Bluemix (built on the Titan/TinkerPop stack)
 Graph computing framework
• TinkerPop (Apache License 2.0), embedded in Titan/JanusGraph, but can also be used as a standalone graph DB for demos and small projects using the in-memory TinkerGraph
 Graph data visualisation
• Neo4j Browser (same licensing as above) - part of the Neo4j Graph DB
• Linkurious Enterprise (commercial license)
• vis.js JavaScript visualisation library (Apache 2.0 and MIT)
6
Architecture options
7
Energy & Utilities - Outage - Neo4j
8
Energy & Utilities - Outage - Linkurious
9
Energy & Utilities - Outage - Vis.JS
10
Example of Graph DB schema model
11
Content Transformation
 IM artifact formats
• IGC export in XML
• Logical Model (LDM) files, which are XML files
 Transformation is split into two steps (fully working script prototype in PowerShell)
The split makes it possible to bring any customer's data into the mix in an easy-to-understand format: a collection of CSV files in two folders, nodes and edges. Each file represents a different node or edge type, with its properties as CSV columns (each node type can have a different set of properties/columns). The only mandatory fields are ID and name.
1. XML (LDM & IGC) to CSV transform
2. CSV to GraphML transform
• also produces the Graph DB schema model in JSON format (for IBM Graph on Bluemix)
• also produces a Groovy script for schema creation in Titan/JanusGraph
 GraphML graph data format – includes the schema and all node and edge data
• a format importable into Titan/JanusGraph using the Gremlin/TinkerPop console
• TinkerPop/Gremlin can also be used to import into Neo4j
• the format is also recognized and supported by IBM Graph (currently with a limitation: the file size cannot exceed 10 MB)
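The second step (CSV to GraphML) can be sketched roughly as follows. This is a minimal Python illustration, not the PowerShell prototype mentioned above. It assumes that the CSV file name is the node/edge type, that edge files carry hypothetical `source` and `target` columns holding node IDs, and that all properties are written as strings. The `labelV`/`labelE` key names follow what is understood to be the TinkerPop GraphML convention; treat that as an assumption to verify against the target database.

```python
import csv
import xml.etree.ElementTree as ET
from pathlib import Path

GRAPHML_NS = "http://graphml.graphdrawing.org/xmlns"


def read_typed_rows(folder):
    """Return (type_name, row) pairs; the CSV file name (no extension) is the type."""
    rows = []
    for csv_file in sorted(Path(folder).glob("*.csv")):
        with csv_file.open(newline="", encoding="utf-8") as handle:
            rows.extend((csv_file.stem, row) for row in csv.DictReader(handle))
    return rows


def _declare_keys(root, rows, domain, prefix, label_key, skip):
    """Add one <key> per property column and return {column: key id}."""
    ET.SubElement(root, "key", {"id": label_key, "for": domain,
                                "attr.name": label_key, "attr.type": "string"})
    columns = sorted({col for _, row in rows for col in row
                      if col and col not in skip})
    key_ids = {col: prefix + col for col in columns}
    for col, key_id in key_ids.items():
        ET.SubElement(root, "key", {"id": key_id, "for": domain,
                                    "attr.name": col, "attr.type": "string"})
    return key_ids


def _add_data(element, key_id, value):
    ET.SubElement(element, "data", {"key": key_id}).text = value


def build_graphml(nodes, edges):
    """Build a GraphML tree from (type, row) pairs for nodes and edges."""
    root = ET.Element("graphml", {"xmlns": GRAPHML_NS})
    node_keys = _declare_keys(root, nodes, "node", "v_", "labelV", {"ID"})
    # "source" and "target" are assumed column names for the edge endpoints.
    edge_keys = _declare_keys(root, edges, "edge", "e_", "labelE",
                              {"ID", "source", "target"})
    graph = ET.SubElement(root, "graph", {"id": "G", "edgedefault": "directed"})

    for node_type, row in nodes:
        node = ET.SubElement(graph, "node", {"id": row["ID"]})
        _add_data(node, "labelV", node_type)
        for col, key_id in node_keys.items():
            if row.get(col):  # skip properties this node type does not have
                _add_data(node, key_id, row[col])

    for edge_type, row in edges:
        edge = ET.SubElement(graph, "edge", {"id": row["ID"],
                                             "source": row["source"],
                                             "target": row["target"]})
        _add_data(edge, "labelE", edge_type)
        for col, key_id in edge_keys.items():
            if row.get(col):
                _add_data(edge, key_id, row[col])
    return ET.ElementTree(root)
```

With the two folders in place, a call such as `build_graphml(read_typed_rows("nodes"), read_typed_rows("edges")).write("industry_model.graphml", encoding="utf-8", xml_declaration=True)` would write the file; the output file name is only an example.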
12
Meaning of Node types in Graph DB
 Logical model based types
• entity
• attribute
• package
• model
• diagram
 Physical model based types
• column
• table
 Glossary based types
• term
• category
13
Meaning of Edge types in Graph DB
 Assigns:
• term assignment to an asset: attribute/column/entity/table
 Belongs:
• describes parent object = ownership
• entity/table to package
• package to package
• package to model
• term to category
• category to category
• category to glossary
• diagram to package
 Describes:
• attribute describes entity
• column describes table
• term isOf another term
 Maps:
• attribute to attribute (calculation)
• attribute to attribute/entity (population in the same model or AWM->DWM)
• column to column/table (population in the same schema or AWM->DWM)
• note: Maps covers both population and calculation dependencies
• design transformation dependency:
• attribute to attribute/entity (from another model, e.g. BDM->AWM or BDM->DWM)
• table to entity
• column to attribute
 References:
• term to category: a term can be referenced by any number of categories (in addition to its owning category)
• entity/table to diagram: an entity or table can be referenced by any number of diagrams
 Relates:
• entity to entity relationship in ER models
• table to table relationship in ER models
• term to term relatedTerm
 Subtype:
• entity is a subtype of another entity (generalization/inheritance in ER models)
• term isTypeOf another term
 Synonym:
• one term is a synonym of another; because the graph is directed, the direction of this edge suggests a master-child relationship
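The edge types above can be read as a small schema of allowed (source type, target type) pairs. The following Python sketch encodes that reading. It assumes the direction follows the "X to Y" wording on the slide and uses lowercase type names. `ALLOWED_EDGES` and `is_valid_edge` are illustrative names, not part of any of the tools mentioned.

```python
# Asset types a glossary term can be assigned to.
ASSETS = ("attribute", "column", "entity", "table")

# Edge type -> allowed (source node type, target node type) pairs,
# transcribed from the list above under the "X to Y" direction assumption.
ALLOWED_EDGES = {
    "Assigns": {("term", asset) for asset in ASSETS},
    "Belongs": {("entity", "package"), ("table", "package"),
                ("package", "package"), ("package", "model"),
                ("term", "category"), ("category", "category"),
                ("category", "glossary"), ("diagram", "package")},
    "Describes": {("attribute", "entity"), ("column", "table"),
                  ("term", "term")},
    "Maps": {("attribute", "attribute"), ("attribute", "entity"),
             ("column", "column"), ("column", "table"),
             ("table", "entity"), ("column", "attribute")},
    "References": {("term", "category"), ("entity", "diagram"),
                   ("table", "diagram")},
    "Relates": {("entity", "entity"), ("table", "table"), ("term", "term")},
    "Subtype": {("entity", "entity"), ("term", "term")},
    "Synonym": {("term", "term")},
}


def is_valid_edge(edge_type, source_type, target_type):
    """Return True if the edge type may connect the two node types."""
    return (source_type, target_type) in ALLOWED_EDGES.get(edge_type, set())
```

A check like this could run on the CSV edge files before the GraphML is generated, so that a reversed or mistyped edge is caught early rather than after import.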
14
Common attributes across all edge & node types
 Taxonomy:
• Logical Business Data Model
• Logical Atomic Warehouse Model
• Logical Dimensional Warehouse Model
• Business Glossary
• Analytical Requirements
• Supportive Content
• Scopes
• Physical Dimensional Warehouse Model
• Physical Business Data Model
• Physical Atomic Warehouse Model
 Taxonomy Type:
• logicalModel
• physicalModel
• glossary
 Industry:
• banking
• insurance
• healthcare
• utilities
 Version
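As an illustration, these common attributes could be modelled as one small record attached to every node and edge. The Python sketch below assumes the field names `taxonomy`, `taxonomy_type`, `industry` and `version`, treats the listed Taxonomy Type and Industry values as closed sets, and treats the version as a free-form string. None of these choices is confirmed by the slides.

```python
from dataclasses import dataclass

# Allowed values as listed above; treating them as closed sets is an assumption.
TAXONOMY_TYPES = {"logicalModel", "physicalModel", "glossary"}
INDUSTRIES = {"banking", "insurance", "healthcare", "utilities"}


@dataclass(frozen=True)
class CommonAttributes:
    """Attributes shared by every node and edge (field names are hypothetical)."""
    taxonomy: str        # e.g. "Logical Atomic Warehouse Model"
    taxonomy_type: str   # one of TAXONOMY_TYPES
    industry: str        # one of INDUSTRIES, compared case-insensitively
    version: str         # free-form; the version format is not specified

    def __post_init__(self):
        if self.taxonomy_type not in TAXONOMY_TYPES:
            raise ValueError(f"unknown taxonomy type: {self.taxonomy_type!r}")
        if self.industry.lower() not in INDUSTRIES:
            raise ValueError(f"unknown industry: {self.industry!r}")
```

For example, `CommonAttributes("Business Glossary", "glossary", "utilities", "1.0")` is accepted, while an unlisted taxonomy type raises `ValueError`.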
