SlideShare a Scribd company logo
1 of 63
Surviving as a Data Architect in a
Polyglot Database World
Karen Lopez
@datachick
SpotTheStation.NASA.gov
Karen López
Karen has 20+ years of data and information architecture
experience on large, multi-project programs.
She is a frequent speaker on data modeling, data-driven
methodologies and pattern data models.
She wants you to love your data.
www.datamodel.com
POLL: Audience
Who are you?
Outcomes: Leave here understanding:
Why multi-
model
designs are
important
Database /
datastore
types
Multi-model
thinking
How to
future-proof
your data
arch career
How to
learn more
A Good Data Architect
Cost, Benefit,
Risk
Data
Protection
Business
Requirements
& models
Design
Models
Fit to needs
Polyglot
Polyschematic
What do you mean,
SURVIVE?
• Continue to be involved in data-
related architectural decisions
• Data models wanted and appreciated
• Data model driven development still
a thing
• Valued for bringing business needs
and database opportunities together
• Team player
• Knowledgeable of database features
What’s new
in the
database
world?
Hybrid
Hybrid
Hybrid
Hybrid
Purely relational (SQL)
databases don’t exist
any longer
• Columnstore features
• XML
• JSON
• Other NoSQL features
01
Applications make use
of multiple database
and datastore
technologies
02
Schemas in a variety of
places
03
Typical Characteristics of Traditional Apps
Real-time request/response
Synchronous
Always connected
Transactional
Scale-Up
Failover
Modern Cloud Apps Characteristics
Asynchronous / Queued
Eventual Consistency
Microservices
Command & Query Pattern
Polyglot Persistence
Do you support/model/do stuff
In relational technologies?
In pre-relational tech?
In post-relational (NoSQL) tech?
How’s that going for ya?
A very fast
review of
NoSQL & SQL
I have other recorded webinars at
Dataversity that go into more detail
Basically Available
Soft State
Eventually Consistent
BASE ACID
Atomic
Consistent
Isolated
Durable
BASE - ACID
This used to be an Either/Or decision
BASE
ACID
Polyglot
persistence
•Optimized for data
•Optimized for workload
Not all new
•EAV
•XML
•Architecture paradigm: OLAP/DW
and OLTP
But now….
NoSQL, Not Only SQL, SQL*
Relational Key Value Columnar
Column
Family
Document Hadoop Graph
Plus all the hybrid versions of these…
Relational
Partition Key Row Key Value
Marketing 00001
Name Age
Bob 35
Marketing 00002
Name Age
Karen 42
Marketing Department
Name Count
MKTG 104
Sales 00010
Name Age
Sam 29
• Simple Map
• Semi-Structured
• De-normalized
Key-Value Databases
Cassandra Redis
Oracle
NoSQL
CosmosDB
Key-Value
• No schema, no relationships
• Collections of documents
• Documents are JSON objects
Document databases
{
"title": "Polo Long Sleeve Shirt",
"color": "red",
"material": "cotton"
}
{
"title": "NoSQL Distilled,
"author": [ "Martin Fowler", “P. Sadalage" ],
"isbn": "978-0321826626",
"pages": 192
}
Document
CosmosDB MongoDB BSON &
JSON
Databases,
Documents,
Collections
• Map-based
• Relationships of Edges and Nodes
Graph databases
Graph
Neo4j
DataStax
Enterprise
Graph
SQL Server CosmosDB
Wait…What?
Graph Database and Processing inside relational database engine
Column Family
Columnar
HP Vertica SAP IQ
This means…
• SQL vs. NoSQL isn’t a thing any longer
• Other RDBMS vendors will be adding non-
relational functions and structures
• Data Architects who specialize in
relational-only modeling and design will
be overly specialized
• We don’t have a choice to ignore these
new database models
Most relational databases are adding NoSQL
support
Columnstore
Graph
XML
JSON
You’ve probably
even developed
your own as a
table
Hybrid is the future,
though
• Column Family + Key Value
• Relational + Graph + Columnstore
• Relational + Columnstore
• Graph + Column Family + Key
Value + Document
…all in the same engine. This is a
major change in how things worked
a decade ago.
Did you notice?
• Database as a Service from
Microsoft
• Supports (for now)
• Graph
• Key Value
• Column Family
• Document
• Globally distributed data
• Scalable
• Tunable Consistency
CosmosDB
But what does this all mean?
Data Modeling process impacts
Data Modeler impacts
Quotes from team
NoSQL
• No need for a data model
• Can’t be mapped to a data
model
• SQL is not flexible
• SQL Databases don’t scale
• SQL Databases die after about 1
GB size
• SQL Databases use old
technology
Quotes from team
SQL (relational)
• Eventual Consistency? WTH?
• What do you mean data quality
just doesn’t matter?
• My tools won’t work with these
databases
• It’s all a fad; There will be a new
fad tomorrow
• Bbbbbut…but..but DATA
QUALITY!!!!! INTEGRITY!!!
What is the role of existing Data Models?
Data Models – Traditional Process
Conceptual
(Data)
Model
Logical
Data Model
Physical
Data
Model(s) OLTP
OLTP
OLTP OLTP
OLTP
MARTMART
OLTP
OLTP
OLTP
What about the
Modeling Tasks
& methods
Schema
Schemaless
Schema on read
Polyschematic
Schemas
• In the database as physical
structure
• In a separate location such as
• DSD
• Schema layer on top of the physical
structure
• In the application code
• In the data itself
New Process
Apply
Apply logical data model understandings to physical structure
• Using data modeling and/or tools
• Using a whiteboard
• Using code
Understand Understand data
Understand Understand underlying architecture
Traditional Data Modeler Involvement
Project Initiation
Architecture and
Infrastructure
Design
SW Requirements
Development
Deployment
Modern Data Modeler Involvement
Project Initiation
Architecture and
Infrastructure
Design
SW Requirements
Development
Deployment
What do we really mean by scale?
Rapidly add more compute & data power
Massively parallel processing
Cheap, commodity hardware, but lots of it
Optimized for
Query/Reads/Questions/Telling stories
Most NoSQL
• Was developed with scale in mind
• Has tunable consistency
• Scale out, versus scale up
• Uses distributed data (multiple
copies), globally
• Understanding workloads and
workload trends is important
This metadata isn’t often collected
and modeled by data architect
Where Data
Models Can
Help
• reverse engineer, where possible.
• one entity per tab spreadsheet
• some Normalized (master data +
combinations)
Create a
Physical Data
Model of Key
Data Sources
• definitions
• gotchas
• expected domains
• metadata
Model the
metadata
• Describe data consistently
• Don’t prescribe structure
• Naming standards
• Datatype-ish
Think
Differently
Think Differently
• Data Modeling tools have to catch up
• New tools are being developed
• Data Models can’t be prescriptions
all the time
• Naming standards over constraints
• Data Types are general, higher level
• Tunable data quality
How are you
transitioning
to “different”?
Entity-Relationship
Modeling & IDEF1X
Is it enough?
Should we extend it?
Should we create new notations for each
type of Database?
Scrap and start all of it again?
What about all those very sexy data
modelers?
“Every design decision should
include cost, benefit and risk”
- Karen Lopez
Great Enterprise Data Modelers Characteristics
Patient
Zen balance / non-attachment
Get-er done
Collaboration not sentries/cops
Broad Business Domain
Broad, strategic view
Deep, project empathy
Respected
Good project experience
Good negotiators
52
So let’s summarize:
1. The more SQL-like features available for NoSQL
databases, the more likely a data modeling tool is to
support it.
2. Modeling tool vendors will support features that users
ask for cause them to win deals. This is not a bad thing.
3. Serious NoSQL vendors* understand that hybrid is the
enterprise data story. They want us to find a way.
4. Our data models have value, even if the NoSQL solution
doesn’t require a lot of constraints.
10 Tips for Data Modelers
1. Learn about these methods – don’t avoid them
2. Get hands-on training. Get certified even
3. Learn the lingo
4. Use the lingo
5. Be able to describe data modeling and data
governance to the context of these database
technologies and their use cases
10 Tips for Data Modelers
6. Bring data models (and other models) to the team
7. Be ahead of the curve on new NoSQL and SQL
features in your DBMSs
8. Understand the use cases for each type of database
technology
9. Let go, a little, when it makes sense
10. Enjoy the new database smell!
What to do
Learn
Get Hands-on
experience with
these new data
technologies
Talk to your tool
vendors
Bring models
you have
Resources
Making Sense of NoSQL clearly and
concisely explains the concepts,
features, benefits, potential, and
limitations of NoSQL technologies.
Using examples and use cases,
illustrations, and plain, jargon-free
writing, this guide shows how you
can effectively assemble a NoSQL
solution to replace or augment the
traditional RDBMS you have now.
And it’s FREE!
GraphDatabases.com
This book is written for anyone who
is working with, or will be working
with MongoDB, including business
analysts, data modelers, database
administrators, developers, project
managers, and data scientists.
PostgreSQL
Riak
Hbase
MongoDB
Neo4J
CouchDB
Redis
Questions?
•
#TEAMDATA
www.datamodel.com
Thank you, you were
great.

More Related Content

What's hot

5 Data Modeling for NoSQL 1/2
5 Data Modeling for NoSQL 1/25 Data Modeling for NoSQL 1/2
5 Data Modeling for NoSQL 1/2Fabio Fumarola
 
How does Microsoft solve Big Data?
How does Microsoft solve Big Data?How does Microsoft solve Big Data?
How does Microsoft solve Big Data?James Serra
 
Transitioning to a BI Role
Transitioning to a BI RoleTransitioning to a BI Role
Transitioning to a BI RoleJames Serra
 
Making Sense of Schema on Read
Making Sense of Schema on ReadMaking Sense of Schema on Read
Making Sense of Schema on ReadKent Graziano
 
Introduction to Machine Learning on Azure
Introduction to Machine Learning on AzureIntroduction to Machine Learning on Azure
Introduction to Machine Learning on AzureAntonios Chatzipavlis
 
Azure Databricks is Easier Than You Think
Azure Databricks is Easier Than You ThinkAzure Databricks is Easier Than You Think
Azure Databricks is Easier Than You ThinkIke Ellis
 
Schemaless Databases
Schemaless DatabasesSchemaless Databases
Schemaless DatabasesDan Gunter
 
Azure Synapse Analytics Overview (r1)
Azure Synapse Analytics Overview (r1)Azure Synapse Analytics Overview (r1)
Azure Synapse Analytics Overview (r1)James Serra
 
Data modeling trends for analytics
Data modeling trends for analyticsData modeling trends for analytics
Data modeling trends for analyticsIke Ellis
 
What's new in SQL Server 2016
What's new in SQL Server 2016What's new in SQL Server 2016
What's new in SQL Server 2016James Serra
 
Introduction to Azure DocumentDB
Introduction to Azure DocumentDBIntroduction to Azure DocumentDB
Introduction to Azure DocumentDBDenny Lee
 
Azure Cosmos DB + Gremlin API in Action
Azure Cosmos DB + Gremlin API in ActionAzure Cosmos DB + Gremlin API in Action
Azure Cosmos DB + Gremlin API in ActionDenys Chamberland
 
NoSQL Tel Aviv Meetup#1: NoSQL Data Modeling
NoSQL Tel Aviv Meetup#1: NoSQL Data ModelingNoSQL Tel Aviv Meetup#1: NoSQL Data Modeling
NoSQL Tel Aviv Meetup#1: NoSQL Data ModelingNoSQL TLV
 
OLAP on the Cloud with Azure Databricks and Azure Synapse
OLAP on the Cloud with Azure Databricks and Azure SynapseOLAP on the Cloud with Azure Databricks and Azure Synapse
OLAP on the Cloud with Azure Databricks and Azure SynapseAtScale
 
Designing a modern data warehouse in azure
Designing a modern data warehouse in azure   Designing a modern data warehouse in azure
Designing a modern data warehouse in azure Antonios Chatzipavlis
 
Should I move my database to the cloud?
Should I move my database to the cloud?Should I move my database to the cloud?
Should I move my database to the cloud?James Serra
 
SQLBits X Scaling out with SQL Azure Federations
SQLBits X Scaling out with SQL Azure FederationsSQLBits X Scaling out with SQL Azure Federations
SQLBits X Scaling out with SQL Azure FederationsMichael Rys
 

What's hot (20)

5 Data Modeling for NoSQL 1/2
5 Data Modeling for NoSQL 1/25 Data Modeling for NoSQL 1/2
5 Data Modeling for NoSQL 1/2
 
How does Microsoft solve Big Data?
How does Microsoft solve Big Data?How does Microsoft solve Big Data?
How does Microsoft solve Big Data?
 
Transitioning to a BI Role
Transitioning to a BI RoleTransitioning to a BI Role
Transitioning to a BI Role
 
Making Sense of Schema on Read
Making Sense of Schema on ReadMaking Sense of Schema on Read
Making Sense of Schema on Read
 
Introduction to Machine Learning on Azure
Introduction to Machine Learning on AzureIntroduction to Machine Learning on Azure
Introduction to Machine Learning on Azure
 
Azure Databricks is Easier Than You Think
Azure Databricks is Easier Than You ThinkAzure Databricks is Easier Than You Think
Azure Databricks is Easier Than You Think
 
Introduction to Azure Data Lake
Introduction to Azure Data LakeIntroduction to Azure Data Lake
Introduction to Azure Data Lake
 
Schemaless Databases
Schemaless DatabasesSchemaless Databases
Schemaless Databases
 
Nosql data models
Nosql data modelsNosql data models
Nosql data models
 
Azure Synapse Analytics Overview (r1)
Azure Synapse Analytics Overview (r1)Azure Synapse Analytics Overview (r1)
Azure Synapse Analytics Overview (r1)
 
Data modeling trends for analytics
Data modeling trends for analyticsData modeling trends for analytics
Data modeling trends for analytics
 
What's new in SQL Server 2016
What's new in SQL Server 2016What's new in SQL Server 2016
What's new in SQL Server 2016
 
Introduction to Azure DocumentDB
Introduction to Azure DocumentDBIntroduction to Azure DocumentDB
Introduction to Azure DocumentDB
 
Azure Cosmos DB + Gremlin API in Action
Azure Cosmos DB + Gremlin API in ActionAzure Cosmos DB + Gremlin API in Action
Azure Cosmos DB + Gremlin API in Action
 
NoSQL Tel Aviv Meetup#1: NoSQL Data Modeling
NoSQL Tel Aviv Meetup#1: NoSQL Data ModelingNoSQL Tel Aviv Meetup#1: NoSQL Data Modeling
NoSQL Tel Aviv Meetup#1: NoSQL Data Modeling
 
OLAP on the Cloud with Azure Databricks and Azure Synapse
OLAP on the Cloud with Azure Databricks and Azure SynapseOLAP on the Cloud with Azure Databricks and Azure Synapse
OLAP on the Cloud with Azure Databricks and Azure Synapse
 
Designing a modern data warehouse in azure
Designing a modern data warehouse in azure   Designing a modern data warehouse in azure
Designing a modern data warehouse in azure
 
Should I move my database to the cloud?
Should I move my database to the cloud?Should I move my database to the cloud?
Should I move my database to the cloud?
 
SQLBits X Scaling out with SQL Azure Federations
SQLBits X Scaling out with SQL Azure FederationsSQLBits X Scaling out with SQL Azure Federations
SQLBits X Scaling out with SQL Azure Federations
 
Microsoft azure documentDB
Microsoft azure documentDBMicrosoft azure documentDB
Microsoft azure documentDB
 

Similar to How to Survive as a Data Architect in a Polyglot Database World

GraphTalks Rome - Selecting the right Technology
GraphTalks Rome - Selecting the right TechnologyGraphTalks Rome - Selecting the right Technology
GraphTalks Rome - Selecting the right TechnologyNeo4j
 
NoSQLDatabases
NoSQLDatabasesNoSQLDatabases
NoSQLDatabasesAdi Challa
 
Big Data Expo 2015 - Barnsten Why Data Modelling is Essential
Big Data Expo 2015 - Barnsten Why Data Modelling is EssentialBig Data Expo 2015 - Barnsten Why Data Modelling is Essential
Big Data Expo 2015 - Barnsten Why Data Modelling is EssentialBigDataExpo
 
Modeling Webinar: State of the Union for Data Innovation - 2016
Modeling Webinar: State of the Union for Data Innovation - 2016Modeling Webinar: State of the Union for Data Innovation - 2016
Modeling Webinar: State of the Union for Data Innovation - 2016DATAVERSITY
 
Transform your DBMS to drive engagement innovation with Big Data
Transform your DBMS to drive engagement innovation with Big DataTransform your DBMS to drive engagement innovation with Big Data
Transform your DBMS to drive engagement innovation with Big DataAshnikbiz
 
Steps towards business intelligence
Steps towards business intelligenceSteps towards business intelligence
Steps towards business intelligenceAhsan Kabir
 
Evolution of Distributed Database Technologies in the Digital era
Evolution of Distributed Database Technologies in the Digital eraEvolution of Distributed Database Technologies in the Digital era
Evolution of Distributed Database Technologies in the Digital eraVishal Puri
 
Relational databases vs Non-relational databases
Relational databases vs Non-relational databasesRelational databases vs Non-relational databases
Relational databases vs Non-relational databasesJames Serra
 
Hadoop meets Agile! - An Agile Big Data Model
Hadoop meets Agile! - An Agile Big Data ModelHadoop meets Agile! - An Agile Big Data Model
Hadoop meets Agile! - An Agile Big Data ModelUwe Printz
 
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?Denodo
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)James Serra
 
What is Data as a Service by T-Mobile Principle Technical PM
What is Data as a Service by T-Mobile Principle Technical PMWhat is Data as a Service by T-Mobile Principle Technical PM
What is Data as a Service by T-Mobile Principle Technical PMProduct School
 
Introduction to Bigdata and NoSQL
Introduction to Bigdata and NoSQLIntroduction to Bigdata and NoSQL
Introduction to Bigdata and NoSQLTushar Shende
 
Mastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkMastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkCaserta
 
How to use Big Data and Data Lake concept in business using Hadoop and Spark...
 How to use Big Data and Data Lake concept in business using Hadoop and Spark... How to use Big Data and Data Lake concept in business using Hadoop and Spark...
How to use Big Data and Data Lake concept in business using Hadoop and Spark...Institute of Contemporary Sciences
 
Evolution of the DBA to Data Platform Administrator/Specialist
Evolution of the DBA to Data Platform Administrator/SpecialistEvolution of the DBA to Data Platform Administrator/Specialist
Evolution of the DBA to Data Platform Administrator/SpecialistTony Rogerson
 
Managing Large Amounts of Data with Salesforce
Managing Large Amounts of Data with SalesforceManaging Large Amounts of Data with Salesforce
Managing Large Amounts of Data with SalesforceSense Corp
 
Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...
Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...
Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...IDERA Software
 

Similar to How to Survive as a Data Architect in a Polyglot Database World (20)

GraphTalks Rome - Selecting the right Technology
GraphTalks Rome - Selecting the right TechnologyGraphTalks Rome - Selecting the right Technology
GraphTalks Rome - Selecting the right Technology
 
NoSQLDatabases
NoSQLDatabasesNoSQLDatabases
NoSQLDatabases
 
Big Data Expo 2015 - Barnsten Why Data Modelling is Essential
Big Data Expo 2015 - Barnsten Why Data Modelling is EssentialBig Data Expo 2015 - Barnsten Why Data Modelling is Essential
Big Data Expo 2015 - Barnsten Why Data Modelling is Essential
 
NoSql Brownbag
NoSql BrownbagNoSql Brownbag
NoSql Brownbag
 
Modeling Webinar: State of the Union for Data Innovation - 2016
Modeling Webinar: State of the Union for Data Innovation - 2016Modeling Webinar: State of the Union for Data Innovation - 2016
Modeling Webinar: State of the Union for Data Innovation - 2016
 
NoSQL
NoSQLNoSQL
NoSQL
 
Transform your DBMS to drive engagement innovation with Big Data
Transform your DBMS to drive engagement innovation with Big DataTransform your DBMS to drive engagement innovation with Big Data
Transform your DBMS to drive engagement innovation with Big Data
 
Steps towards business intelligence
Steps towards business intelligenceSteps towards business intelligence
Steps towards business intelligence
 
Evolution of Distributed Database Technologies in the Digital era
Evolution of Distributed Database Technologies in the Digital eraEvolution of Distributed Database Technologies in the Digital era
Evolution of Distributed Database Technologies in the Digital era
 
Relational databases vs Non-relational databases
Relational databases vs Non-relational databasesRelational databases vs Non-relational databases
Relational databases vs Non-relational databases
 
Hadoop meets Agile! - An Agile Big Data Model
Hadoop meets Agile! - An Agile Big Data ModelHadoop meets Agile! - An Agile Big Data Model
Hadoop meets Agile! - An Agile Big Data Model
 
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?Data Lake Acceleration vs. Data Virtualization - What’s the difference?
Data Lake Acceleration vs. Data Virtualization - What’s the difference?
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)Data Lakehouse, Data Mesh, and Data Fabric (r1)
Data Lakehouse, Data Mesh, and Data Fabric (r1)
 
What is Data as a Service by T-Mobile Principle Technical PM
What is Data as a Service by T-Mobile Principle Technical PMWhat is Data as a Service by T-Mobile Principle Technical PM
What is Data as a Service by T-Mobile Principle Technical PM
 
Introduction to Bigdata and NoSQL
Introduction to Bigdata and NoSQLIntroduction to Bigdata and NoSQL
Introduction to Bigdata and NoSQL
 
Mastering Customer Data on Apache Spark
Mastering Customer Data on Apache SparkMastering Customer Data on Apache Spark
Mastering Customer Data on Apache Spark
 
How to use Big Data and Data Lake concept in business using Hadoop and Spark...
 How to use Big Data and Data Lake concept in business using Hadoop and Spark... How to use Big Data and Data Lake concept in business using Hadoop and Spark...
How to use Big Data and Data Lake concept in business using Hadoop and Spark...
 
Evolution of the DBA to Data Platform Administrator/Specialist
Evolution of the DBA to Data Platform Administrator/SpecialistEvolution of the DBA to Data Platform Administrator/Specialist
Evolution of the DBA to Data Platform Administrator/Specialist
 
Managing Large Amounts of Data with Salesforce
Managing Large Amounts of Data with SalesforceManaging Large Amounts of Data with Salesforce
Managing Large Amounts of Data with Salesforce
 
Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...
Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...
Geek Sync | Avoid the Seven Mistakes Data Modelers Make in Aiding Data Govern...
 

More from Karen Lopez

DGIQ East 2023 AI Ethics SIG
DGIQ East 2023 AI Ethics SIGDGIQ East 2023 AI Ethics SIG
DGIQ East 2023 AI Ethics SIGKaren Lopez
 
A Designer's Favourite Security and Privacy Features in SQL Server and Azure ...
A Designer's Favourite Security and Privacy Features in SQL Server and Azure ...A Designer's Favourite Security and Privacy Features in SQL Server and Azure ...
A Designer's Favourite Security and Privacy Features in SQL Server and Azure ...Karen Lopez
 
Data in the Stars
Data in the StarsData in the Stars
Data in the StarsKaren Lopez
 
Designer's Favorite New Features in SQLServer
Designer's Favorite New Features in SQLServerDesigner's Favorite New Features in SQLServer
Designer's Favorite New Features in SQLServerKaren Lopez
 
WhoseTinklingInYourDataLake - DAMA Chicago.pdf
WhoseTinklingInYourDataLake - DAMA Chicago.pdfWhoseTinklingInYourDataLake - DAMA Chicago.pdf
WhoseTinklingInYourDataLake - DAMA Chicago.pdfKaren Lopez
 
Expert Cloud Data Backup and Recovery Best Practice.pptx
Expert Cloud Data Backup and Recovery Best Practice.pptxExpert Cloud Data Backup and Recovery Best Practice.pptx
Expert Cloud Data Backup and Recovery Best Practice.pptxKaren Lopez
 
Manage Your Time So It Doesn't Manage You
Manage Your Time So It Doesn't Manage YouManage Your Time So It Doesn't Manage You
Manage Your Time So It Doesn't Manage YouKaren Lopez
 
Migrating Data and Databases to Azure
Migrating Data and Databases to AzureMigrating Data and Databases to Azure
Migrating Data and Databases to AzureKaren Lopez
 
Blockchain for the DBA and Data Professional
Blockchain for the DBA and Data ProfessionalBlockchain for the DBA and Data Professional
Blockchain for the DBA and Data ProfessionalKaren Lopez
 
Blockchain for the DBA and Data Professional
Blockchain for the DBA and Data ProfessionalBlockchain for the DBA and Data Professional
Blockchain for the DBA and Data ProfessionalKaren Lopez
 
Data Modeling for Security, Privacy and Data Protection
Data Modeling for Security, Privacy and Data ProtectionData Modeling for Security, Privacy and Data Protection
Data Modeling for Security, Privacy and Data ProtectionKaren Lopez
 
Fast Focus: SQL Server Graph Database & Processing
Fast Focus: SQL Server Graph Database & ProcessingFast Focus: SQL Server Graph Database & Processing
Fast Focus: SQL Server Graph Database & ProcessingKaren Lopez
 
Designing for Data Security by Karen Lopez
Designing for Data Security by Karen LopezDesigning for Data Security by Karen Lopez
Designing for Data Security by Karen LopezKaren Lopez
 
The Key to Keys - Database Design
The Key to Keys - Database DesignThe Key to Keys - Database Design
The Key to Keys - Database DesignKaren Lopez
 
NoSQL and Data Modeling for Data Modelers
NoSQL and Data Modeling for Data ModelersNoSQL and Data Modeling for Data Modelers
NoSQL and Data Modeling for Data ModelersKaren Lopez
 

More from Karen Lopez (15)

DGIQ East 2023 AI Ethics SIG
DGIQ East 2023 AI Ethics SIGDGIQ East 2023 AI Ethics SIG
DGIQ East 2023 AI Ethics SIG
 
A Designer's Favourite Security and Privacy Features in SQL Server and Azure ...
A Designer's Favourite Security and Privacy Features in SQL Server and Azure ...A Designer's Favourite Security and Privacy Features in SQL Server and Azure ...
A Designer's Favourite Security and Privacy Features in SQL Server and Azure ...
 
Data in the Stars
Data in the StarsData in the Stars
Data in the Stars
 
Designer's Favorite New Features in SQLServer
Designer's Favorite New Features in SQLServerDesigner's Favorite New Features in SQLServer
Designer's Favorite New Features in SQLServer
 
WhoseTinklingInYourDataLake - DAMA Chicago.pdf
WhoseTinklingInYourDataLake - DAMA Chicago.pdfWhoseTinklingInYourDataLake - DAMA Chicago.pdf
WhoseTinklingInYourDataLake - DAMA Chicago.pdf
 
Expert Cloud Data Backup and Recovery Best Practice.pptx
Expert Cloud Data Backup and Recovery Best Practice.pptxExpert Cloud Data Backup and Recovery Best Practice.pptx
Expert Cloud Data Backup and Recovery Best Practice.pptx
 
Manage Your Time So It Doesn't Manage You
Manage Your Time So It Doesn't Manage YouManage Your Time So It Doesn't Manage You
Manage Your Time So It Doesn't Manage You
 
Migrating Data and Databases to Azure
Migrating Data and Databases to AzureMigrating Data and Databases to Azure
Migrating Data and Databases to Azure
 
Blockchain for the DBA and Data Professional
Blockchain for the DBA and Data ProfessionalBlockchain for the DBA and Data Professional
Blockchain for the DBA and Data Professional
 
Blockchain for the DBA and Data Professional
Blockchain for the DBA and Data ProfessionalBlockchain for the DBA and Data Professional
Blockchain for the DBA and Data Professional
 
Data Modeling for Security, Privacy and Data Protection
Data Modeling for Security, Privacy and Data ProtectionData Modeling for Security, Privacy and Data Protection
Data Modeling for Security, Privacy and Data Protection
 
Fast Focus: SQL Server Graph Database & Processing
Fast Focus: SQL Server Graph Database & ProcessingFast Focus: SQL Server Graph Database & Processing
Fast Focus: SQL Server Graph Database & Processing
 
Designing for Data Security by Karen Lopez
Designing for Data Security by Karen LopezDesigning for Data Security by Karen Lopez
Designing for Data Security by Karen Lopez
 
The Key to Keys - Database Design
The Key to Keys - Database DesignThe Key to Keys - Database Design
The Key to Keys - Database Design
 
NoSQL and Data Modeling for Data Modelers
NoSQL and Data Modeling for Data ModelersNoSQL and Data Modeling for Data Modelers
NoSQL and Data Modeling for Data Modelers
 

Recently uploaded

modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
IMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxIMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxdolaknnilon
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhYasamin16
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一F La
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一F sss
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...ttt fff
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档208367051
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...ssuserf63bd7
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 

Recently uploaded (20)

modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
IMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxIMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptx
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhhThiophen Mechanism khhjjjjjjjhhhhhhhhhhh
Thiophen Mechanism khhjjjjjjjhhhhhhhhhhh
 
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
办理(UWIC毕业证书)英国卡迪夫城市大学毕业证成绩单原版一比一
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
 
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
原版1:1定制南十字星大学毕业证(SCU毕业证)#文凭成绩单#真实留信学历认证永久存档
 
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
Statistics, Data Analysis, and Decision Modeling, 5th edition by James R. Eva...
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 

How to Survive as a Data Architect in a Polyglot Database World

  • 1. Surviving as a Data Architect in a Polyglot Database World Karen Lopez @datachick
  • 3. Karen López Karen has 20+ years of data and information architecture experience on large, multi-project programs. She is a frequent speaker on data modeling, data-driven methodologies and pattern data models. She wants you to love your data. www.datamodel.com
  • 5. Outcomes: Leave here understanding: Why multi- model designs are important Database / datastore types Multi-model thinking How to future-proof your data arch career How to learn more
  • 6. A Good Data Architect Cost, Benefit, Risk Data Protection Business Requirements & models Design Models Fit to needs
  • 7.
  • 9. What do you mean, SURVIVE? • Continue to be involved in data- related architectural decisions • Data models wanted and appreciated • Data model driven development still a thing • Valued for bringing business needs and database opportunities together • Team player • Knowledgeable of database features
  • 11. Hybrid Purely relational (SQL) databases don’t exist any longer • Columnstore features • XML • JSON • Other NoSQL features 01 Applications make use of multiple database and datastore technologies 02 Schemas in a variety of places 03
  • 12. Typical Characteristics of Traditional Apps Real-time request/response Synchronous Always connected Transactional Scale-Up Failover
  • 13. Modern Cloud Apps Characteristics Asynchronous / Queued Eventual Consistency Microservices Command & Query Pattern Polyglot Persistence
  • 14. Do you support/model/do stuff In relational technologies? In pre-relational tech? In post-relational (NoSQL) tech? How’s that going for ya?
  • 15. A very fast review of NoSQL & SQL I have other recorded webinars at Dataversity that go into more detail
  • 16. Basically Available Soft State Eventually Consistent BASE ACID Atomic Consistent Isolated Durable BASE - ACID
  • 17. This used to be an Either/Or decision BASE ACID
  • 18. Polyglot persistence •Optimized for data •Optimized for workload Not all new •EAV •XML •Architecture paradigm: OLAP/DW and OLTP But now….
  • 19. NoSQL, Not Only SQL, SQL* Relational Key Value Columnar Column Family Document Hadoop Graph Plus all the hybrid versions of these…
  • 21. Partition Key Row Key Value Marketing 00001 Name Age Bob 35 Marketing 00002 Name Age Karen 42 Marketing Department Name Count MKTG 104 Sales 00010 Name Age Sam 29 • Simple Map • Semi-Structured • De-normalized Key-Value Databases
  • 23. • No schema, no relationships • Collections of documents • Documents are JSON objects Document databases { "title": "Polo Long Sleeve Shirt", "color": "red", "material": "cotton" } { "title": "NoSQL Distilled, "author": [ "Martin Fowler", “P. Sadalage" ], "isbn": "978-0321826626", "pages": 192 }
  • 24. Document CosmosDB MongoDB BSON & JSON Databases, Documents, Collections
  • 25. • Map-based • Relationships of Edges and Nodes Graph databases
  • 27. Wait…What? Graph Database and Processing inside relational database engine
  • 30. This means… • SQL vs. NoSQL isn’t a thing any longer • Other RDBMS vendors will be adding non- relational functions and structures • Data Architects who specialize in relational-only modeling and design will be overly specialized • We don’t have a choice to ignore these new database models
  • 31. Most relational databases are adding NoSQL support Columnstore Graph XML JSON You’ve probably even developed your own as a table
  • 32. Hybrid is the future, though • Column Family + Key Value • Relational + Graph + Columnstore • Relational + Columnstore • Graph + Column Family + Key Value + Document …all in the same engine. This is a major change in how things worked a decade ago.
  • 33. Did you notice? • Database as a Service from Microsoft • Supports (for now) • Graph • Key Value • Column Family • Document • Globally distributed data • Scalable • Tunable Consistency CosmosDB
  • 34. But what does this all mean? Data Modeling process impacts Data Modeler impacts
  • 35. Quotes from team NoSQL • No need for a data model • Can’t be mapped to a data model • SQL is not flexible • SQL Databases don’t scale • SQL Databases die after about 1 GB size • SQL Databases use old technology
  • 36. Quotes from team SQL (relational) • Eventual Consistency? WTH? • What do you mean data quality just doesn’t matter? • My tools won’t work with these databases • It’s all a fad; There will be a new fad tomorrow • Bbbbbut…but..but DATA QUALITY!!!!! INTEGRITY!!!
  • 37. What is the role of existing Data Models?
  • 38. Data Models – Traditional Process Conceptual (Data) Model Logical Data Model Physical Data Model(s) OLTP OLTP OLTP OLTP OLTP MARTMART OLTP OLTP OLTP
  • 39. What about the Modeling Tasks & methods Schema Schemaless Schema on read Polyschematic
  • 40. Schemas • In the database as physical structure • In a separate location such as • DSD • Schema layer on top of the physical structure • In the application code • In the data itself
  • 41. New Process Apply Apply logical data model understandings to physical structure • Using data modeling and/or tools • Using a whiteboard • Using code Understand Understand data Understand Understand underlying architecture
  • 42. Traditional Data Modeler Involvement Project Initiation Architecture and Infrastructure Design SW Requirements Development Deployment
  • 43. Modern Data Modeler Involvement Project Initiation Architecture and Infrastructure Design SW Requirements Development Deployment
  • 44. What do we really mean by scale? Rapidly add more compute & data power Massively parallel processing Cheap, commodity hardware, but lots of it Optimized for Query/Reads/Questions/Telling stories
  • 45. Most NoSQL • Was developed with scale in mind • Has tunable consistency • Scale out, versus scale up • Uses distributed data (multiple copies), globally • Understanding workloads and workload trends is important This metadata isn’t often collected and modeled by data architect
  • 46. Where Data Models Can Help • reverse engineer, where possible. • one entity per tab spreadsheet • some Normalized (master data + combinations) Create a Physical Data Model of Key Data Sources • definitions • gotchas • expected domains • metadata Model the metadata • Describe data consistently • Don’t prescribe structure • Naming standards • Datatype-ish Think Differently
  • 47. Think Differently • Data Modeling tools have to catch up • New tools are being developed • Data Models can’t be prescriptions all the time • Naming standards over constraints • Data Types are general, higher level • Tunable data quality
  • 48. How are you transitioning to “different”?
  • 49. Entity-Relationship Modeling & IDEF1X Is it enough? Should we extend it? Should we create new notations for each type of Database? Scrap and start all of it again?
  • 50. What about all those very sexy data modelers?
  • 51. “Every design decision should include cost, benefit and risk” - Karen Lopez
  • 52. Great Enterprise Data Modelers Characteristics Patient Zen balance / non-attachment Get-er done Collaboration not sentries/cops Broad Business Domain Broad, strategic view Deep, project empathy Respected Good project experience Good negotiators 52
  • 53. So let’s summarize: 1. The more SQL-like features available for NoSQL databases, the more likely a data modeling tool is to support it. 2. Modeling tool vendors will support features that users ask for cause them to win deals. This is not a bad thing. 3. Serious NoSQL vendors* understand that hybrid is the enterprise data story. They want us to find a way. 4. Our data models have value, even if the NoSQL solution doesn’t require a lot of constraints.
  • 54. 10 Tips for Data Modelers 1. Learn about these methods – don’t avoid them 2. Get hands-on training. Get certified even 3. Learn the lingo 4. Use the lingo 5. Be able to describe data modeling and data governance to the context of these database technologies and their use cases
  • 55. 10 Tips for Data Modelers 6. Bring data models (and other models) to the team 7. Be ahead of the curve on new NoSQL and SQL features in your DBMSs 8. Understand the use cases for each type of database technology 9. Let go, a little, when it makes sense 10. Enjoy the new database smell!
  • 56. What to do Learn Get Hands-on experience with these new data technologies Talk to your tool vendors Bring models you have
  • 58. Making Sense of NoSQL clearly and concisely explains the concepts, features, benefits, potential, and limitations of NoSQL technologies. Using examples and use cases, illustrations, and plain, jargon-free writing, this guide shows how you can effectively assemble a NoSQL solution to replace or augment the traditional RDBMS you have now.
  • 60. This book is written for anyone who is working with, or will be working with MongoDB, including business analysts, data modelers, database administrators, developers, project managers, and data scientists.