SlideShare une entreprise Scribd logo
1  sur  21
Télécharger pour lire hors ligne
LITTLE DATA IN A BIG
DATA WORLD
The case of health care
Dataversity / DAMA
February 2016
Laura Sebastian-Coleman,
Ph.D., IQCP
Offered by: Connecticut General Life Insurance Company or Cigna Health and Life Insurance Company.
About me
• Doing data quality in health care since 2003
• Have worked in banking, manufacturing, distribution,
commercial insurance, and academia.
• All have influenced my understanding of data, quality,
and measurement
• Developed the Data Quality Assessment Framework
(DQAF); published in Measuring Data Quality for
Ongoing Improvement (2013).
• IAIDQ Distinguished Member Award 2015.
• DAMA Publications Director, beginning summer 2015
• Influences on my thinking about data:
• The challenge of how to measure data quality
• The concept of measurement itself: A problems of
measurement is a microcosm of the general
challenge of data definition and collection.
• The demands of data warehousing, especially of data
integration.
2
• Abstract:
– While technological innovation brings constant change to the data landscape, many
organizations still struggle with the basics: ensuring they have reliable, high quality data.
– In health care, the promise of insight to be gained through analytics is dependent on ensuring
the interactions between providers and patients are recorded accurately and completely.
– A lot of health care data is dependent on person-to-person contact. This fact influences its
quality, as well as how it is captured, stored, accessed and used.
– This presentation will ask the audience to think about data in old and new ways in order to gain
additional insight about how to improve the quality of data, regardless of size.
• Order of information
– The promise of big data in health care vs. the reality of the health care system
– The data quality in health care
– How we think about data and data quality
– Making good on the promise of health care analytics
Abstract and Agenda
3
Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
• According to the blog Big Data Landscape, Big Data Health Care apps enable people
and organizations to
– Make well-informed decisions
– Enhance patient engagement
– Encourage prescription medication compliance
– Adjust lifestyle choices
– Maximize levels of well-being
– Predict and prepare for individual illnesses
– Manage demand within the health care system
• In Health Care Delivery of the Future, Price Waterhouse Coopers describes a New
Health Economy in which:
– Digitally-enabled care is a fundamental business imperative.
– There are major shifts in how care is being delivered.
– Digital technology bridges time, distance and the expectation gap between
consumers and clinicians.
The Promise of Big Data in Health Care
4
Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
• At a recent insurance summit in
Hartford, Connecticut, health care
executives grappled with a
quandary:
“Only in health care does
innovation lead to higher costs.”
• Not only that, but within the health
care system, there are
inefficiencies. One executive
estimated
“Thirty cents on the dollar, at least,
is not used effectively.”
• Getting people to act in the best
way for their health is
“Very easy to say. Very hard to do.”
The Reality of Health Care
5
Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
• The promise of health care analytics and big data depends on having high quality data.
• Technology limitations
– Data output from older, legacy claim adjudication systems differs greatly within and
between organizations
– Medical provider transition from paper to Electronic Health Records (EHR) is
relatively recent and still underway
– In 2008, only 4% of Providers had full EHR, 13% had a basic system
• Lack of standards /
standards implemented
inconsistently
• No standards for data quality
Challenges with the Quality of Health Care Data
6
Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
• Goal: Improve knowledge of the quality of data gathered in non-clinical trial settings.
– As research data warehouses and large scale data networks begin to become
established sources of observational data, it becomes more critical that consistent
methods for assessing and reporting data quality are developed and adopted so that
users of data and consumers of results understand the potential impact of data
quality on study results.
• Factors that can cause misrepresentation of clinical events:
– Inflexible systems
– Coding practices
– Gaps in standards
• Example: Screening for high blood pressure in children
– Clinicians were directed to screen children for high blood pressure.
– After the initial screening, they continued to monitor some of the children.
– When researchers looked at the data, it appeared children were increasingly at risk
of high blood pressure.
– Examination of records showed that the ‘increase’ was caused by incorrect use of
hypertension diagnosis code
– But there was not a diagnosis code for “considering hypertension”
Academy Health Example
7
Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
• Person-to-person interactions
between patients and health care
providers
• Interactions influence factors that
may result in different interpretation
of symptoms
• Interactions are represented by data
• Different choices in how to
represent these interactions create
different data stories
Health Care data is about people
8
Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
Definition: Data
• Data’s Latin root is dare, past participle of to give. Data means “something given.” In
math and engineering, the terms data and givens are used interchangeably.
• The New Oxford American Dictionary (NOAD) defines data as “facts and statistics
collected together for reference or analysis.”
• ISO defines data as “re-interpretable representation of information in a formalized
manner suitable for communication, interpretation, or processing” (ISO 11179).
• Observations about the concept of data
– Data tries to tell the truth about the world (“facts”)
– Data is formal – it has a shape
– Data’s function is representational
– Data is often about quantities, measurements, and other numeric
representations “facts”
– Things are done with data: reference, analysis, interpretation, processing
• What the definitions leave out:
– Data is made by people. We choose what characteristics to represent. The creation
of data implies a set of expectations about data’s condition.
– People also use data. The uses of data imply a set of expectations about data’s
condition.
9
Science
– Focuses on measurement as a
means to create knowledge
– Plans for data that is accurate
and complete
– Sets standards so that results
are collected correctly and can
be understood and reproduced
– Tests and re-tests
• Data is a PRODUCT of science.
• High quality data is a necessity.
Commerce
– Creates measurements
pragmatically to meet goals
– Generates data by accident
while executing its core
processes
• Data is a BY-PRODUCT of
commerce
• Historically, data quality has not
an end in itself for commerce.
Science, Commerce and Data Quality
10
Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
• The History of Measurement = history of
our ideas about data
• Science & Commerce have different
goals for and approaches to
measurement and thus to data.
• Our ideas about data come largely from
science, but we create data based on
commerce.
The Scientific Method vs. Commerce
Data Quality
• Our ideas about data quality
come largely from science,
even though we create data
based on commerce.
• Today, we are using
organizational data in
scientific ways – to learn
about our business.
• We expect the data to be fit
for this purpose, but we
have not focused on
ensuring representational
effectiveness.
12
Fitness for
purpose
Representational
effectiveness
• Even scientists are struggling with data quality in the age of Big Data.
• A February 2015 article in Science News reported researchers are struggling to find
insights as they sort through “mounds of data”:
“Just keeping track of big data is a monumental undertaking. Sharing the data
with other researchers, a critical piece of transparency and efficiency in science,
has its own set of problems. And the tools used to analyze complex datasets are
just as important as the data themselves. Each time a scientist chooses one
computer program over another or decides to investigate one variable rather than
a different one, the decision can lead to very different conclusions.”
Data Quality and Big Data
13
Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
• The inability to reproduce results has
brought some studies into question.
• The answer? Study the factors that
influence reproducibility.
• They have discovered a kind of “butterfly
effect” in studies using big data:
– Small changes in the initial conditions
of an experiment can have significant
effects on the outcome of replication
attempts.
Butterfly effect in the clinical space
A) Decision features
a. Framing (e.g., gain vs. losses) (2 factors)
b. Order of choices (e.g. A à B vs. BàA in a simple two
choice decision) (2 factors)
c. Choice justification (e.g., effect of regret, guilt etc. on
dissonance reduction; yes vs. no) (2 factors)
B) Situational factors
a. Time pressure (e.g., yes vs no) (2 factors)
b. Cognitive load (e.g., high vs. low) (2 factors)
c. Social context (e.g., important vs. not important) (2
factors)
C) Characteristics of decision-maker
a. Individual [e.g., age (old vs. young), gender (female vs.
male) (4 factors)
b. Group (e.g, small vs. large group) (2 factors)
c. Cultural factors (e.g., present vs. not preset/important) (2
factors)
D) Individual differences
a. Decision styles (e.g. intuitive vs. analytic) (2 factors)
b. Cognitive ability (e.g., high vs. low) (2 factors)
c. Personality (e.g., openness, conscientious, extraversion,
agreeableness, neuroticism) (“Big 5” factors)
Table 1. Minimum number of the factors affecting decision-making
From Effect of Initial Conditions on Reproducibility of Scientific
Research, by Benjamin Djulbegovic and Iztok Hozo
• Small changes in the initial
conditions of an experiment can
have significant effects on the
outcome of replication attempts.
• Researchers used Doctor/Patient
interactions to study the butterfly
effect and identified 12 factors
that influence clinical decision
making.
• Those initial factors make up
20,480 combinations that could
represent the initial conditions of
the experiment.
• Yes, 20,480! Initial conditions can
influence clinical decision making
and the data that is recorded as
part of it.
The butterfly effect takes on a whole new meaning
15
Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
• Recognize that there is variation within the health care system and use this variation to
improve the system.
• Reduce variation to simplify the system.
• Recognize that data is a critical product of the health care system and plan for quality
data.
Making good on the promises of health care analytics
16
Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
• Variation may itself be meaningful.
• For example, variation in how doctors make decisions
– If we understand how doctors make decisions, and we understand he outcome of
those decisions, then we can assess the results of those outcomes and provide that
feedback to the health care system.
• This is the basic promise of evidence-based medicine and health care analytics, but it
requires a lot of data.
Making good on the promises of health care analytics:
Recognize variation
17
Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
• Establish and enforce data standards within health care
– Including data quality standards
• Collect more data through instruments, devices to reduce ambiguity of meaning
– In 2015, PWC reported that 28% of consumers have at least one health related app
on their tablet or smart phone.
– Up from 13% just three years earlier.
– About two-thirds of physicians reported that they would be willing to prescribe an
app that would help patients manage a chronic disease.
– Being able to integrate this kind of data into an electronic medical record represents
an additional opportunity.
• Enable feedback about the data from health care providers, patients, and other
stakeholders
– More health care data is being exposed directly to physicians and to consumers.
– Physician compensation and Consumer incentives depend on this data.
– To reduce costs and improve quality of care, health care companies will need means
to respond to feedback and ensure data is accurate.
Making good on the promises of health care analytics:
Reduce variation
18
Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
• Recognize that improvement in health care and in quality of life depend on having
trustworthy, reliable data.
– Data is not secondary to successful outcomes, it is central to successful outcomes.
• Focus from the start on ensuring the quality of that data, whether that data is little
(such as a record of the facts of an office visit) or big (petabytes of biometric data
collected from all the Fitbits in the USA).
• High quality data is:
– Clearly defined
– Consistently collected through reliable processes
– Presented through comprehensible standards
• The processes and systems that we build to create this new health care data should
be designed with the data itself in mind.
– The industry as a whole must cease to create data as a by-product and recognize
data as critical input to its primary goal: helping improve people’s health, well-being
and sense of security.
Making good on the promises of health care analytics:
Plan for data as a product
19
Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
References
• http://www.bigdatalandscape.com/blog/big-data-health-find-apps-can-improve-wellbeing
• http://www.pwc.com/us/en/health-industries/top-health-industry-issues/assets/pwc-healthcare-delivery-of-
the-future.pdf
• http://touch.courant.com/#section/-1/article/p2p-84932278/
• https://en.wikipedia.org/wiki/Electronic_Medical_Record#United_States
• http://www.academyhealth.org/
• http://www.pcori.org/about-us/why-pcori-was-created
• http://courses.washington.edu/geog482/resource/14_Beyond_Accuracy.pdf
• https://www.sciencenews.org/article/redoing-scientific-research-best-way-find-truth
• https://www.sciencenews.org/article/big-data-studies-come-replication-challenges
• http://www.scopemed.org/fulltextpdf.php?mno=162185
• http://consumerhealthchoices.org/wp-content/uploads/2015/05/HealthDataGuide-June2015.pdf
• http://www.pwc.com/us/en/health-industries/top-health-industry-issues/assets/pwc-healthcare-delivery-of-
the-future.pdf
• http://kff.org/health-reform/fact-sheet/summary-of-the-affordable-care-act/
• http://www.forbes.com/sites#/sites/emc/2014/01/22/can-big-data-and-mobile-make-health-care-more-
effective/
• http://www.nytimes.com/2016/01/17/opinion/sunday/how-measurement-fails-doctors-and-
teachers.html?_r=0
•
All Cigna products and services are provided exclusively by or through operating subsidiaries of Cigna Corporation, including Cigna Health and Life Insurance Company,
Connecticut General Life Insurance Company, Cigna Behavioral Health, Inc., and HMO or service company subsidiaries of Cigna Health Corporation. The Cigna name, logo,
and other Cigna marks are owned by Cigna Intellectual Property, Inc.
000000 00/15 © 2015 Cigna. Some content provided under license.

Contenu connexe

Tendances

Data Culture Keynote and Exec Track Birm Dec 8th
Data Culture Keynote and Exec Track Birm Dec 8thData Culture Keynote and Exec Track Birm Dec 8th
Data Culture Keynote and Exec Track Birm Dec 8thJonathan Woodward
 
Top 10 Social Media Tips For Financial Advisors
Top 10 Social Media Tips For Financial AdvisorsTop 10 Social Media Tips For Financial Advisors
Top 10 Social Media Tips For Financial AdvisorsFinworx
 
intro chatGPT workshop.pdf
intro chatGPT workshop.pdfintro chatGPT workshop.pdf
intro chatGPT workshop.pdfpeterpur
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?DATAVERSITY
 
Lean Analytics for Startups and Enterprises
Lean Analytics for Startups and EnterprisesLean Analytics for Startups and Enterprises
Lean Analytics for Startups and EnterprisesLean Analytics
 
Future of Data and AI in Retail - NRF 2023
Future of Data and AI in Retail - NRF 2023Future of Data and AI in Retail - NRF 2023
Future of Data and AI in Retail - NRF 2023Rob Saker
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data scienceSampath Kumar
 
Visualisation & Storytelling in Data Science & Analytics
Visualisation & Storytelling in Data Science & AnalyticsVisualisation & Storytelling in Data Science & Analytics
Visualisation & Storytelling in Data Science & AnalyticsFelipe Rego
 
Data Modeling, Data Governance, & Data Quality
Data Modeling, Data Governance, & Data QualityData Modeling, Data Governance, & Data Quality
Data Modeling, Data Governance, & Data QualityDATAVERSITY
 
Social Media Analytics
Social Media AnalyticsSocial Media Analytics
Social Media AnalyticsRaven Tools
 
Why is Customer Data Platform (CDP) ?
Why is Customer Data Platform (CDP) ?Why is Customer Data Platform (CDP) ?
Why is Customer Data Platform (CDP) ?Trieu Nguyen
 
24 Awesome Infographic Ideas to Inspire Your Next Beautiful Creation
24 Awesome Infographic Ideas to Inspire Your Next Beautiful Creation24 Awesome Infographic Ideas to Inspire Your Next Beautiful Creation
24 Awesome Infographic Ideas to Inspire Your Next Beautiful CreationPiktochart
 
Publishing Production: From the Desktop to the Cloud
Publishing Production: From the Desktop to the CloudPublishing Production: From the Desktop to the Cloud
Publishing Production: From the Desktop to the CloudDeanta
 
The Other C Word: What makes great content marketing great
The Other C Word: What makes great content marketing greatThe Other C Word: What makes great content marketing great
The Other C Word: What makes great content marketing greatVelocity Partners
 
Why B2B Marketing is so Boring
Why B2B Marketing is so BoringWhy B2B Marketing is so Boring
Why B2B Marketing is so BoringVelocity Partners
 

Tendances (20)

Data Culture Keynote and Exec Track Birm Dec 8th
Data Culture Keynote and Exec Track Birm Dec 8thData Culture Keynote and Exec Track Birm Dec 8th
Data Culture Keynote and Exec Track Birm Dec 8th
 
Determine Your Data Strategy
Determine Your Data StrategyDetermine Your Data Strategy
Determine Your Data Strategy
 
Big data case study collection
Big data   case study collectionBig data   case study collection
Big data case study collection
 
Data science
Data scienceData science
Data science
 
AI, ChatGPT and Content Marketing - Andrew Jenkins, Volterra Consulting
AI, ChatGPT and Content Marketing - Andrew Jenkins, Volterra ConsultingAI, ChatGPT and Content Marketing - Andrew Jenkins, Volterra Consulting
AI, ChatGPT and Content Marketing - Andrew Jenkins, Volterra Consulting
 
Top 10 Social Media Tips For Financial Advisors
Top 10 Social Media Tips For Financial AdvisorsTop 10 Social Media Tips For Financial Advisors
Top 10 Social Media Tips For Financial Advisors
 
intro chatGPT workshop.pdf
intro chatGPT workshop.pdfintro chatGPT workshop.pdf
intro chatGPT workshop.pdf
 
Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?Data Catalogs Are the Answer – What Is the Question?
Data Catalogs Are the Answer – What Is the Question?
 
Lean Analytics for Startups and Enterprises
Lean Analytics for Startups and EnterprisesLean Analytics for Startups and Enterprises
Lean Analytics for Startups and Enterprises
 
Future of Data and AI in Retail - NRF 2023
Future of Data and AI in Retail - NRF 2023Future of Data and AI in Retail - NRF 2023
Future of Data and AI in Retail - NRF 2023
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
Visualisation & Storytelling in Data Science & Analytics
Visualisation & Storytelling in Data Science & AnalyticsVisualisation & Storytelling in Data Science & Analytics
Visualisation & Storytelling in Data Science & Analytics
 
The Hierarchy of Engagement
The Hierarchy of EngagementThe Hierarchy of Engagement
The Hierarchy of Engagement
 
Data Modeling, Data Governance, & Data Quality
Data Modeling, Data Governance, & Data QualityData Modeling, Data Governance, & Data Quality
Data Modeling, Data Governance, & Data Quality
 
Social Media Analytics
Social Media AnalyticsSocial Media Analytics
Social Media Analytics
 
Why is Customer Data Platform (CDP) ?
Why is Customer Data Platform (CDP) ?Why is Customer Data Platform (CDP) ?
Why is Customer Data Platform (CDP) ?
 
24 Awesome Infographic Ideas to Inspire Your Next Beautiful Creation
24 Awesome Infographic Ideas to Inspire Your Next Beautiful Creation24 Awesome Infographic Ideas to Inspire Your Next Beautiful Creation
24 Awesome Infographic Ideas to Inspire Your Next Beautiful Creation
 
Publishing Production: From the Desktop to the Cloud
Publishing Production: From the Desktop to the CloudPublishing Production: From the Desktop to the Cloud
Publishing Production: From the Desktop to the Cloud
 
The Other C Word: What makes great content marketing great
The Other C Word: What makes great content marketing greatThe Other C Word: What makes great content marketing great
The Other C Word: What makes great content marketing great
 
Why B2B Marketing is so Boring
Why B2B Marketing is so BoringWhy B2B Marketing is so Boring
Why B2B Marketing is so Boring
 

En vedette

How to reach a Data Driven culture
How to reach a Data Driven cultureHow to reach a Data Driven culture
How to reach a Data Driven cultureMark Beekman
 
CDO Webinar: 2017 Trends in Data Strategy
CDO Webinar: 2017 Trends in Data StrategyCDO Webinar: 2017 Trends in Data Strategy
CDO Webinar: 2017 Trends in Data StrategyDATAVERSITY
 
Impact Hiring: How Data Will Transform Youth Employment
Impact Hiring: How Data Will Transform Youth EmploymentImpact Hiring: How Data Will Transform Youth Employment
Impact Hiring: How Data Will Transform Youth EmploymentThe Rockefeller Foundation
 
Big Data Revolution: Are You Ready for the Data Overload?
Big Data Revolution: Are You Ready for the Data Overload?Big Data Revolution: Are You Ready for the Data Overload?
Big Data Revolution: Are You Ready for the Data Overload?Aleah Radovich
 
10 A/B Testing Mistakes that Make Your Wallet Cry
10 A/B Testing Mistakes that Make Your Wallet Cry10 A/B Testing Mistakes that Make Your Wallet Cry
10 A/B Testing Mistakes that Make Your Wallet CryConvert.com
 
PROG_UntoldStory ISV eBook_0706c FINAL
PROG_UntoldStory ISV eBook_0706c FINALPROG_UntoldStory ISV eBook_0706c FINAL
PROG_UntoldStory ISV eBook_0706c FINALSolarWinds MSP
 
Data Gravity, IoT, and Time Series - ThingMonk 2015
Data Gravity, IoT, and Time Series - ThingMonk 2015Data Gravity, IoT, and Time Series - ThingMonk 2015
Data Gravity, IoT, and Time Series - ThingMonk 2015dave.m
 
Data Science, Machine Learning and Neural Networks
Data Science, Machine Learning and Neural NetworksData Science, Machine Learning and Neural Networks
Data Science, Machine Learning and Neural NetworksBICA Labs
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data ScienceCaserta
 
A Statistician's View on Big Data and Data Science (Version 1)
A Statistician's View on Big Data and Data Science (Version 1)A Statistician's View on Big Data and Data Science (Version 1)
A Statistician's View on Big Data and Data Science (Version 1)Prof. Dr. Diego Kuonen
 
Myths and Mathemagical Superpowers of Data Scientists
Myths and Mathemagical Superpowers of Data ScientistsMyths and Mathemagical Superpowers of Data Scientists
Myths and Mathemagical Superpowers of Data ScientistsDavid Pittman
 
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?Data Science London
 
Working With Big Data
Working With Big DataWorking With Big Data
Working With Big DataSeth Familian
 

En vedette (13)

How to reach a Data Driven culture
How to reach a Data Driven cultureHow to reach a Data Driven culture
How to reach a Data Driven culture
 
CDO Webinar: 2017 Trends in Data Strategy
CDO Webinar: 2017 Trends in Data StrategyCDO Webinar: 2017 Trends in Data Strategy
CDO Webinar: 2017 Trends in Data Strategy
 
Impact Hiring: How Data Will Transform Youth Employment
Impact Hiring: How Data Will Transform Youth EmploymentImpact Hiring: How Data Will Transform Youth Employment
Impact Hiring: How Data Will Transform Youth Employment
 
Big Data Revolution: Are You Ready for the Data Overload?
Big Data Revolution: Are You Ready for the Data Overload?Big Data Revolution: Are You Ready for the Data Overload?
Big Data Revolution: Are You Ready for the Data Overload?
 
10 A/B Testing Mistakes that Make Your Wallet Cry
10 A/B Testing Mistakes that Make Your Wallet Cry10 A/B Testing Mistakes that Make Your Wallet Cry
10 A/B Testing Mistakes that Make Your Wallet Cry
 
PROG_UntoldStory ISV eBook_0706c FINAL
PROG_UntoldStory ISV eBook_0706c FINALPROG_UntoldStory ISV eBook_0706c FINAL
PROG_UntoldStory ISV eBook_0706c FINAL
 
Data Gravity, IoT, and Time Series - ThingMonk 2015
Data Gravity, IoT, and Time Series - ThingMonk 2015Data Gravity, IoT, and Time Series - ThingMonk 2015
Data Gravity, IoT, and Time Series - ThingMonk 2015
 
Data Science, Machine Learning and Neural Networks
Data Science, Machine Learning and Neural NetworksData Science, Machine Learning and Neural Networks
Data Science, Machine Learning and Neural Networks
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
 
A Statistician's View on Big Data and Data Science (Version 1)
A Statistician's View on Big Data and Data Science (Version 1)A Statistician's View on Big Data and Data Science (Version 1)
A Statistician's View on Big Data and Data Science (Version 1)
 
Myths and Mathemagical Superpowers of Data Scientists
Myths and Mathemagical Superpowers of Data ScientistsMyths and Mathemagical Superpowers of Data Scientists
Myths and Mathemagical Superpowers of Data Scientists
 
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?Big Data [sorry] & Data Science: What Does a Data Scientist Do?
Big Data [sorry] & Data Science: What Does a Data Scientist Do?
 
Working With Big Data
Working With Big DataWorking With Big Data
Working With Big Data
 

Similaire à DAMA Webinar - Big and Little Data Quality

Big Data Analytics - Opportunities, Enablers, Challenges and Risks to Conside...
Big Data Analytics - Opportunities, Enablers, Challenges and Risks to Conside...Big Data Analytics - Opportunities, Enablers, Challenges and Risks to Conside...
Big Data Analytics - Opportunities, Enablers, Challenges and Risks to Conside...Innovation Enterprise
 
McGrath Health Data Analyst SXSW
McGrath Health Data Analyst SXSWMcGrath Health Data Analyst SXSW
McGrath Health Data Analyst SXSWRobert McGrath
 
Big Data in Medicine
Big Data in MedicineBig Data in Medicine
Big Data in MedicineNasir Arafat
 
Making our practice open and transparent: the potential of electronic data ca...
Making our practice open and transparent: the potential of electronic data ca...Making our practice open and transparent: the potential of electronic data ca...
Making our practice open and transparent: the potential of electronic data ca...Health Informatics New Zealand
 
Getting People to Act
Getting People to ActGetting People to Act
Getting People to ActMedullan
 
The Role of Data Lakes in Healthcare
The Role of Data Lakes in HealthcareThe Role of Data Lakes in Healthcare
The Role of Data Lakes in HealthcarePerficient, Inc.
 
From personal health data to a personalized advice
From personal health data to a personalized adviceFrom personal health data to a personalized advice
From personal health data to a personalized adviceWessel Kraaij
 
اینترنت اشیاء در حوزه سلامت
اینترنت  اشیاء در حوزه سلامت اینترنت  اشیاء در حوزه سلامت
اینترنت اشیاء در حوزه سلامت Mahmood Khosravi
 
Health Data Sharing Scene Setting
Health Data Sharing Scene Setting Health Data Sharing Scene Setting
Health Data Sharing Scene Setting ipposi
 
AMDIS CHIME Fall Symposium
AMDIS CHIME Fall SymposiumAMDIS CHIME Fall Symposium
AMDIS CHIME Fall SymposiumDale Sanders
 
Collaborative Leadership Insights - creating a digital health eco-system
Collaborative Leadership Insights - creating a digital health eco-systemCollaborative Leadership Insights - creating a digital health eco-system
Collaborative Leadership Insights - creating a digital health eco-systemAndrew M Saunders
 
Big Data Provides Opportunities, Challenges and a Better Future in Health and...
Big Data Provides Opportunities, Challenges and a Better Future in Health and...Big Data Provides Opportunities, Challenges and a Better Future in Health and...
Big Data Provides Opportunities, Challenges and a Better Future in Health and...Cirdan
 
Sharing and standards christopher hart - clinical innovation and partnering...
Sharing and standards   christopher hart - clinical innovation and partnering...Sharing and standards   christopher hart - clinical innovation and partnering...
Sharing and standards christopher hart - clinical innovation and partnering...Christopher Hart
 
Oncology Big Data: A Mirage or Oasis of Clinical Value?
Oncology Big Data:  A Mirage or Oasis of Clinical Value? Oncology Big Data:  A Mirage or Oasis of Clinical Value?
Oncology Big Data: A Mirage or Oasis of Clinical Value? Michael Peters
 

Similaire à DAMA Webinar - Big and Little Data Quality (20)

Big Data Analytics - Opportunities, Enablers, Challenges and Risks to Conside...
Big Data Analytics - Opportunities, Enablers, Challenges and Risks to Conside...Big Data Analytics - Opportunities, Enablers, Challenges and Risks to Conside...
Big Data Analytics - Opportunities, Enablers, Challenges and Risks to Conside...
 
McGrath Health Data Analyst SXSW
McGrath Health Data Analyst SXSWMcGrath Health Data Analyst SXSW
McGrath Health Data Analyst SXSW
 
Big Data in Medicine
Big Data in MedicineBig Data in Medicine
Big Data in Medicine
 
Making our practice open and transparent: the potential of electronic data ca...
Making our practice open and transparent: the potential of electronic data ca...Making our practice open and transparent: the potential of electronic data ca...
Making our practice open and transparent: the potential of electronic data ca...
 
Getting People to Act
Getting People to ActGetting People to Act
Getting People to Act
 
The Role of Data Lakes in Healthcare
The Role of Data Lakes in HealthcareThe Role of Data Lakes in Healthcare
The Role of Data Lakes in Healthcare
 
From personal health data to a personalized advice
From personal health data to a personalized adviceFrom personal health data to a personalized advice
From personal health data to a personalized advice
 
BIG DATA.ppt
BIG DATA.pptBIG DATA.ppt
BIG DATA.ppt
 
Day 1: Real-World Data Panel
Day 1: Real-World Data Panel Day 1: Real-World Data Panel
Day 1: Real-World Data Panel
 
اینترنت اشیاء در حوزه سلامت
اینترنت  اشیاء در حوزه سلامت اینترنت  اشیاء در حوزه سلامت
اینترنت اشیاء در حوزه سلامت
 
Health Data Sharing Scene Setting
Health Data Sharing Scene Setting Health Data Sharing Scene Setting
Health Data Sharing Scene Setting
 
Predictive Health Population Analytics
Predictive Health Population AnalyticsPredictive Health Population Analytics
Predictive Health Population Analytics
 
AMDIS CHIME Fall Symposium
AMDIS CHIME Fall SymposiumAMDIS CHIME Fall Symposium
AMDIS CHIME Fall Symposium
 
Dave Taylor
Dave TaylorDave Taylor
Dave Taylor
 
Collaborative Leadership Insights - creating a digital health eco-system
Collaborative Leadership Insights - creating a digital health eco-systemCollaborative Leadership Insights - creating a digital health eco-system
Collaborative Leadership Insights - creating a digital health eco-system
 
Big Data Provides Opportunities, Challenges and a Better Future in Health and...
Big Data Provides Opportunities, Challenges and a Better Future in Health and...Big Data Provides Opportunities, Challenges and a Better Future in Health and...
Big Data Provides Opportunities, Challenges and a Better Future in Health and...
 
Sharing and standards christopher hart - clinical innovation and partnering...
Sharing and standards   christopher hart - clinical innovation and partnering...Sharing and standards   christopher hart - clinical innovation and partnering...
Sharing and standards christopher hart - clinical innovation and partnering...
 
Oncology Big Data: A Mirage or Oasis of Clinical Value?
Oncology Big Data:  A Mirage or Oasis of Clinical Value? Oncology Big Data:  A Mirage or Oasis of Clinical Value?
Oncology Big Data: A Mirage or Oasis of Clinical Value?
 
Digital Healthcare - Detailed Presentation PDF
Digital Healthcare - Detailed Presentation PDFDigital Healthcare - Detailed Presentation PDF
Digital Healthcare - Detailed Presentation PDF
 
Importance of RWD in HEOR: An Industrial Perspective
Importance of RWD in HEOR: An Industrial PerspectiveImportance of RWD in HEOR: An Industrial Perspective
Importance of RWD in HEOR: An Industrial Perspective
 

Plus de DATAVERSITY

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...DATAVERSITY
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceDATAVERSITY
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data LiteracyDATAVERSITY
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsDATAVERSITY
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for YouDATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?DATAVERSITY
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling FundamentalsDATAVERSITY
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectDATAVERSITY
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at ScaleDATAVERSITY
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?DATAVERSITY
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...DATAVERSITY
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsDATAVERSITY
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayDATAVERSITY
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise AnalyticsDATAVERSITY
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best PracticesDATAVERSITY
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?DATAVERSITY
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best PracticesDATAVERSITY
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageDATAVERSITY
 
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...DATAVERSITY
 

Plus de DATAVERSITY (20)

Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
 
Data at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and GovernanceData at the Speed of Business with Data Mastering and Governance
Data at the Speed of Business with Data Mastering and Governance
 
Exploring Levels of Data Literacy
Exploring Levels of Data LiteracyExploring Levels of Data Literacy
Exploring Levels of Data Literacy
 
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business GoalsBuilding a Data Strategy – Practical Steps for Aligning with Business Goals
Building a Data Strategy – Practical Steps for Aligning with Business Goals
 
Make Data Work for You
Make Data Work for YouMake Data Work for You
Make Data Work for You
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Showing ROI for Your Analytic Project
Showing ROI for Your Analytic ProjectShowing ROI for Your Analytic Project
Showing ROI for Your Analytic Project
 
How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?Is Enterprise Data Literacy Possible?
Is Enterprise Data Literacy Possible?
 
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and ForwardsData Governance Trends - A Look Backwards and Forwards
Data Governance Trends - A Look Backwards and Forwards
 
Data Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement TodayData Governance Trends and Best Practices To Implement Today
Data Governance Trends and Best Practices To Implement Today
 
2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics2023 Trends in Enterprise Analytics
2023 Trends in Enterprise Analytics
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?Who Should Own Data Governance – IT or Business?
Who Should Own Data Governance – IT or Business?
 
Data Management Best Practices
Data Management Best PracticesData Management Best Practices
Data Management Best Practices
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
Keeping the Pulse of Your Data – Why You Need Data Observability to Improve D...
 

Dernier

Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfWorld Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfsimulationsindia
 
Rithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfRithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfrahulyadav957181
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...Jack Cole
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 
Statistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfStatistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfnikeshsingh56
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
Role of Consumer Insights in business transformation
Role of Consumer Insights in business transformationRole of Consumer Insights in business transformation
Role of Consumer Insights in business transformationAnnie Melnic
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelBoston Institute of Analytics
 
Digital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfDigital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfNicoChristianSunaryo
 
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...boychatmate1
 
Non Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfNon Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfPratikPatil591646
 
knowledge representation in artificial intelligence
knowledge representation in artificial intelligenceknowledge representation in artificial intelligence
knowledge representation in artificial intelligencePriyadharshiniG41
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectBoston Institute of Analytics
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 

Dernier (20)

Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfWorld Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
 
Rithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdfRithik Kumar Singh codealpha pythohn.pdf
Rithik Kumar Singh codealpha pythohn.pdf
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 
Statistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdfStatistics For Management by Richard I. Levin 8ed.pdf
Statistics For Management by Richard I. Levin 8ed.pdf
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
Role of Consumer Insights in business transformation
Role of Consumer Insights in business transformationRole of Consumer Insights in business transformation
Role of Consumer Insights in business transformation
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
 
Digital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdfDigital Indonesia Report 2024 by We Are Social .pdf
Digital Indonesia Report 2024 by We Are Social .pdf
 
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...
Introduction to Mongo DB-open-­‐source, high-­‐performance, document-­‐orient...
 
Non Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdfNon Text Magic Studio Magic Design for Presentations L&P.pdf
Non Text Magic Studio Magic Design for Presentations L&P.pdf
 
knowledge representation in artificial intelligence
knowledge representation in artificial intelligenceknowledge representation in artificial intelligence
knowledge representation in artificial intelligence
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
Decoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis ProjectDecoding Patterns: Customer Churn Prediction Data Analysis Project
Decoding Patterns: Customer Churn Prediction Data Analysis Project
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 

DAMA Webinar - Big and Little Data Quality

  • 1. LITTLE DATA IN A BIG DATA WORLD The case of health care Dataversity / DAMA February 2016 Laura Sebastian-Coleman, Ph.D., IQCP Offered by: Connecticut General Life Insurance Company or Cigna Health and Life Insurance Company.
  • 2. About me • Doing data quality in health care since 2003 • Have worked in banking, manufacturing, distribution, commercial insurance, and academia. • All have influenced my understanding of data, quality, and measurement • Developed the Data Quality Assessment Framework (DQAF); published in Measuring Data Quality for Ongoing Improvement (2013). • IAIDQ Distinguished Member Award 2015. • DAMA Publications Director, beginning summer 2015 • Influences on my thinking about data: • The challenge of how to measure data quality • The concept of measurement itself: A problems of measurement is a microcosm of the general challenge of data definition and collection. • The demands of data warehousing, especially of data integration. 2
  • 3. • Abstract: – While technological innovation brings constant change to the data landscape, many organizations still struggle with the basics: ensuring they have reliable, high quality data. – In health care, the promise of insight to be gained through analytics is dependent on ensuring the interactions between providers and patients are recorded accurately and completely. – A lot of health care data is dependent on person-to-person contact. This fact influences its quality, as well as how it is captured, stored, accessed and used. – This presentation will ask the audience to think about data in old and new ways in order to gain additional insight about how to improve the quality of data, regardless of size. • Order of information – The promise of big data in health care vs. the reality of the health care system – The data quality in health care – How we think about data and data quality – Making good on the promise of health care analytics Abstract and Agenda 3 Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
  • 4. • According to the blog Big Data Landscape, Big Data Health Care apps enable people and organizations to – Make well-informed decisions – Enhance patient engagement – Encourage prescription medication compliance – Adjust lifestyle choices – Maximize levels of well-being – Predict and prepare for individual illnesses – Manage demand within the health care system • In Health Care Delivery of the Future, Price Waterhouse Coopers describes a New Health Economy in which: – Digitally-enabled care is a fundamental business imperative. – There are major shifts in how care is being delivered. – Digital technology bridges time, distance and the expectation gap between consumers and clinicians. The Promise of Big Data in Health Care 4 Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
  • 5. • At a recent insurance summit in Hartford, Connecticut, health care executives grappled with a quandary: “Only in health care does innovation lead to higher costs.” • Not only that, but within the health care system, there are inefficiencies. One executive estimated “Thirty cents on the dollar, at least, is not used effectively.” • Getting people to act in the best way for their health is “Very easy to say. Very hard to do.” The Reality of Health Care 5 Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
  • 6. • The promise of health care analytics and big data depends on having high quality data. • Technology limitations – Data output from older, legacy claim adjudication systems differs greatly within and between organizations – Medical provider transition from paper to Electronic Health Records (EHR) is relatively recent and still underway – In 2008, only 4% of Providers had full EHR, 13% had a basic system • Lack of standards / standards implemented inconsistently • No standards for data quality Challenges with the Quality of Health Care Data 6 Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
  • 7. • Goal: Improve knowledge of the quality of data gathered in non-clinical trial settings. – As research data warehouses and large scale data networks begin to become established sources of observational data, it becomes more critical that consistent methods for assessing and reporting data quality are developed and adopted so that users of data and consumers of results understand the potential impact of data quality on study results. • Factors that can cause misrepresentation of clinical events: – Inflexible systems – Coding practices – Gaps in standards • Example: Screening for high blood pressure in children – Clinicians were directed to screen children for high blood pressure. – After the initial screening, they continued to monitor some of the children. – When researchers looked at the data, it appeared children were increasingly at risk of high blood pressure. – Examination of records showed that the ‘increase’ was caused by incorrect use of hypertension diagnosis code – But there was not a diagnosis code for “considering hypertension” Academy Health Example 7 Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
  • 8. • Person-to-person interactions between patients and health care providers • Interactions influence factors that may result in different interpretation of symptoms • Interactions are represented by data • Different choices in how to represent these interactions create different data stories Health Care data is about people 8 Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
  • 9. Definition: Data • Data’s Latin root is dare, past participle of to give. Data means “something given.” In math and engineering, the terms data and givens are used interchangeably. • The New Oxford American Dictionary (NOAD) defines data as “facts and statistics collected together for reference or analysis.” • ISO defines data as “re-interpretable representation of information in a formalized manner suitable for communication, interpretation, or processing” (ISO 11179). • Observations about the concept of data – Data tries to tell the truth about the world (“facts”) – Data is formal – it has a shape – Data’s function is representational – Data is often about quantities, measurements, and other numeric representations “facts” – Things are done with data: reference, analysis, interpretation, processing • What the definitions leave out: – Data is made by people. We choose what characteristics to represent. The creation of data implies a set of expectations about data’s condition. – People also use data. The uses of data imply a set of expectations about data’s condition. 9
  • 10. Science – Focuses on measurement as a means to create knowledge – Plans for data that is accurate and complete – Sets standards so that results are collected correctly and can be understood and reproduced – Tests and re-tests • Data is a PRODUCT of science. • High quality data is a necessity. Commerce – Creates measurements pragmatically to meet goals – Generates data by accident while executing its core processes • Data is a BY-PRODUCT of commerce • Historically, data quality has not an end in itself for commerce. Science, Commerce and Data Quality 10 Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna • The History of Measurement = history of our ideas about data • Science & Commerce have different goals for and approaches to measurement and thus to data. • Our ideas about data come largely from science, but we create data based on commerce.
  • 11. The Scientific Method vs. Commerce
  • 12. Data Quality • Our ideas about data quality come largely from science, even though we create data based on commerce. • Today, we are using organizational data in scientific ways – to learn about our business. • We expect the data to be fit for this purpose, but we have not focused on ensuring representational effectiveness. 12 Fitness for purpose Representational effectiveness
  • 13. • Even scientists are struggling with data quality in the age of Big Data. • A February 2015 article in Science News reported researchers are struggling to find insights as they sort through “mounds of data”: “Just keeping track of big data is a monumental undertaking. Sharing the data with other researchers, a critical piece of transparency and efficiency in science, has its own set of problems. And the tools used to analyze complex datasets are just as important as the data themselves. Each time a scientist chooses one computer program over another or decides to investigate one variable rather than a different one, the decision can lead to very different conclusions.” Data Quality and Big Data 13 Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna • The inability to reproduce results has brought some studies into question. • The answer? Study the factors that influence reproducibility. • They have discovered a kind of “butterfly effect” in studies using big data: – Small changes in the initial conditions of an experiment can have significant effects on the outcome of replication attempts.
  • 14. Butterfly effect in the clinical space A) Decision features a. Framing (e.g., gain vs. losses) (2 factors) b. Order of choices (e.g. A à B vs. BàA in a simple two choice decision) (2 factors) c. Choice justification (e.g., effect of regret, guilt etc. on dissonance reduction; yes vs. no) (2 factors) B) Situational factors a. Time pressure (e.g., yes vs no) (2 factors) b. Cognitive load (e.g., high vs. low) (2 factors) c. Social context (e.g., important vs. not important) (2 factors) C) Characteristics of decision-maker a. Individual [e.g., age (old vs. young), gender (female vs. male) (4 factors) b. Group (e.g, small vs. large group) (2 factors) c. Cultural factors (e.g., present vs. not preset/important) (2 factors) D) Individual differences a. Decision styles (e.g. intuitive vs. analytic) (2 factors) b. Cognitive ability (e.g., high vs. low) (2 factors) c. Personality (e.g., openness, conscientious, extraversion, agreeableness, neuroticism) (“Big 5” factors) Table 1. Minimum number of the factors affecting decision-making From Effect of Initial Conditions on Reproducibility of Scientific Research, by Benjamin Djulbegovic and Iztok Hozo • Small changes in the initial conditions of an experiment can have significant effects on the outcome of replication attempts. • Researchers used Doctor/Patient interactions to study the butterfly effect and identified 12 factors that influence clinical decision making. • Those initial factors make up 20,480 combinations that could represent the initial conditions of the experiment. • Yes, 20,480! Initial conditions can influence clinical decision making and the data that is recorded as part of it.
  • 15. The butterfly effect takes on a whole new meaning 15 Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
  • 16. • Recognize that there is variation within the health care system and use this variation to improve the system. • Reduce variation to simplify the system. • Recognize that data is a critical product of the health care system and plan for quality data. Making good on the promises of health care analytics 16 Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
  • 17. • Variation may itself be meaningful. • For example, variation in how doctors make decisions – If we understand how doctors make decisions, and we understand he outcome of those decisions, then we can assess the results of those outcomes and provide that feedback to the health care system. • This is the basic promise of evidence-based medicine and health care analytics, but it requires a lot of data. Making good on the promises of health care analytics: Recognize variation 17 Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
  • 18. • Establish and enforce data standards within health care – Including data quality standards • Collect more data through instruments, devices to reduce ambiguity of meaning – In 2015, PWC reported that 28% of consumers have at least one health related app on their tablet or smart phone. – Up from 13% just three years earlier. – About two-thirds of physicians reported that they would be willing to prescribe an app that would help patients manage a chronic disease. – Being able to integrate this kind of data into an electronic medical record represents an additional opportunity. • Enable feedback about the data from health care providers, patients, and other stakeholders – More health care data is being exposed directly to physicians and to consumers. – Physician compensation and Consumer incentives depend on this data. – To reduce costs and improve quality of care, health care companies will need means to respond to feedback and ensure data is accurate. Making good on the promises of health care analytics: Reduce variation 18 Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
  • 19. • Recognize that improvement in health care and in quality of life depend on having trustworthy, reliable data. – Data is not secondary to successful outcomes, it is central to successful outcomes. • Focus from the start on ensuring the quality of that data, whether that data is little (such as a record of the facts of an office visit) or big (petabytes of biometric data collected from all the Fitbits in the USA). • High quality data is: – Clearly defined – Consistently collected through reliable processes – Presented through comprehensible standards • The processes and systems that we build to create this new health care data should be designed with the data itself in mind. – The industry as a whole must cease to create data as a by-product and recognize data as critical input to its primary goal: helping improve people’s health, well-being and sense of security. Making good on the promises of health care analytics: Plan for data as a product 19 Confidential, unpublished property of Cigna. Do not duplicate or distribute. Use and distribution limited solely to authorized personnel. © 2015 Cigna
  • 20. References • http://www.bigdatalandscape.com/blog/big-data-health-find-apps-can-improve-wellbeing • http://www.pwc.com/us/en/health-industries/top-health-industry-issues/assets/pwc-healthcare-delivery-of- the-future.pdf • http://touch.courant.com/#section/-1/article/p2p-84932278/ • https://en.wikipedia.org/wiki/Electronic_Medical_Record#United_States • http://www.academyhealth.org/ • http://www.pcori.org/about-us/why-pcori-was-created • http://courses.washington.edu/geog482/resource/14_Beyond_Accuracy.pdf • https://www.sciencenews.org/article/redoing-scientific-research-best-way-find-truth • https://www.sciencenews.org/article/big-data-studies-come-replication-challenges • http://www.scopemed.org/fulltextpdf.php?mno=162185 • http://consumerhealthchoices.org/wp-content/uploads/2015/05/HealthDataGuide-June2015.pdf • http://www.pwc.com/us/en/health-industries/top-health-industry-issues/assets/pwc-healthcare-delivery-of- the-future.pdf • http://kff.org/health-reform/fact-sheet/summary-of-the-affordable-care-act/ • http://www.forbes.com/sites#/sites/emc/2014/01/22/can-big-data-and-mobile-make-health-care-more- effective/ • http://www.nytimes.com/2016/01/17/opinion/sunday/how-measurement-fails-doctors-and- teachers.html?_r=0 •
  • 21. All Cigna products and services are provided exclusively by or through operating subsidiaries of Cigna Corporation, including Cigna Health and Life Insurance Company, Connecticut General Life Insurance Company, Cigna Behavioral Health, Inc., and HMO or service company subsidiaries of Cigna Health Corporation. The Cigna name, logo, and other Cigna marks are owned by Cigna Intellectual Property, Inc. 000000 00/15 © 2015 Cigna. Some content provided under license.