SlideShare une entreprise Scribd logo
1  sur  30
Agile Analysis 101 
Part 1: Introducing Basic Analysis
Analysis for Dummy’s, Dummies 
• Most Agile Teams 
– Can’t Identify Influential Delivery Factors Plus… 
– …Over-reliance on Cycle-Time & Throughput 
– Equals Shooting in the Dark! 
• Little’s Law Applies Only When ‘Predictable’ 
• ‘Fixed’ Mathematics Doesn’t Adequately Facilitate Self-Organisation 
– Morphogenesis & Chaos 
• Too hard for most 
• We don’t know enough (yet) 
• Enterprise Mathematical Models too Hard or Based on Unrealistic 
Assumptions 
– e.g. Efficient Market Hypothesis, 
• required rational investor 
• What can Agilists Do?
Analysis Forms 
Controlling Maths v Agile Stats
Traditional Mathematical Analysis 
 Modelled the environment in its entirety 
 Every variable identified and mapped 
 Every factor had to be understood in detail 
 …and managed 
 Fit command-and-control really well! 
 Provided an Exact answer 
 Useful comfort blanket 
 Exclusivity - Very few people understood it 
 Needed Masters & PhDs in numerate subjects 
 MBA’s not always enough 
 Mathematics, Physics, Operation Research, Engineering…
Area of a Circle (Traditional Way) 
• Given origin (h,k) & radius 
r 
• Typically learned for GCSE 
• Have to know: 
– equation 
– r is a factor & how to get it 
– What ‘squared’ means 
– Pi is a constant 
– Know maths 
• What if you didn’t? 
Source: Google Images
Statistical Analysis 
 Doesn’t require exact model 
 Doesn’t produce an exact answer 
 Do you need one? 
 Can you rely on one? 
 Isn’t variable/factor centric 
 Though they may come out 
 Looks for correlations 
 Which tell you where else to look for more 
 CAREFUL! Correlations aren’t causations! 
 If you find a link, it doesn’t necessarily mean it’s so 
 Can be refined, akin to ‘learning’ 
 Increasing number of samples in known range 
 …akin to reducing Kanban batch size or story size 
 Can also use Bayesian Inference 
 Fits Lean-Agility really well 
 A 10-year old can often do it!
Area of a Circle (Statistical Way) 
• Grid around the Circle 
• Count Squares at least 
half inside circle 
• Need more accuracy? 
Easy! Use finer grid! 
• Typically learned at 10 
years old! 
Question 
Take a look at the examples on the 
right, which grid is closer to Actual 
Area? 
8 x 8 x 1cm Grid 
Diameter = 8 x 1cm squares = 8cm 
Radius = Half diameter i.e. 8/2 = 4cm 
Area is the number of squares at 
least half inside circle. 
52 squares: 52x(1x1) = 52cm2 
20 x 20 x 0.4cm Grid 
Diameter = 20 x 0.4cm squares = 8cm 
Radius = Half diameter i.e. 8/2 = 4cm 
Area is the number of squares at least 
half inside circle. 
312 squares: 312x(0.4x0.4) = 49.92cm2 
Actual Area 
When r = 4 
Area = Pi x (4 x 4) 
= 50.27cm2 
Image Source: Google Images
Compare to Kanban 
• Backlog the Tickets 
• Batch together related 
epic tickets 
• If you need more 
accuracy, make the 
batches smaller! 
– …and/or sprints shorter
Technical Note! 
• Statistical form is standard in Monte Carlo 
Algorithms 
– Always Fast to run… 
– …But ‘probably’ correct 
• In any case, accurate to a particular range 
• If that range is good enough use it!
What is Good Enough? 
Guide to a Nebulous Term
Definition of Good Enough? 
Definitions 
What I tell Managers: “Any measure with an accuracy 
matching your ability to change, is good enough.” 
What I tell Techies: “Sampling twice as frequent as the 
change, is good enough.” 
- Ethar Alali 
• Any more accurate/frequent is waste 
• Any less and you can’t make decisions 
– So risk mitigation strategy may be necessary
Example: CD Quality Sound 
In ye olden days 
we had these 
• 44.1kHz sample rate 
• Stereo Sound 
• 16-bit Digital Sampling 
• CD stores 650MB 
Compact Disc 
Image Source: Google Images
Example: Compact Disc Encoding 
Focusing on Useful Data Storage 
Ignore Reed-Solomon error correction & detection 
Signed 16-bit number can segment audio into ~ 1/65,536 parts 
44.1kHz means it takes 1x 16 bit number in this range every 1/44,100ths of a second 
Stereo sound means two sets of microphones and hence 2 sample channels 
Total storage needs for a 3 minute song: 
• 44,100 samples x 2 bytes per sample x 2 channels x 3minute x 60 seconds = 31.752MB raw per song. 
• Album = 20 songs = 635 MB of digital data, which fills a 650MB CD 
Great for music :-) 
Attribution: Image Courtesy of Grahammitchell.com
What About: Telephone Voice on CD? 
Voice on Telephones is mono not stereo 
Needs only one channel! 
Telephone quality changes pitch in 3K at worst! 
Voice doesn’t have the refined nature of music! Hence can be recorded in 8-bit (256 parts) 
3kHz means it takes 1 x 8 bit number in this range ever 1/3,000th of a second 
Total storage needs for a 3 minute conversation: 
• 3,000 samples x 1 bytes per sample x 1 channels x 3 minute x 60 seconds = 540KB raw. 
• Album = 20 songs = 10.8 MB of digital data 
Stored on 650MB CD, you have almost 640MB of WASTE!
Attribution: Image Courtesy of Grahammitchell.com 
What If: We sampled less? 
• Not an Accurate Picture! 
Note: 
Dashed red edge case, which samples exactly at transition points. In 
real scenarios this never happens with sound since change isn’t 
periodic. 
RED = 2/3 as fast sampling 
AMBER = Twice as frequent sampling 
GREEN = 4 times as frequent
Which is Closer to Actual? 
RED = 2/3 as fast sampling 
AMBER = Twice as frequent sampling 
GREEN = 4 times as frequent
Traditional Samples in Business 
• Annual Accounts 
– Plc’s have mid-term or quarterly accounts 
– If they want to be more agile, make it monthly 
• Regulatory Reporting 
• Charity Commission Reports 
• Franchises Brand Inspections 
– Once every 2-3 years, inspected annually 
• FCA 
• … 
Identify: Easy! Usually associated with ‘Audit’ of some kind. 
• Self-governing/managing teams Sample themselves!
Correlation != Causation 
What they are, How to find them and 
What they mean
Causation 
• One thing occurs as a deterministic consequence of something else 
– Fingers in high-voltage socket causes death 
• Link a number of causes to establish behaviour 
• Needs Two Factors 
– Functional process, including all variables 
– Initial condition (aka Pre-condition) 
• ‘Given’ in Gherkin syntax 
• Great for Forecasting… 
– As long as causal-chain always happen 
• Near useless in chaotic environments 
– Depending on when you look at it 
• Initial condition may not be known 
• Sensitive dependence + Feedback injects uncertainty! 
• Code runs deterministically, teams normally work chaotically… 
• …until they reach predictability, then Little’s Law can apply
Example: Causation 
• y = 2 + x <- function/process 
• x = 3 <- Initial [pre]condition 
• y = 5 <- Final outcome/post-condition 
• Post-condition = acceptance test criteria 
– ‘Then’ in Gherkin Syntax 
• Really easy for code! Mostly predictable 
– Fits Gherkin, OCL, VDM, Z etc. perfectly
Correlation 
• Aims to find [statistical] links between samples 
– When causal links not known or samples appear ‘random’ 
– Also shows strength of relationship 
• First step in Factor Analysis 
– Locate influential factors for dependent variables 
• Cycle-time 
• Throughput 
• Value delivered 
• Can be plotted on graph 
• Needs Manipulation to Fit Gherkin :( 
• All aim to locate where to sniff next!
Correlations Can Be Seen 
• Correlations can be 
modelled with Linear 
Regression 
• Seen when an increase in 
one variable 
increases/decreases another 
Source: Scatterplot Image from knottwiki teaching 
Source: Image from Utah.edu Mesowest weather
Example: Burnage Library 
• Correlation Matrix
Example: Burnage Library 
• Manchester City Council claim: Library closure based on 11 
variables for deprivation 
– Tasked with saving £80 million a year 
• Correlation matrix showed strong correlations between 
Population of Library catchment area &: 
– Total Library Visitors – Larger catchments correlate with more 
library visitors 
– Active users – Larger catchments correlate with more active 
users 
– Participation in Events 
– … 
• But all factors correlated with each other!
Dependent v Independent Correlation 
Very High Correlations of dependent combined score & other allegedly 
independent factors with catchment population
Independent Variable Inter-correlation 
Lead to Q: How come they are so highly correlated? 
A: High Inter-correlation between independent variables!
Correlation: Deprivation 
Q:Was deprivation a factor? A: Deprivation wasn’t a significant consideration, 
despite the claims of Council
Example: Burnage Library Conclusion 
• Basics showed that claims weren't supported 
– Could have done better with Null Hypothesis 
• Interdependence of allegedly independent 
variables meant weighting of catchment area 
5x more important than deprivation 
– Not likely based on deprivation index, as was 
claimed 
– Potentially hinting at a political decision 
• Controversial ;)
NEXT TIME: Agile Teams 
• In Part 2, we examine how this applies to teams. 
• In summary: 
– Gather Cycle-time, Throughput & Value delivered across a few 
sprints 
– Match & Correlate Respective 
• Bugs 
• Blockers 
• Days of week 
• Team size 
• Story 
• Anything else you already have data for 
• Don’t 
– Make too many inferences early on
Thanks for Viewing 
Further Reading 
Business Planning Example 
http://www.solver.com/monte-carlo-simulation-example 
Monte Carlo Simulation Tutorial in Excel 
“Statistics in Psychosocial Research, Lecture 8 Factor Analysis I” John Hopkins University 
http://ocw.jhsph.edu/courses/statisticspsychosocialresearch/pdfs/lecture8.pdf) 
“Correlation & Dependence” Wikipedia 
http://en.wikipedia.org/wiki/Correlation_and_dependence 
Ethar Alali @EtharUK @Dynacognetics 
Managing Director & Chief Architect 
Polymath-MathMo. Programming since 9 years old. TOGAF 9 Certified, change 
agent. 
Blog: GoadingtheITGeek.blogspot.co.uk 
About Us 
Specialist ICT Strategists & Advisors. 
Member of HiveMind Network for some of 
the biggest household and corporate multi-nationals. 
Accredited Growth Voucher Advisors 
certified to deliver IT & Web Growth 
Consultancy as part of the government’s 
Growth Voucher Scheme. 
Accreditations & Associations

Contenu connexe

Tendances

Agile 2014 Software Moneyball (Troy Magennis)
Agile 2014   Software Moneyball (Troy Magennis)Agile 2014   Software Moneyball (Troy Magennis)
Agile 2014 Software Moneyball (Troy Magennis)Troy Magennis
 
Making Sense of Statistics in HCI: part 3 - gaining power
Making Sense of Statistics in HCI: part 3 - gaining powerMaking Sense of Statistics in HCI: part 3 - gaining power
Making Sense of Statistics in HCI: part 3 - gaining powerAlan Dix
 
Velocity Europe 2013: Beyond Pretty Charts: Analytics for the cloud infrastru...
Velocity Europe 2013: Beyond Pretty Charts: Analytics for the cloud infrastru...Velocity Europe 2013: Beyond Pretty Charts: Analytics for the cloud infrastru...
Velocity Europe 2013: Beyond Pretty Charts: Analytics for the cloud infrastru...tboubez
 
Making Sense of Statistics in HCI: From P to Bayes and Beyond – introduction
Making Sense of Statistics in HCI: From P to Bayes and Beyond  – introductionMaking Sense of Statistics in HCI: From P to Bayes and Beyond  – introduction
Making Sense of Statistics in HCI: From P to Bayes and Beyond – introductionAlan Dix
 
Making Sense of Statistics in HCI: part 4 - so what
Making Sense of Statistics in HCI:  part 4 - so whatMaking Sense of Statistics in HCI:  part 4 - so what
Making Sense of Statistics in HCI: part 4 - so whatAlan Dix
 
Behaviour change and intervention research
Behaviour change and intervention researchBehaviour change and intervention research
Behaviour change and intervention researchMatti Heino
 
Rapid Qualitative Inquiry (2nd Edition)
Rapid Qualitative Inquiry (2nd Edition)Rapid Qualitative Inquiry (2nd Edition)
Rapid Qualitative Inquiry (2nd Edition)James Beebe
 

Tendances (7)

Agile 2014 Software Moneyball (Troy Magennis)
Agile 2014   Software Moneyball (Troy Magennis)Agile 2014   Software Moneyball (Troy Magennis)
Agile 2014 Software Moneyball (Troy Magennis)
 
Making Sense of Statistics in HCI: part 3 - gaining power
Making Sense of Statistics in HCI: part 3 - gaining powerMaking Sense of Statistics in HCI: part 3 - gaining power
Making Sense of Statistics in HCI: part 3 - gaining power
 
Velocity Europe 2013: Beyond Pretty Charts: Analytics for the cloud infrastru...
Velocity Europe 2013: Beyond Pretty Charts: Analytics for the cloud infrastru...Velocity Europe 2013: Beyond Pretty Charts: Analytics for the cloud infrastru...
Velocity Europe 2013: Beyond Pretty Charts: Analytics for the cloud infrastru...
 
Making Sense of Statistics in HCI: From P to Bayes and Beyond – introduction
Making Sense of Statistics in HCI: From P to Bayes and Beyond  – introductionMaking Sense of Statistics in HCI: From P to Bayes and Beyond  – introduction
Making Sense of Statistics in HCI: From P to Bayes and Beyond – introduction
 
Making Sense of Statistics in HCI: part 4 - so what
Making Sense of Statistics in HCI:  part 4 - so whatMaking Sense of Statistics in HCI:  part 4 - so what
Making Sense of Statistics in HCI: part 4 - so what
 
Behaviour change and intervention research
Behaviour change and intervention researchBehaviour change and intervention research
Behaviour change and intervention research
 
Rapid Qualitative Inquiry (2nd Edition)
Rapid Qualitative Inquiry (2nd Edition)Rapid Qualitative Inquiry (2nd Edition)
Rapid Qualitative Inquiry (2nd Edition)
 

Similaire à Agile Analysis 101: Agile Stats v Command & Control Maths

Core Methods In Educational Data Mining
Core Methods In Educational Data MiningCore Methods In Educational Data Mining
Core Methods In Educational Data Miningebelani
 
Validity and Reliability of Cranfield-like Evaluation in Information Retrieval
Validity and Reliability of Cranfield-like Evaluation in Information RetrievalValidity and Reliability of Cranfield-like Evaluation in Information Retrieval
Validity and Reliability of Cranfield-like Evaluation in Information RetrievalJulián Urbano
 
Human computation, crowdsourcing and social: An industrial perspective
Human computation, crowdsourcing and social: An industrial perspectiveHuman computation, crowdsourcing and social: An industrial perspective
Human computation, crowdsourcing and social: An industrial perspectiveoralonso
 
Recommender Systems in a nutshell
Recommender Systems in a nutshellRecommender Systems in a nutshell
Recommender Systems in a nutshellKonstantin Savenkov
 
Understanding Deep Learning Requires Rethinking Generalization
Understanding Deep Learning Requires Rethinking GeneralizationUnderstanding Deep Learning Requires Rethinking Generalization
Understanding Deep Learning Requires Rethinking GeneralizationAhmet Kuzubaşlı
 
2013 py con awesome big data algorithms
2013 py con awesome big data algorithms2013 py con awesome big data algorithms
2013 py con awesome big data algorithmsc.titus.brown
 
Three methodological issues for system dynamics practice
Three methodological issues for system dynamics practiceThree methodological issues for system dynamics practice
Three methodological issues for system dynamics practiceAndreas Größler
 
What Questions Are Worth Answering?
What Questions Are Worth Answering?What Questions Are Worth Answering?
What Questions Are Worth Answering?Ehren Reilly
 
Exploratory Data Analysis week 4
Exploratory Data Analysis week 4Exploratory Data Analysis week 4
Exploratory Data Analysis week 4Manzur Ashraf
 
Data Science 101
Data Science 101Data Science 101
Data Science 101ideatoipo
 
Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...
Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...
Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...tboubez
 
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talkNYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talkVivian S. Zhang
 

Similaire à Agile Analysis 101: Agile Stats v Command & Control Maths (20)

Data Mining Lecture_2.pptx
Data Mining Lecture_2.pptxData Mining Lecture_2.pptx
Data Mining Lecture_2.pptx
 
205_April_22.pptx
205_April_22.pptx205_April_22.pptx
205_April_22.pptx
 
Core Methods In Educational Data Mining
Core Methods In Educational Data MiningCore Methods In Educational Data Mining
Core Methods In Educational Data Mining
 
Validity and Reliability of Cranfield-like Evaluation in Information Retrieval
Validity and Reliability of Cranfield-like Evaluation in Information RetrievalValidity and Reliability of Cranfield-like Evaluation in Information Retrieval
Validity and Reliability of Cranfield-like Evaluation in Information Retrieval
 
07 Handling of Uncertainties in the Safety Case
07 Handling of Uncertainties in the Safety Case07 Handling of Uncertainties in the Safety Case
07 Handling of Uncertainties in the Safety Case
 
Human computation, crowdsourcing and social: An industrial perspective
Human computation, crowdsourcing and social: An industrial perspectiveHuman computation, crowdsourcing and social: An industrial perspective
Human computation, crowdsourcing and social: An industrial perspective
 
Recommender Systems in a nutshell
Recommender Systems in a nutshellRecommender Systems in a nutshell
Recommender Systems in a nutshell
 
CS194Lec0hbh6EDA.pptx
CS194Lec0hbh6EDA.pptxCS194Lec0hbh6EDA.pptx
CS194Lec0hbh6EDA.pptx
 
Turning Information chaos into reliable data
Turning Information chaos into reliable dataTurning Information chaos into reliable data
Turning Information chaos into reliable data
 
Understanding Deep Learning Requires Rethinking Generalization
Understanding Deep Learning Requires Rethinking GeneralizationUnderstanding Deep Learning Requires Rethinking Generalization
Understanding Deep Learning Requires Rethinking Generalization
 
attention-focus on what matters
attention-focus on what mattersattention-focus on what matters
attention-focus on what matters
 
2013 py con awesome big data algorithms
2013 py con awesome big data algorithms2013 py con awesome big data algorithms
2013 py con awesome big data algorithms
 
Three methodological issues for system dynamics practice
Three methodological issues for system dynamics practiceThree methodological issues for system dynamics practice
Three methodological issues for system dynamics practice
 
AL slides.ppt
AL slides.pptAL slides.ppt
AL slides.ppt
 
What Questions Are Worth Answering?
What Questions Are Worth Answering?What Questions Are Worth Answering?
What Questions Are Worth Answering?
 
Exploratory Data Analysis week 4
Exploratory Data Analysis week 4Exploratory Data Analysis week 4
Exploratory Data Analysis week 4
 
Data Science 101
Data Science 101Data Science 101
Data Science 101
 
Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...
Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...
Five Things I Learned While Building Anomaly Detection Tools - Toufic Boubez ...
 
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talkNYC Open Data Meetup-- Thoughtworks chief data scientist talk
NYC Open Data Meetup-- Thoughtworks chief data scientist talk
 
E3 chap-09
E3 chap-09E3 chap-09
E3 chap-09
 

Plus de Axelisys Limited

Why Health-Climate-Economics
Why Health-Climate-EconomicsWhy Health-Climate-Economics
Why Health-Climate-EconomicsAxelisys Limited
 
BarCamp Manchester 2016: Neuro, fuzzyio, logical
BarCamp Manchester 2016: Neuro, fuzzyio, logicalBarCamp Manchester 2016: Neuro, fuzzyio, logical
BarCamp Manchester 2016: Neuro, fuzzyio, logicalAxelisys Limited
 
Ethar Alali - Agile Yorkshire September 2015
Ethar Alali - Agile Yorkshire September 2015Ethar Alali - Agile Yorkshire September 2015
Ethar Alali - Agile Yorkshire September 2015Axelisys Limited
 
Taming Uncertainty: Planning Robust A/B-Testing
Taming Uncertainty: Planning Robust A/B-TestingTaming Uncertainty: Planning Robust A/B-Testing
Taming Uncertainty: Planning Robust A/B-TestingAxelisys Limited
 
Analysis 101: What is a System?
Analysis 101: What is a System?Analysis 101: What is a System?
Analysis 101: What is a System?Axelisys Limited
 
Analysis 101 correlation v causation
Analysis 101   correlation v causationAnalysis 101   correlation v causation
Analysis 101 correlation v causationAxelisys Limited
 
What is A/B-testing? An Introduction
What is A/B-testing? An IntroductionWhat is A/B-testing? An Introduction
What is A/B-testing? An IntroductionAxelisys Limited
 
Agile Estimation @ Lean Agile Manchester: Make Estimates Small!
Agile Estimation @ Lean Agile Manchester: Make Estimates Small!Agile Estimation @ Lean Agile Manchester: Make Estimates Small!
Agile Estimation @ Lean Agile Manchester: Make Estimates Small!Axelisys Limited
 

Plus de Axelisys Limited (12)

Why Health-Climate-Economics
Why Health-Climate-EconomicsWhy Health-Climate-Economics
Why Health-Climate-Economics
 
Agile Games CRM Saturday
Agile Games CRM SaturdayAgile Games CRM Saturday
Agile Games CRM Saturday
 
BarCamp Manchester 2016: Neuro, fuzzyio, logical
BarCamp Manchester 2016: Neuro, fuzzyio, logicalBarCamp Manchester 2016: Neuro, fuzzyio, logical
BarCamp Manchester 2016: Neuro, fuzzyio, logical
 
Agile games
Agile gamesAgile games
Agile games
 
Ethar Alali - Agile Yorkshire September 2015
Ethar Alali - Agile Yorkshire September 2015Ethar Alali - Agile Yorkshire September 2015
Ethar Alali - Agile Yorkshire September 2015
 
Taming Uncertainty: Planning Robust A/B-Testing
Taming Uncertainty: Planning Robust A/B-TestingTaming Uncertainty: Planning Robust A/B-Testing
Taming Uncertainty: Planning Robust A/B-Testing
 
Start-Up: A Call To Arms
Start-Up: A Call To ArmsStart-Up: A Call To Arms
Start-Up: A Call To Arms
 
Analysis 101: What is a System?
Analysis 101: What is a System?Analysis 101: What is a System?
Analysis 101: What is a System?
 
Analysis 101 correlation v causation
Analysis 101   correlation v causationAnalysis 101   correlation v causation
Analysis 101 correlation v causation
 
What is A/B-testing? An Introduction
What is A/B-testing? An IntroductionWhat is A/B-testing? An Introduction
What is A/B-testing? An Introduction
 
Agile Estimation @ Lean Agile Manchester: Make Estimates Small!
Agile Estimation @ Lean Agile Manchester: Make Estimates Small!Agile Estimation @ Lean Agile Manchester: Make Estimates Small!
Agile Estimation @ Lean Agile Manchester: Make Estimates Small!
 
What is Cloud Computing?
What is Cloud Computing?What is Cloud Computing?
What is Cloud Computing?
 

Dernier

Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataTecnoIncentive
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data VisualizationKianJazayeri1
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Cathrine Wilhelmsen
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfSubhamKumar3239
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingsocarem879
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxHaritikaChhatwal1
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxHimangsuNath
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Boston Institute of Analytics
 

Dernier (20)

Cyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded dataCyber awareness ppt on the recorded data
Cyber awareness ppt on the recorded data
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
Principles and Practices of Data Visualization
Principles and Practices of Data VisualizationPrinciples and Practices of Data Visualization
Principles and Practices of Data Visualization
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)Data Factory in Microsoft Fabric (MsBIP #82)
Data Factory in Microsoft Fabric (MsBIP #82)
 
convolutional neural network and its applications.pdf
convolutional neural network and its applications.pdfconvolutional neural network and its applications.pdf
convolutional neural network and its applications.pdf
 
INTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processingINTRODUCTION TO Natural language processing
INTRODUCTION TO Natural language processing
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptx
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptx
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
Data Analysis Project Presentation: Unveiling Your Ideal Customer, Bank Custo...
 

Agile Analysis 101: Agile Stats v Command & Control Maths

  • 1. Agile Analysis 101 Part 1: Introducing Basic Analysis
  • 2. Analysis for Dummy’s, Dummies • Most Agile Teams – Can’t Identify Influential Delivery Factors Plus… – …Over-reliance on Cycle-Time & Throughput – Equals Shooting in the Dark! • Little’s Law Applies Only When ‘Predictable’ • ‘Fixed’ Mathematics Doesn’t Adequately Facilitate Self-Organisation – Morphogenesis & Chaos • Too hard for most • We don’t know enough (yet) • Enterprise Mathematical Models too Hard or Based on Unrealistic Assumptions – e.g. Efficient Market Hypothesis, • required rational investor • What can Agilists Do?
  • 3. Analysis Forms Controlling Maths v Agile Stats
  • 4. Traditional Mathematical Analysis  Modelled the environment in its entirety  Every variable identified and mapped  Every factor had to be understood in detail  …and managed  Fit command-and-control really well!  Provided an Exact answer  Useful comfort blanket  Exclusivity - Very few people understood it  Needed Masters & PhDs in numerate subjects  MBA’s not always enough  Mathematics, Physics, Operation Research, Engineering…
  • 5. Area of a Circle (Traditional Way) • Given origin (h,k) & radius r • Typically learned for GCSE • Have to know: – equation – r is a factor & how to get it – What ‘squared’ means – Pi is a constant – Know maths • What if you didn’t? Source: Google Images
  • 6. Statistical Analysis  Doesn’t require exact model  Doesn’t produce an exact answer  Do you need one?  Can you rely on one?  Isn’t variable/factor centric  Though they may come out  Looks for correlations  Which tell you where else to look for more  CAREFUL! Correlations aren’t causations!  If you find a link, it doesn’t necessarily mean it’s so  Can be refined, akin to ‘learning’  Increasing number of samples in known range  …akin to reducing Kanban batch size or story size  Can also use Bayesian Inference  Fits Lean-Agility really well  A 10-year old can often do it!
  • 7. Area of a Circle (Statistical Way) • Grid around the Circle • Count Squares at least half inside circle • Need more accuracy? Easy! Use finer grid! • Typically learned at 10 years old! Question Take a look at the examples on the right, which grid is closer to Actual Area? 8 x 8 x 1cm Grid Diameter = 8 x 1cm squares = 8cm Radius = Half diameter i.e. 8/2 = 4cm Area is the number of squares at least half inside circle. 52 squares: 52x(1x1) = 52cm2 20 x 20 x 0.4cm Grid Diameter = 20 x 0.4cm squares = 8cm Radius = Half diameter i.e. 8/2 = 4cm Area is the number of squares at least half inside circle. 312 squares: 312x(0.4x0.4) = 49.92cm2 Actual Area When r = 4 Area = Pi x (4 x 4) = 50.27cm2 Image Source: Google Images
  • 8. Compare to Kanban • Backlog the Tickets • Batch together related epic tickets • If you need more accuracy, make the batches smaller! – …and/or sprints shorter
  • 9. Technical Note! • Statistical form is standard in Monte Carlo Algorithms – Always Fast to run… – …But ‘probably’ correct • In any case, accurate to a particular range • If that range is good enough use it!
  • 10. What is Good Enough? Guide to a Nebulous Term
  • 11. Definition of Good Enough? Definitions What I tell Managers: “Any measure with an accuracy matching your ability to change, is good enough.” What I tell Techies: “Sampling twice as frequent as the change, is good enough.” - Ethar Alali • Any more accurate/frequent is waste • Any less and you can’t make decisions – So risk mitigation strategy may be necessary
  • 12. Example: CD Quality Sound In ye olden days we had these • 44.1kHz sample rate • Stereo Sound • 16-bit Digital Sampling • CD stores 650MB Compact Disc Image Source: Google Images
  • 13. Example: Compact Disc Encoding Focusing on Useful Data Storage Ignore Reed-Solomon error correction & detection Signed 16-bit number can segment audio into ~ 1/65,536 parts 44.1kHz means it takes 1x 16 bit number in this range every 1/44,100ths of a second Stereo sound means two sets of microphones and hence 2 sample channels Total storage needs for a 3 minute song: • 44,100 samples x 2 bytes per sample x 2 channels x 3minute x 60 seconds = 31.752MB raw per song. • Album = 20 songs = 635 MB of digital data, which fills a 650MB CD Great for music :-) Attribution: Image Courtesy of Grahammitchell.com
  • 14. What About: Telephone Voice on CD? Voice on Telephones is mono not stereo Needs only one channel! Telephone quality changes pitch in 3K at worst! Voice doesn’t have the refined nature of music! Hence can be recorded in 8-bit (256 parts) 3kHz means it takes 1 x 8 bit number in this range ever 1/3,000th of a second Total storage needs for a 3 minute conversation: • 3,000 samples x 1 bytes per sample x 1 channels x 3 minute x 60 seconds = 540KB raw. • Album = 20 songs = 10.8 MB of digital data Stored on 650MB CD, you have almost 640MB of WASTE!
  • 15. Attribution: Image Courtesy of Grahammitchell.com What If: We sampled less? • Not an Accurate Picture! Note: Dashed red edge case, which samples exactly at transition points. In real scenarios this never happens with sound since change isn’t periodic. RED = 2/3 as fast sampling AMBER = Twice as frequent sampling GREEN = 4 times as frequent
  • 16. Which is Closer to Actual? RED = 2/3 as fast sampling AMBER = Twice as frequent sampling GREEN = 4 times as frequent
  • 17. Traditional Samples in Business • Annual Accounts – Plc’s have mid-term or quarterly accounts – If they want to be more agile, make it monthly • Regulatory Reporting • Charity Commission Reports • Franchises Brand Inspections – Once every 2-3 years, inspected annually • FCA • … Identify: Easy! Usually associated with ‘Audit’ of some kind. • Self-governing/managing teams Sample themselves!
  • 18. Correlation != Causation What they are, How to find them and What they mean
  • 19. Causation • One thing occurs as a deterministic consequence of something else – Fingers in high-voltage socket causes death • Link a number of causes to establish behaviour • Needs Two Factors – Functional process, including all variables – Initial condition (aka Pre-condition) • ‘Given’ in Gherkin syntax • Great for Forecasting… – As long as causal-chain always happen • Near useless in chaotic environments – Depending on when you look at it • Initial condition may not be known • Sensitive dependence + Feedback injects uncertainty! • Code runs deterministically, teams normally work chaotically… • …until they reach predictability, then Little’s Law can apply
  • 20. Example: Causation • y = 2 + x <- function/process • x = 3 <- Initial [pre]condition • y = 5 <- Final outcome/post-condition • Post-condition = acceptance test criteria – ‘Then’ in Gherkin Syntax • Really easy for code! Mostly predictable – Fits Gherkin, OCL, VDM, Z etc. perfectly
  • 21. Correlation • Aims to find [statistical] links between samples – When causal links not known or samples appear ‘random’ – Also shows strength of relationship • First step in Factor Analysis – Locate influential factors for dependent variables • Cycle-time • Throughput • Value delivered • Can be plotted on graph • Needs Manipulation to Fit Gherkin :( • All aim to locate where to sniff next!
  • 22. Correlations Can Be Seen • Correlations can be modelled with Linear Regression • Seen when an increase in one variable increases/decreases another Source: Scatterplot Image from knottwiki teaching Source: Image from Utah.edu Mesowest weather
  • 23. Example: Burnage Library • Correlation Matrix
  • 24. Example: Burnage Library • Manchester City Council claim: Library closure based on 11 variables for deprivation – Tasked with saving £80 million a year • Correlation matrix showed strong correlations between Population of Library catchment area &: – Total Library Visitors – Larger catchments correlate with more library visitors – Active users – Larger catchments correlate with more active users – Participation in Events – … • But all factors correlated with each other!
  • 25. Dependent v Independent Correlation Very High Correlations of dependent combined score & other allegedly independent factors with catchment population
  • 26. Independent Variable Inter-correlation Lead to Q: How come they are so highly correlated? A: High Inter-correlation between independent variables!
  • 27. Correlation: Deprivation Q:Was deprivation a factor? A: Deprivation wasn’t a significant consideration, despite the claims of Council
  • 28. Example: Burnage Library Conclusion • Basics showed that claims weren't supported – Could have done better with Null Hypothesis • Interdependence of allegedly independent variables meant weighting of catchment area 5x more important than deprivation – Not likely based on deprivation index, as was claimed – Potentially hinting at a political decision • Controversial ;)
  • 29. NEXT TIME: Agile Teams • In Part 2, we examine how this applies to teams. • In summary: – Gather Cycle-time, Throughput & Value delivered across a few sprints – Match & Correlate Respective • Bugs • Blockers • Days of week • Team size • Story • Anything else you already have data for • Don’t – Make too many inferences early on
  • 30. Thanks for Viewing Further Reading Business Planning Example http://www.solver.com/monte-carlo-simulation-example Monte Carlo Simulation Tutorial in Excel “Statistics in Psychosocial Research, Lecture 8 Factor Analysis I” John Hopkins University http://ocw.jhsph.edu/courses/statisticspsychosocialresearch/pdfs/lecture8.pdf) “Correlation & Dependence” Wikipedia http://en.wikipedia.org/wiki/Correlation_and_dependence Ethar Alali @EtharUK @Dynacognetics Managing Director & Chief Architect Polymath-MathMo. Programming since 9 years old. TOGAF 9 Certified, change agent. Blog: GoadingtheITGeek.blogspot.co.uk About Us Specialist ICT Strategists & Advisors. Member of HiveMind Network for some of the biggest household and corporate multi-nationals. Accredited Growth Voucher Advisors certified to deliver IT & Web Growth Consultancy as part of the government’s Growth Voucher Scheme. Accreditations & Associations