Organizational Structure Running A Successful Business
Craig Sullivan - Oh Boy! These A/B tests look like total bullshit! MKTFEST 2014
1. Oh
Boy!
These A/B
tests look like
total bullshit!
@OptimiseOrDie
2. @OptimiseOrDie
• UX, Analytics, Split Testing and Growth Rate Optimisation
• Started doing testing & CRO 2004
• Split tested over 40M visitors in 19 languages
• 60+ mistakes I MADE with AB testing
• Like riding a bike…
• Want to optimise your optimisation? Get in touch!
3.
4. Top
Tes'ng
F***ups
for
2014
1. Tes'ng
in
the
wrong
place
2. Your
hypothesis
inputs
are
crap
3. No
analy'cs
integra'on
4. Your
test
will
finish
a=er
you
die
5. You
don’t
test
for
long
enough
6. You
peek
before
it’s
ready
7. No
QA
for
your
split
test
8. Opportuni'es
are
not
priori'sed
9. Tes'ng
cycles
are
too
slow
10. You
don’t
know
when
tests
are
ready
11. Your
test
fails
12. The
test
is
‘about
the
same’
13. Test
flips
behaviour
14. Test
keeps
moving
around
15. You
run
an
A/A
test
and
waste
'me
16. Nobody
‘feels’
the
test
17. You
forgot
you
were
responsive
18. You
forgot
you
had
no
traffic
19. You
ran
the
wrong
test
type
20. You
didn’t
try
all
the
flavours
of
tes'ng
@OptimiseOrDie
slidesha.re/1wBbZ9c
12. The 95% Stopping Problem
• Many people use 95, 99% ‘confidence’ to stop
• This value is unreliable
• Read this Nature article : bit.ly/1dwk0if
• You can hit 95% early in a test
• If you stop, it could be a false result
• Testing Tools need to be smarter about what they imply!
• This 95% thingy – it’s the last signal you should use to stop a test
• Let me explain
@OptimiseOrDie
16. The 95% Stopping Problem
“You should know that stopping a test once it’s significant is
deadly sin number 1 in A/B testing land.
77% of A/A tests (testing the same thing as A and B) will reach
significance at a certain point.”
Ton Wesseling, Online Dialogue
“I always tell people that you need a representative sample if
your data needs to be valid. What does ‘representative’ mean?
First of all you need to include all the weekdays and weekends.
You need different weather, because it impacts buyer behaviour.
But most important: Your traffic needs to have all traffic
sources, especially newsletter, special campaigns, TV,…
everything!”
Andre Morys, Web Arts
17. Three Articles you MUST read
“Statistical Significance does not equal Validity”
http://bit.ly/1wMfmY2
“Why every Internet Marketer should be a Statistician”
http://bit.ly/1wMfs1G
“Understanding the Cycles in your site”
http://mklnd.com/1pGSOUP
18. Business & Purchase Cycles
@OptimiseOrDie
Start Test Finish Avg Cycle
• Customers change
• Your traffic mix changes
• Markets, competitors
• Be aware of all the waves
• Always test whole cycles
• Minimum 2 cycles (wk/mo)
• Don’t exclude slower buyers
20. • TWO BUSINESS CYCLES minimum (week/mo)
• 1 PURCHASE CYCLE minimum
• 250 CONVERSIONS minimum per creative (e.g. checkouts)
• 350 & MORE! if response is very similar
• FULL WEEKS/CYCLES never part of one
• KNOW what marketing, competitors and cycles are doing
• RUN a test length calculator - bit.ly/XqCxuu
• SET your test run time , RUN IT, STOP IT, ANALYSE IT
• ONLY RUN LONGER if you need more data
• DON’T RUN LONGER just because the test isn’t giving the result you want!
@OptimiseOrDie
How Long? Simple Rules to follow
22. QA
Test
or
lose
loads
of
MONEY!!!
• Over
40%
of
AB
tests
I’ve
worked
on
were
broken
(some
seriously)
• I’ve
also
found
over
£20M
p.a.
of
browser
bugs
in
the
last
18
months
• It’s
very
easy
to
break
or
bias
your
tes'ng
Browser testing
www.crossbrowsertesting.com
www.browserstack.com
www.spoon.net
www.saucelabs.com
www.multibrowserviewer.com
Mobile devices
www.appthwack.com
www.deviceanywhere.com
www.opendevicelab.com
Read this article bit.ly/1wBccsJ
@OptimiseOrDie
23. Gamble the Company AWAY!
• I get 60-65% right
• UX and Copywriters good at picking!
• C level execs are easy marks
• Ironically, many decide ‘designs’
• You need collaborative test design
• It’s a team game, with customers
• Flip a coin, anyone?
30. Business Future Testing?
Congratulations!
Today you’re the lucky
winner of our random
awards programme.
You get all these extra
features for free, on us.
Enjoy.
Mr D. Vader
31. #1 : CULTURE
• Smart Talented Polymath People
• Flexible and Agile ‘One Team’ approach
• Smash the Silos
• Proper Agile, Rapid, Iterative
The 5 Legged Optimisation Barstool
36. #4 : PERSUASIVE COPYWRITING
“On the average, five times as many people
read the headline as read the body copy.
When you have written your headline, you
have spent eighty cents out of your dollar.”
David Ogilvy
“In 9 years and 40M split tests with visitors,
the majority of my testing success came
from playing with the words.”
@OptimiseOrDie
44. #12 : The Best Companies…
• Invest
con'nually
in
analy'cs
instrumenta'on,
tools,
people
• Use
an
Agile,
itera've,
cross-‐silo,
one
team
project
culture
• Prefer
collabora've
tools
to
having
lots
of
mee'ngs
• Priori'se
development
based
on
numbers
and
insight
• Prac'ce
real
con'nuous
product
improvement,
not
SLEDD*
• Are
fixing
bugs,
cru=,
bad
stuff
as
well
as
op'mising
• Source
photos
and
content
that
support
persuasion
and
u'lity
• Have
cross
channel,
cross
device
design,
tes'ng
and
QA
• Segment
their
data
for
valuable
insights,
every
test
or
change
• Con'nually
reduce
cycle
(itera'on)
'me
in
their
process
• Blend
‘long’
design,
con'nuous
improvement
AND
split
tests
• Make
op'misa'on
the
engine
of
change,
not
the
slave
of
ego
* Single Large Expensive Doomed Developments