SlideShare une entreprise Scribd logo
1  sur  15
1
Making your data lovely!
Prioritising, cleaning, extraction, transformation, automation
Pia Waugh
Director of Gov 2.0 and Data
Department of Finance
Soon to be Prime Minister & Cabinet
22
Key Benefits to the Public Service in Opening Data
• Efficiencies from proactively publishing common requests
• Cheaper and more modular services delivery
• Reduced regulatory burden through machine readable
data supporting compliance and automated reporting
• Better policy outcomes by leveraging cross-agency data
• More consistency & less duplication across government
• Improved opportunities to leverage innovation and
collaboration (citizens, industry, other depts)
• Opportunities to improve data quality through
verifiable public contributions
33
Tips for ensuring benefits realisation of open data
• Adopt an approach of “data user and developer empathy”
• Data publishing built into your BAU
• Initial focus on data that supports you  build capability
• Consume your own data APIs (apps, datavis, BI, etc)
• Ensure you consider:
• Quality – no one can use bad data, but perfect is enemy of the good
• Currency – is it up to date? How often is it updated?
• APIs – is it programmatically available?
• Publishing – have you provided supporting materials (taxonomies)?
• Discoverability – is it hosted or linked on data.gov.au?
• Reusability – have you tested it with data users?
• Licensing – Creative Commons By Attribution the default
• Automation wherever possible!
44
Data on the inside
• Do you know what data you have internally?
• Are you considering all data types?
• How embedded is data driven decision making?
• How can you upskill the whole organisation?
• Do you know what your external data needs are?
• How are you measuring and monitoring success?
Data infrastructure to support your organisation
should be extendable to support sharing/publishing
55
Rub a dub data
• If a machine can’t read it, a machine can’t make an API
• Some data has specialised data formats, some commonalities
• Tabular, spatial, real time, unstructured, etc
• Most data comes from somewhere, use the source Luke!
• Machines and humans have different needs
66
What you need is clean sheets
• Don’t merge cells. Sorting and other manipulations people may want to apply to your data assume
that each cell belongs to one row and column.
• Don’t mix data and metadata (e.g. date of release, name of author) in the same sheet.
• The first row of a data sheet should contain column headers. None of these headers should be
duplicates or blank. The column header should clearly indicate which units are used in that column,
where this makes sense.
• The remaining rows should contain data, one datum per row. Don’t include aggregate statistics such
as TOTAL or AVERAGE. You can put aggregate statistics in a separate sheet, if they are important.
• Numbers in cells should just be numbers. Don’t put commas in them, or stars after them, or
anything else. If you need to add an annotation to some rows, use a separate column.
• Use standard identifiers: e.g. identify countries using ISO 3166 codes rather than names.
• Don’t use only colour or other stylistic cues to encode information. If you want to colour cells
according to their value, use conditional formatting.
• Leave the cell blank if a value is not available.
• If you provide pivot tables, make sure the underlying data is available separately too.
• If you also want to create a human-friendly presentation of the data, do so by creating another sheet
in the same workbook and referencing the appropriate cells in the canonical data sheet
http://www.clean-sheet.org/
77
Automate your reporting
http://ckan.org/2015/09/18/pyramids-pipelines-and-a-can-of-sweave-ckan-asia-pacific-meetup/
88
Automating updates
Automation involves system to system updates to save you time & money.
Three broad approaches:
1. Write scripts to push or pull data updates using an API directly from
the source. Usually doesn’t require much data manipulation.
2. Adopt a tool like Taverna, FME or Splunk to extract, clean/manipulate,
and then push data to the data.gov.au (CKAN/geoserver) API directly.
3. Use the data.gov.au (CKAN) to schedule pull updates from your data,
but most agencies don’t do that as they prefer to push updates.
The data.gov.au team strongly encourage you to gain at least one geek in you
data team so you can experiment with code and tools to best meet your needs.
“With much help and encouragement from the support team at data.gov.au, we dipped our toes into the CKAN API waters. As a
DotNet shop we were keen to limit the technology landscape and sought to automate the upload using DotNet. The CKAN API is
refreshingly lightweight with a simple authentication process and messaging.” -- ABN Lookup Team
Code at https://github.com/datagovau/ckan-api-examples
99
Support
• http://toolkit.data.gov.au is updated regularly. Recent updates include:
• How to automate data updates to data.gov.au with FME
• Improved information on how to clean data
• How to manage your own catalogue harvesting
• Government data landscape to identify projects of use
• Open Data Community Forum – soon to be moved to analyticsspace
• Talk to your colleagues across government(s)
• Other sources
• Communities of interest: Data Science Meetup groups, Data
Analytics Centre of Excellence, Linked Data Working Group,
National Statistical Service, etc
• GovHack Developers Kit: Become a data scientist in an hour, data
tools, APIs, datavis, spatial, mashup techniques, statistical
1010
Quality – improve over time
The 5 Star Data Quality standard developed by
Sir Tim Berners-Lee will be used on data.gov.au in
the coming month or two to indicate data quality.
Aim for quality web services.
API quality will also be looked at
soon, including potentially
a 5 star API standard.
http://5stardata.info/en/
1111
Data integration and aggregation
• Challenging but great potential for improved policy/services.
• Unit record sharing is complex, privacy concerns for personal data.
• Personal unit record data is mostly useful to researchers, appropriate
mechanisms with legal, technical, ethical constraints to access such data.
• Data aggregated by common spatial boundaries is comparative across
datasets and over time.
• Unfortunately, data owners traditionally aggregate to boundaries that
constantly change (electorates, postcodes, etc).
• The Australian Statistical Geography Standard (ASGS) provides a
consistent set of spatial boundaries that can be mapped to other needs.
• Anonymisation on the fly APIs also provide mechanism for appropriate
public/agency access to unit record level data (e.g. ABS.Stat)
http://statistical-data-integration.govspace.gov.au/
https://toolkit.data.gov.au/index.php?title=Definitions#Types_of_data
1212
data.gov.au
Free, cloud, scalable API enabled platform for hosting government data.
Staged approach
1. Publishing (2013 – mid 2014)
Improving the functionality and ease of
publishing for agencies with training and
documentation
2. Value realisation (2014-2015)
Providing useful front end tools for data.gov.au
including data visualisation and analysis tools.
Publishing quality data a pre-requisite.
3. Data quality (2014-2015)
Looking at ways to provide agencies the ability
to accept iterative data improvements in a
verifiable way
Features
• Support for tabular, spatial and data models
• Options for hosting, linking or catalogue harvesting
• Manual and automated publishing options
• API access to government data
• Easy to publish, download & interact
• Use cases and site|data|org analytics
• Data Request Site
• Metadata harvesting from gov data gateways
• National Map integration
• Federated search for discoverability
In Planning
• 5 star quality plugin
• Selective crowdsourcing for updates
• League Table
1313
Open Data Portals
Council Portals:
• City of Melbourne
• City of Brisbane
1414
Some Case Studies
• Publishing Budget 2014 Data Report
• Open data – Transforming the Provider / Stakeholder Paradigm
• On the Value of Open Roof Prints
• 100 years of patent and IP data released on data.gov.au
More available along with tech support at http://toolkit.data.gov.au
Other Australian case studies/documentation
• SA Open Data Toolkit
• QLD Government Case Studies
• Victorian Government Showcase
• NSW Apps Showcase
• ACT examples
1515
The future is here....
And it is already widely distributed
http://www.flickr.com/photos/mr_matt/35688926
22/
Challenge #1: Collaborate
Challenge #2: Share
Challenge #3: Measure
Challenge #4: Play
Questions?
@piawaugh
@datagovau
data.gov.au
toolkit.data.gov.au

Contenu connexe

Tendances

Workshop Rio de Janeiro Strategies for Web Based Data Dissemination
Workshop Rio de Janeiro Strategies for Web Based Data DisseminationWorkshop Rio de Janeiro Strategies for Web Based Data Dissemination
Workshop Rio de Janeiro Strategies for Web Based Data DisseminationZoltan Nagy
 
Over View of Open Government Data Platform India
Over View of Open Government Data Platform IndiaOver View of Open Government Data Platform India
Over View of Open Government Data Platform IndiaData Portal India
 
Sheffield why open data matters to local government officers - tim adams lga
Sheffield   why open data matters to local government officers - tim adams lgaSheffield   why open data matters to local government officers - tim adams lga
Sheffield why open data matters to local government officers - tim adams lgaLG Inform Plus
 
Linked Open Government Data: What’s Next?
Linked Open Government Data:  What’s Next?Linked Open Government Data:  What’s Next?
Linked Open Government Data: What’s Next?Li Ding
 
Data as a service
Data as a serviceData as a service
Data as a serviceZoltan Nagy
 
Data-as-a-Service: DataGraft
Data-as-a-Service: DataGraftData-as-a-Service: DataGraft
Data-as-a-Service: DataGraftdapaasproject
 
Open Data Ireland: Developing a national open data strategy
Open Data Ireland: Developing a national open data strategyOpen Data Ireland: Developing a national open data strategy
Open Data Ireland: Developing a national open data strategyDublinked .
 
Victoria Online: Seek and ye shall find: 26 May 2010
Victoria Online: Seek and ye shall find: 26 May 2010 Victoria Online: Seek and ye shall find: 26 May 2010
Victoria Online: Seek and ye shall find: 26 May 2010 eGovernment Resource Centre
 
Open data for development
Open data for developmentOpen data for development
Open data for developmentmlepage
 
Roberts swirrl odi_korea_dec2015
Roberts swirrl odi_korea_dec2015Roberts swirrl odi_korea_dec2015
Roberts swirrl odi_korea_dec2015Bill Roberts
 
Apm poster collective action in addis
Apm poster  collective action in addisApm poster  collective action in addis
Apm poster collective action in addiskensir
 
"FENIX platform OVERVIEW OF THE NEW SOFTWARE PLATFORM AND SYSTEM SETUP"
"FENIX platform OVERVIEW OF THE NEW SOFTWARE PLATFORM AND SYSTEM SETUP""FENIX platform OVERVIEW OF THE NEW SOFTWARE PLATFORM AND SYSTEM SETUP"
"FENIX platform OVERVIEW OF THE NEW SOFTWARE PLATFORM AND SYSTEM SETUP"FAO
 
Memory Connected
Memory ConnectedMemory Connected
Memory ConnectedLi Ding
 
IAOS 2018 - Making official statistics usable by non-statisticians, A. Sporle...
IAOS 2018 - Making official statistics usable by non-statisticians, A. Sporle...IAOS 2018 - Making official statistics usable by non-statisticians, A. Sporle...
IAOS 2018 - Making official statistics usable by non-statisticians, A. Sporle...StatsCommunications
 
Regional web-based health and wellbeing data
Regional web-based health and wellbeing dataRegional web-based health and wellbeing data
Regional web-based health and wellbeing dataHelen Thompson
 
Pre-open access data sharing
Pre-open access data sharingPre-open access data sharing
Pre-open access data sharingCGIAR
 

Tendances (20)

Workshop Rio de Janeiro Strategies for Web Based Data Dissemination
Workshop Rio de Janeiro Strategies for Web Based Data DisseminationWorkshop Rio de Janeiro Strategies for Web Based Data Dissemination
Workshop Rio de Janeiro Strategies for Web Based Data Dissemination
 
Over View of Open Government Data Platform India
Over View of Open Government Data Platform IndiaOver View of Open Government Data Platform India
Over View of Open Government Data Platform India
 
Sheffield why open data matters to local government officers - tim adams lga
Sheffield   why open data matters to local government officers - tim adams lgaSheffield   why open data matters to local government officers - tim adams lga
Sheffield why open data matters to local government officers - tim adams lga
 
Linked Open Government Data: What’s Next?
Linked Open Government Data:  What’s Next?Linked Open Government Data:  What’s Next?
Linked Open Government Data: What’s Next?
 
Data as a service
Data as a serviceData as a service
Data as a service
 
Data-as-a-Service: DataGraft
Data-as-a-Service: DataGraftData-as-a-Service: DataGraft
Data-as-a-Service: DataGraft
 
Open Addresses - MRS
Open Addresses - MRSOpen Addresses - MRS
Open Addresses - MRS
 
Open Data Ireland: Developing a national open data strategy
Open Data Ireland: Developing a national open data strategyOpen Data Ireland: Developing a national open data strategy
Open Data Ireland: Developing a national open data strategy
 
Open Data Initiatives
Open Data InitiativesOpen Data Initiatives
Open Data Initiatives
 
Victoria Online: Seek and ye shall find: 26 May 2010
Victoria Online: Seek and ye shall find: 26 May 2010 Victoria Online: Seek and ye shall find: 26 May 2010
Victoria Online: Seek and ye shall find: 26 May 2010
 
Open data for development
Open data for developmentOpen data for development
Open data for development
 
Roberts swirrl odi_korea_dec2015
Roberts swirrl odi_korea_dec2015Roberts swirrl odi_korea_dec2015
Roberts swirrl odi_korea_dec2015
 
Broadband mapping and crowdsourced data
Broadband mapping and crowdsourced dataBroadband mapping and crowdsourced data
Broadband mapping and crowdsourced data
 
Apm poster collective action in addis
Apm poster  collective action in addisApm poster  collective action in addis
Apm poster collective action in addis
 
"FENIX platform OVERVIEW OF THE NEW SOFTWARE PLATFORM AND SYSTEM SETUP"
"FENIX platform OVERVIEW OF THE NEW SOFTWARE PLATFORM AND SYSTEM SETUP""FENIX platform OVERVIEW OF THE NEW SOFTWARE PLATFORM AND SYSTEM SETUP"
"FENIX platform OVERVIEW OF THE NEW SOFTWARE PLATFORM AND SYSTEM SETUP"
 
Memory Connected
Memory ConnectedMemory Connected
Memory Connected
 
IAOS 2018 - Making official statistics usable by non-statisticians, A. Sporle...
IAOS 2018 - Making official statistics usable by non-statisticians, A. Sporle...IAOS 2018 - Making official statistics usable by non-statisticians, A. Sporle...
IAOS 2018 - Making official statistics usable by non-statisticians, A. Sporle...
 
Regional web-based health and wellbeing data
Regional web-based health and wellbeing dataRegional web-based health and wellbeing data
Regional web-based health and wellbeing data
 
Pre-open access data sharing
Pre-open access data sharingPre-open access data sharing
Pre-open access data sharing
 
Fieldwork2 0
Fieldwork2 0Fieldwork2 0
Fieldwork2 0
 

Similaire à Making your data lovely with open data benefits and best practices

ADV Slides: Data Pipelines in the Enterprise and Comparison
ADV Slides: Data Pipelines in the Enterprise and ComparisonADV Slides: Data Pipelines in the Enterprise and Comparison
ADV Slides: Data Pipelines in the Enterprise and ComparisonDATAVERSITY
 
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...Denodo
 
Predicting Mission Success through Improved Data Collection, Reuse and Analysis
Predicting Mission Success through Improved Data Collection, Reuse and AnalysisPredicting Mission Success through Improved Data Collection, Reuse and Analysis
Predicting Mission Success through Improved Data Collection, Reuse and AnalysisBooz Allen Hamilton
 
Getting Started Using ACL in Your Next Audit
Getting Started Using ACL in Your Next AuditGetting Started Using ACL in Your Next Audit
Getting Started Using ACL in Your Next AuditJim Kaplan CIA CFE
 
Agility for big data
Agility for big data Agility for big data
Agility for big data Charlie Cheng
 
Atlan_Product metering_Subrat.pdf
Atlan_Product metering_Subrat.pdfAtlan_Product metering_Subrat.pdf
Atlan_Product metering_Subrat.pdfSubrat Kumar Dash
 
TDWI Checklist - The Automation and Optimization of Advanced Analytics Based ...
TDWI Checklist - The Automation and Optimization of Advanced Analytics Based ...TDWI Checklist - The Automation and Optimization of Advanced Analytics Based ...
TDWI Checklist - The Automation and Optimization of Advanced Analytics Based ...Vasu S
 
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...Geoffrey Fox
 
Establish a 360-view of your data with UiPath and Tableau
Establish a 360-view of your data with UiPath and TableauEstablish a 360-view of your data with UiPath and Tableau
Establish a 360-view of your data with UiPath and TableauCristina Vidu
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big DataSpringPeople
 
The Rise of Self -service Business Intelligence
The Rise of Self -service Business IntelligenceThe Rise of Self -service Business Intelligence
The Rise of Self -service Business Intelligenceskewdlogix
 
data_blending
data_blendingdata_blending
data_blendingsubit1615
 
Visualising montioring and evaluation data
Visualising montioring and evaluation dataVisualising montioring and evaluation data
Visualising montioring and evaluation dataRob Worthington
 
Self-service analytics risk_September_2016
Self-service analytics risk_September_2016Self-service analytics risk_September_2016
Self-service analytics risk_September_2016Leigh Ulpen
 
Tips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data AnalyticsTips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data AnalyticsAbhishek Sood
 
Turbocharging your Google Analytics data
Turbocharging your Google Analytics dataTurbocharging your Google Analytics data
Turbocharging your Google Analytics dataNeil Barnes
 
2. Business Data Analytics and Technology.pptx
2. Business Data Analytics and Technology.pptx2. Business Data Analytics and Technology.pptx
2. Business Data Analytics and Technology.pptxnirmalanr2
 
Getting Meta at Mesa
Getting Meta at MesaGetting Meta at Mesa
Getting Meta at MesaSafe Software
 
Does Your Data Catalog Tool Have These Capabilities?
Does Your Data Catalog Tool Have These Capabilities?Does Your Data Catalog Tool Have These Capabilities?
Does Your Data Catalog Tool Have These Capabilities?Vivek Mishra
 

Similaire à Making your data lovely with open data benefits and best practices (20)

ADV Slides: Data Pipelines in the Enterprise and Comparison
ADV Slides: Data Pipelines in the Enterprise and ComparisonADV Slides: Data Pipelines in the Enterprise and Comparison
ADV Slides: Data Pipelines in the Enterprise and Comparison
 
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
Denodo DataFest 2016: Comparing and Contrasting Data Virtualization With Data...
 
Predicting Mission Success through Improved Data Collection, Reuse and Analysis
Predicting Mission Success through Improved Data Collection, Reuse and AnalysisPredicting Mission Success through Improved Data Collection, Reuse and Analysis
Predicting Mission Success through Improved Data Collection, Reuse and Analysis
 
Getting Started Using ACL in Your Next Audit
Getting Started Using ACL in Your Next AuditGetting Started Using ACL in Your Next Audit
Getting Started Using ACL in Your Next Audit
 
Agility for big data
Agility for big data Agility for big data
Agility for big data
 
Atlan_Product metering_Subrat.pdf
Atlan_Product metering_Subrat.pdfAtlan_Product metering_Subrat.pdf
Atlan_Product metering_Subrat.pdf
 
TDWI Checklist - The Automation and Optimization of Advanced Analytics Based ...
TDWI Checklist - The Automation and Optimization of Advanced Analytics Based ...TDWI Checklist - The Automation and Optimization of Advanced Analytics Based ...
TDWI Checklist - The Automation and Optimization of Advanced Analytics Based ...
 
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
 
Establish a 360-view of your data with UiPath and Tableau
Establish a 360-view of your data with UiPath and TableauEstablish a 360-view of your data with UiPath and Tableau
Establish a 360-view of your data with UiPath and Tableau
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
The Rise of Self -service Business Intelligence
The Rise of Self -service Business IntelligenceThe Rise of Self -service Business Intelligence
The Rise of Self -service Business Intelligence
 
data_blending
data_blendingdata_blending
data_blending
 
Visualising montioring and evaluation data
Visualising montioring and evaluation dataVisualising montioring and evaluation data
Visualising montioring and evaluation data
 
Self-service analytics risk_September_2016
Self-service analytics risk_September_2016Self-service analytics risk_September_2016
Self-service analytics risk_September_2016
 
Tips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data AnalyticsTips --Break Down the Barriers to Better Data Analytics
Tips --Break Down the Barriers to Better Data Analytics
 
Turbocharging your Google Analytics data
Turbocharging your Google Analytics dataTurbocharging your Google Analytics data
Turbocharging your Google Analytics data
 
2. Business Data Analytics and Technology.pptx
2. Business Data Analytics and Technology.pptx2. Business Data Analytics and Technology.pptx
2. Business Data Analytics and Technology.pptx
 
Getting Meta at Mesa
Getting Meta at MesaGetting Meta at Mesa
Getting Meta at Mesa
 
Does Your Data Catalog Tool Have These Capabilities?
Does Your Data Catalog Tool Have These Capabilities?Does Your Data Catalog Tool Have These Capabilities?
Does Your Data Catalog Tool Have These Capabilities?
 

Plus de Pia Waugh

linux.conf.au 2017 keynote - choose your own adventure please
linux.conf.au 2017 keynote - choose your own adventure pleaselinux.conf.au 2017 keynote - choose your own adventure please
linux.conf.au 2017 keynote - choose your own adventure pleasePia Waugh
 
Young Women's Leadership Seminar
Young Women's Leadership SeminarYoung Women's Leadership Seminar
Young Women's Leadership SeminarPia Waugh
 
Data useful for open cities
Data useful for open citiesData useful for open cities
Data useful for open citiesPia Waugh
 
Over the Edge - Online 2015 talk for ALIA
Over the Edge - Online 2015 talk for ALIAOver the Edge - Online 2015 talk for ALIA
Over the Edge - Online 2015 talk for ALIAPia Waugh
 
State of the nation talk - opengov miniconf 2014
State of the nation talk - opengov miniconf 2014State of the nation talk - opengov miniconf 2014
State of the nation talk - opengov miniconf 2014Pia Waugh
 
Public Sphere: Gov 2.0 - Justin Freeman
Public Sphere: Gov 2.0 - Justin FreemanPublic Sphere: Gov 2.0 - Justin Freeman
Public Sphere: Gov 2.0 - Justin FreemanPia Waugh
 
Public Sphere: Gov 2.0 - Des Walsh
Public Sphere: Gov 2.0 - Des WalshPublic Sphere: Gov 2.0 - Des Walsh
Public Sphere: Gov 2.0 - Des WalshPia Waugh
 
Public Sphere: Gov 2.0 - Brian Fitzgerald
Public Sphere: Gov 2.0 - Brian FitzgeraldPublic Sphere: Gov 2.0 - Brian Fitzgerald
Public Sphere: Gov 2.0 - Brian FitzgeraldPia Waugh
 
Public Sphere: Gov 2.0 - Mark Spain
Public Sphere: Gov 2.0 - Mark SpainPublic Sphere: Gov 2.0 - Mark Spain
Public Sphere: Gov 2.0 - Mark SpainPia Waugh
 
Public Sphere: Gov 2.0 - Shoaib Burq
Public Sphere: Gov 2.0 - Shoaib BurqPublic Sphere: Gov 2.0 - Shoaib Burq
Public Sphere: Gov 2.0 - Shoaib BurqPia Waugh
 
Public Sphere: Gov 2.0 - John Shanahan
Public Sphere: Gov 2.0 - John ShanahanPublic Sphere: Gov 2.0 - John Shanahan
Public Sphere: Gov 2.0 - John ShanahanPia Waugh
 
Public Sphere: Gov 2.0 - Andrew Boyd
Public Sphere: Gov 2.0 - Andrew BoydPublic Sphere: Gov 2.0 - Andrew Boyd
Public Sphere: Gov 2.0 - Andrew BoydPia Waugh
 
Public Sphere: Gov 2.0 - Crispin Butteriss
Public Sphere: Gov 2.0 - Crispin ButterissPublic Sphere: Gov 2.0 - Crispin Butteriss
Public Sphere: Gov 2.0 - Crispin ButterissPia Waugh
 
Public Sphere: Gov 2.0 - Ben Searle
Public Sphere: Gov 2.0 - Ben SearlePublic Sphere: Gov 2.0 - Ben Searle
Public Sphere: Gov 2.0 - Ben SearlePia Waugh
 
Public Sphere: Gov 2.0 - Damien Donnelly
Public Sphere: Gov 2.0 - Damien DonnellyPublic Sphere: Gov 2.0 - Damien Donnelly
Public Sphere: Gov 2.0 - Damien DonnellyPia Waugh
 
Public Sphere: Gov 2.0 - Matthew Landauer
Public Sphere: Gov 2.0 - Matthew LandauerPublic Sphere: Gov 2.0 - Matthew Landauer
Public Sphere: Gov 2.0 - Matthew LandauerPia Waugh
 
Public Sphere: Gov 2.0 - Nerida Hart
Public Sphere: Gov 2.0 - Nerida HartPublic Sphere: Gov 2.0 - Nerida Hart
Public Sphere: Gov 2.0 - Nerida HartPia Waugh
 
Public Sphere: Gov 2.0 - Raul Vera
Public Sphere: Gov 2.0 -  Raul VeraPublic Sphere: Gov 2.0 -  Raul Vera
Public Sphere: Gov 2.0 - Raul VeraPia Waugh
 
Public Sphere: Gov 2.0 - Zachary Zeus
Public Sphere: Gov 2.0 -  Zachary ZeusPublic Sphere: Gov 2.0 -  Zachary Zeus
Public Sphere: Gov 2.0 - Zachary ZeusPia Waugh
 
Public Sphere: Gov 2.0 - Owen Cameron
Public Sphere: Gov 2.0 - Owen CameronPublic Sphere: Gov 2.0 - Owen Cameron
Public Sphere: Gov 2.0 - Owen CameronPia Waugh
 

Plus de Pia Waugh (20)

linux.conf.au 2017 keynote - choose your own adventure please
linux.conf.au 2017 keynote - choose your own adventure pleaselinux.conf.au 2017 keynote - choose your own adventure please
linux.conf.au 2017 keynote - choose your own adventure please
 
Young Women's Leadership Seminar
Young Women's Leadership SeminarYoung Women's Leadership Seminar
Young Women's Leadership Seminar
 
Data useful for open cities
Data useful for open citiesData useful for open cities
Data useful for open cities
 
Over the Edge - Online 2015 talk for ALIA
Over the Edge - Online 2015 talk for ALIAOver the Edge - Online 2015 talk for ALIA
Over the Edge - Online 2015 talk for ALIA
 
State of the nation talk - opengov miniconf 2014
State of the nation talk - opengov miniconf 2014State of the nation talk - opengov miniconf 2014
State of the nation talk - opengov miniconf 2014
 
Public Sphere: Gov 2.0 - Justin Freeman
Public Sphere: Gov 2.0 - Justin FreemanPublic Sphere: Gov 2.0 - Justin Freeman
Public Sphere: Gov 2.0 - Justin Freeman
 
Public Sphere: Gov 2.0 - Des Walsh
Public Sphere: Gov 2.0 - Des WalshPublic Sphere: Gov 2.0 - Des Walsh
Public Sphere: Gov 2.0 - Des Walsh
 
Public Sphere: Gov 2.0 - Brian Fitzgerald
Public Sphere: Gov 2.0 - Brian FitzgeraldPublic Sphere: Gov 2.0 - Brian Fitzgerald
Public Sphere: Gov 2.0 - Brian Fitzgerald
 
Public Sphere: Gov 2.0 - Mark Spain
Public Sphere: Gov 2.0 - Mark SpainPublic Sphere: Gov 2.0 - Mark Spain
Public Sphere: Gov 2.0 - Mark Spain
 
Public Sphere: Gov 2.0 - Shoaib Burq
Public Sphere: Gov 2.0 - Shoaib BurqPublic Sphere: Gov 2.0 - Shoaib Burq
Public Sphere: Gov 2.0 - Shoaib Burq
 
Public Sphere: Gov 2.0 - John Shanahan
Public Sphere: Gov 2.0 - John ShanahanPublic Sphere: Gov 2.0 - John Shanahan
Public Sphere: Gov 2.0 - John Shanahan
 
Public Sphere: Gov 2.0 - Andrew Boyd
Public Sphere: Gov 2.0 - Andrew BoydPublic Sphere: Gov 2.0 - Andrew Boyd
Public Sphere: Gov 2.0 - Andrew Boyd
 
Public Sphere: Gov 2.0 - Crispin Butteriss
Public Sphere: Gov 2.0 - Crispin ButterissPublic Sphere: Gov 2.0 - Crispin Butteriss
Public Sphere: Gov 2.0 - Crispin Butteriss
 
Public Sphere: Gov 2.0 - Ben Searle
Public Sphere: Gov 2.0 - Ben SearlePublic Sphere: Gov 2.0 - Ben Searle
Public Sphere: Gov 2.0 - Ben Searle
 
Public Sphere: Gov 2.0 - Damien Donnelly
Public Sphere: Gov 2.0 - Damien DonnellyPublic Sphere: Gov 2.0 - Damien Donnelly
Public Sphere: Gov 2.0 - Damien Donnelly
 
Public Sphere: Gov 2.0 - Matthew Landauer
Public Sphere: Gov 2.0 - Matthew LandauerPublic Sphere: Gov 2.0 - Matthew Landauer
Public Sphere: Gov 2.0 - Matthew Landauer
 
Public Sphere: Gov 2.0 - Nerida Hart
Public Sphere: Gov 2.0 - Nerida HartPublic Sphere: Gov 2.0 - Nerida Hart
Public Sphere: Gov 2.0 - Nerida Hart
 
Public Sphere: Gov 2.0 - Raul Vera
Public Sphere: Gov 2.0 -  Raul VeraPublic Sphere: Gov 2.0 -  Raul Vera
Public Sphere: Gov 2.0 - Raul Vera
 
Public Sphere: Gov 2.0 - Zachary Zeus
Public Sphere: Gov 2.0 -  Zachary ZeusPublic Sphere: Gov 2.0 -  Zachary Zeus
Public Sphere: Gov 2.0 - Zachary Zeus
 
Public Sphere: Gov 2.0 - Owen Cameron
Public Sphere: Gov 2.0 - Owen CameronPublic Sphere: Gov 2.0 - Owen Cameron
Public Sphere: Gov 2.0 - Owen Cameron
 

Dernier

2024: The FAR, Federal Acquisition Regulations - Part 27
2024: The FAR, Federal Acquisition Regulations - Part 272024: The FAR, Federal Acquisition Regulations - Part 27
2024: The FAR, Federal Acquisition Regulations - Part 27JSchaus & Associates
 
YHR Fall 2023 Issue (Joseph Manning Interview) (2).pdf
YHR Fall 2023 Issue (Joseph Manning Interview) (2).pdfYHR Fall 2023 Issue (Joseph Manning Interview) (2).pdf
YHR Fall 2023 Issue (Joseph Manning Interview) (2).pdfyalehistoricalreview
 
call girls in DLF Phase 1 gurgaon 🔝 >༒9540349809 🔝 genuine Escort Service 🔝...
call girls in DLF Phase 1  gurgaon  🔝 >༒9540349809 🔝 genuine Escort Service 🔝...call girls in DLF Phase 1  gurgaon  🔝 >༒9540349809 🔝 genuine Escort Service 🔝...
call girls in DLF Phase 1 gurgaon 🔝 >༒9540349809 🔝 genuine Escort Service 🔝...saminamagar
 
Angels_EDProgrammes & Services 2024.pptx
Angels_EDProgrammes & Services 2024.pptxAngels_EDProgrammes & Services 2024.pptx
Angels_EDProgrammes & Services 2024.pptxLizelle Coombs
 
Club of Rome: Eco-nomics for an Ecological Civilization
Club of Rome: Eco-nomics for an Ecological CivilizationClub of Rome: Eco-nomics for an Ecological Civilization
Club of Rome: Eco-nomics for an Ecological CivilizationEnergy for One World
 
Enhancing Indigenous Peoples' right to self-determination in the context of t...
Enhancing Indigenous Peoples' right to self-determination in the context of t...Enhancing Indigenous Peoples' right to self-determination in the context of t...
Enhancing Indigenous Peoples' right to self-determination in the context of t...Christina Parmionova
 
call girls in Mayapuri DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in Mayapuri DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️call girls in Mayapuri DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in Mayapuri DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️saminamagar
 
call girls in Tilak Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in Tilak Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️call girls in Tilak Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in Tilak Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️saminamagar
 
WORLD CREATIVITY AND INNOVATION DAY 2024.
WORLD CREATIVITY AND INNOVATION DAY 2024.WORLD CREATIVITY AND INNOVATION DAY 2024.
WORLD CREATIVITY AND INNOVATION DAY 2024.Christina Parmionova
 
Russian Call Girl Hebbagodi ! 7001305949 ₹2999 Only and Free Hotel Delivery 2...
Russian Call Girl Hebbagodi ! 7001305949 ₹2999 Only and Free Hotel Delivery 2...Russian Call Girl Hebbagodi ! 7001305949 ₹2999 Only and Free Hotel Delivery 2...
Russian Call Girl Hebbagodi ! 7001305949 ₹2999 Only and Free Hotel Delivery 2...narwatsonia7
 
call girls in Narela DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in Narela DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️call girls in Narela DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in Narela DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️saminamagar
 
High Class Call Girls Bangalore Komal 7001305949 Independent Escort Service B...
High Class Call Girls Bangalore Komal 7001305949 Independent Escort Service B...High Class Call Girls Bangalore Komal 7001305949 Independent Escort Service B...
High Class Call Girls Bangalore Komal 7001305949 Independent Escort Service B...narwatsonia7
 
productionpost-productiondiary-240320114322-5004daf6.pptx
productionpost-productiondiary-240320114322-5004daf6.pptxproductionpost-productiondiary-240320114322-5004daf6.pptx
productionpost-productiondiary-240320114322-5004daf6.pptxHenryBriggs2
 
Powering Britain: Can we decarbonise electricity without disadvantaging poore...
Powering Britain: Can we decarbonise electricity without disadvantaging poore...Powering Britain: Can we decarbonise electricity without disadvantaging poore...
Powering Britain: Can we decarbonise electricity without disadvantaging poore...ResolutionFoundation
 
Monastic-Supremacy-in-the-Philippines-_20240328_092725_0000.pdf
Monastic-Supremacy-in-the-Philippines-_20240328_092725_0000.pdfMonastic-Supremacy-in-the-Philippines-_20240328_092725_0000.pdf
Monastic-Supremacy-in-the-Philippines-_20240328_092725_0000.pdfCharlynTorres1
 
call girls in sector 22 Gurgaon 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in sector 22 Gurgaon  🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️call girls in sector 22 Gurgaon  🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in sector 22 Gurgaon 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️saminamagar
 
call girls in Kirti Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in Kirti Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️call girls in Kirti Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in Kirti Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️saminamagar
 
call girls in moti bagh DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in moti bagh DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️call girls in moti bagh DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in moti bagh DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️saminamagar
 
call girls in Mukherjee Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝...
call girls in Mukherjee Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝...call girls in Mukherjee Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝...
call girls in Mukherjee Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝...saminamagar
 

Dernier (20)

2024: The FAR, Federal Acquisition Regulations - Part 27
2024: The FAR, Federal Acquisition Regulations - Part 272024: The FAR, Federal Acquisition Regulations - Part 27
2024: The FAR, Federal Acquisition Regulations - Part 27
 
YHR Fall 2023 Issue (Joseph Manning Interview) (2).pdf
YHR Fall 2023 Issue (Joseph Manning Interview) (2).pdfYHR Fall 2023 Issue (Joseph Manning Interview) (2).pdf
YHR Fall 2023 Issue (Joseph Manning Interview) (2).pdf
 
call girls in DLF Phase 1 gurgaon 🔝 >༒9540349809 🔝 genuine Escort Service 🔝...
call girls in DLF Phase 1  gurgaon  🔝 >༒9540349809 🔝 genuine Escort Service 🔝...call girls in DLF Phase 1  gurgaon  🔝 >༒9540349809 🔝 genuine Escort Service 🔝...
call girls in DLF Phase 1 gurgaon 🔝 >༒9540349809 🔝 genuine Escort Service 🔝...
 
Angels_EDProgrammes & Services 2024.pptx
Angels_EDProgrammes & Services 2024.pptxAngels_EDProgrammes & Services 2024.pptx
Angels_EDProgrammes & Services 2024.pptx
 
Hot Sexy call girls in Palam Vihar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Palam Vihar🔝 9953056974 🔝 escort ServiceHot Sexy call girls in Palam Vihar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Palam Vihar🔝 9953056974 🔝 escort Service
 
Club of Rome: Eco-nomics for an Ecological Civilization
Club of Rome: Eco-nomics for an Ecological CivilizationClub of Rome: Eco-nomics for an Ecological Civilization
Club of Rome: Eco-nomics for an Ecological Civilization
 
Enhancing Indigenous Peoples' right to self-determination in the context of t...
Enhancing Indigenous Peoples' right to self-determination in the context of t...Enhancing Indigenous Peoples' right to self-determination in the context of t...
Enhancing Indigenous Peoples' right to self-determination in the context of t...
 
call girls in Mayapuri DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in Mayapuri DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️call girls in Mayapuri DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in Mayapuri DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
 
call girls in Tilak Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in Tilak Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️call girls in Tilak Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in Tilak Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
 
WORLD CREATIVITY AND INNOVATION DAY 2024.
WORLD CREATIVITY AND INNOVATION DAY 2024.WORLD CREATIVITY AND INNOVATION DAY 2024.
WORLD CREATIVITY AND INNOVATION DAY 2024.
 
Russian Call Girl Hebbagodi ! 7001305949 ₹2999 Only and Free Hotel Delivery 2...
Russian Call Girl Hebbagodi ! 7001305949 ₹2999 Only and Free Hotel Delivery 2...Russian Call Girl Hebbagodi ! 7001305949 ₹2999 Only and Free Hotel Delivery 2...
Russian Call Girl Hebbagodi ! 7001305949 ₹2999 Only and Free Hotel Delivery 2...
 
call girls in Narela DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in Narela DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️call girls in Narela DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in Narela DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
 
High Class Call Girls Bangalore Komal 7001305949 Independent Escort Service B...
High Class Call Girls Bangalore Komal 7001305949 Independent Escort Service B...High Class Call Girls Bangalore Komal 7001305949 Independent Escort Service B...
High Class Call Girls Bangalore Komal 7001305949 Independent Escort Service B...
 
productionpost-productiondiary-240320114322-5004daf6.pptx
productionpost-productiondiary-240320114322-5004daf6.pptxproductionpost-productiondiary-240320114322-5004daf6.pptx
productionpost-productiondiary-240320114322-5004daf6.pptx
 
Powering Britain: Can we decarbonise electricity without disadvantaging poore...
Powering Britain: Can we decarbonise electricity without disadvantaging poore...Powering Britain: Can we decarbonise electricity without disadvantaging poore...
Powering Britain: Can we decarbonise electricity without disadvantaging poore...
 
Monastic-Supremacy-in-the-Philippines-_20240328_092725_0000.pdf
Monastic-Supremacy-in-the-Philippines-_20240328_092725_0000.pdfMonastic-Supremacy-in-the-Philippines-_20240328_092725_0000.pdf
Monastic-Supremacy-in-the-Philippines-_20240328_092725_0000.pdf
 
call girls in sector 22 Gurgaon 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in sector 22 Gurgaon  🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️call girls in sector 22 Gurgaon  🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in sector 22 Gurgaon 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
 
call girls in Kirti Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in Kirti Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️call girls in Kirti Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in Kirti Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
 
call girls in moti bagh DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in moti bagh DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️call girls in moti bagh DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
call girls in moti bagh DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝✔️✔️
 
call girls in Mukherjee Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝...
call girls in Mukherjee Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝...call girls in Mukherjee Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝...
call girls in Mukherjee Nagar DELHI 🔝 >༒9540349809 🔝 genuine Escort Service 🔝...
 

Making your data lovely with open data benefits and best practices

  • 1. 1 Making your data lovely! Prioritising, cleaning, extraction, transformation, automation Pia Waugh Director of Gov 2.0 and Data Department of Finance Soon to be Prime Minister & Cabinet
  • 2. 22 Key Benefits to the Public Service in Opening Data • Efficiencies from proactively publishing common requests • Cheaper and more modular services delivery • Reduced regulatory burden through machine readable data supporting compliance and automated reporting • Better policy outcomes by leveraging cross-agency data • More consistency & less duplication across government • Improved opportunities to leverage innovation and collaboration (citizens, industry, other depts) • Opportunities to improve data quality through verifiable public contributions
  • 3. 33 Tips for ensuring benefits realisation of open data • Adopt an approach of “data user and developer empathy” • Data publishing built into your BAU • Initial focus on data that supports you  build capability • Consume your own data APIs (apps, datavis, BI, etc) • Ensure you consider: • Quality – no one can use bad data, but perfect is enemy of the good • Currency – is it up to date? How often is it updated? • APIs – is it programmatically available? • Publishing – have you provided supporting materials (taxonomies)? • Discoverability – is it hosted or linked on data.gov.au? • Reusability – have you tested it with data users? • Licensing – Creative Commons By Attribution the default • Automation wherever possible!
  • 4. 44 Data on the inside • Do you know what data you have internally? • Are you considering all data types? • How embedded is data driven decision making? • How can you upskill the whole organisation? • Do you know what your external data needs are? • How are you measuring and monitoring success? Data infrastructure to support your organisation should be extendable to support sharing/publishing
  • 5. 55 Rub a dub data • If a machine can’t read it, a machine can’t make an API • Some data has specialised data formats, some commonalities • Tabular, spatial, real time, unstructured, etc • Most data comes from somewhere, use the source Luke! • Machines and humans have different needs
  • 6. 66 What you need is clean sheets • Don’t merge cells. Sorting and other manipulations people may want to apply to your data assume that each cell belongs to one row and column. • Don’t mix data and metadata (e.g. date of release, name of author) in the same sheet. • The first row of a data sheet should contain column headers. None of these headers should be duplicates or blank. The column header should clearly indicate which units are used in that column, where this makes sense. • The remaining rows should contain data, one datum per row. Don’t include aggregate statistics such as TOTAL or AVERAGE. You can put aggregate statistics in a separate sheet, if they are important. • Numbers in cells should just be numbers. Don’t put commas in them, or stars after them, or anything else. If you need to add an annotation to some rows, use a separate column. • Use standard identifiers: e.g. identify countries using ISO 3166 codes rather than names. • Don’t use only colour or other stylistic cues to encode information. If you want to colour cells according to their value, use conditional formatting. • Leave the cell blank if a value is not available. • If you provide pivot tables, make sure the underlying data is available separately too. • If you also want to create a human-friendly presentation of the data, do so by creating another sheet in the same workbook and referencing the appropriate cells in the canonical data sheet http://www.clean-sheet.org/
  • 8. 88 Automating updates Automation involves system to system updates to save you time & money. Three broad approaches: 1. Write scripts to push or pull data updates using an API directly from the source. Usually doesn’t require much data manipulation. 2. Adopt a tool like Taverna, FME or Splunk to extract, clean/manipulate, and then push data to the data.gov.au (CKAN/geoserver) API directly. 3. Use the data.gov.au (CKAN) to schedule pull updates from your data, but most agencies don’t do that as they prefer to push updates. The data.gov.au team strongly encourage you to gain at least one geek in you data team so you can experiment with code and tools to best meet your needs. “With much help and encouragement from the support team at data.gov.au, we dipped our toes into the CKAN API waters. As a DotNet shop we were keen to limit the technology landscape and sought to automate the upload using DotNet. The CKAN API is refreshingly lightweight with a simple authentication process and messaging.” -- ABN Lookup Team Code at https://github.com/datagovau/ckan-api-examples
  • 9. 99 Support • http://toolkit.data.gov.au is updated regularly. Recent updates include: • How to automate data updates to data.gov.au with FME • Improved information on how to clean data • How to manage your own catalogue harvesting • Government data landscape to identify projects of use • Open Data Community Forum – soon to be moved to analyticsspace • Talk to your colleagues across government(s) • Other sources • Communities of interest: Data Science Meetup groups, Data Analytics Centre of Excellence, Linked Data Working Group, National Statistical Service, etc • GovHack Developers Kit: Become a data scientist in an hour, data tools, APIs, datavis, spatial, mashup techniques, statistical
  • 10. 1010 Quality – improve over time The 5 Star Data Quality standard developed by Sir Tim Berners-Lee will be used on data.gov.au in the coming month or two to indicate data quality. Aim for quality web services. API quality will also be looked at soon, including potentially a 5 star API standard. http://5stardata.info/en/
  • 11. 1111 Data integration and aggregation • Challenging but great potential for improved policy/services. • Unit record sharing is complex, privacy concerns for personal data. • Personal unit record data is mostly useful to researchers, appropriate mechanisms with legal, technical, ethical constraints to access such data. • Data aggregated by common spatial boundaries is comparative across datasets and over time. • Unfortunately, data owners traditionally aggregate to boundaries that constantly change (electorates, postcodes, etc). • The Australian Statistical Geography Standard (ASGS) provides a consistent set of spatial boundaries that can be mapped to other needs. • Anonymisation on the fly APIs also provide mechanism for appropriate public/agency access to unit record level data (e.g. ABS.Stat) http://statistical-data-integration.govspace.gov.au/ https://toolkit.data.gov.au/index.php?title=Definitions#Types_of_data
  • 12. 1212 data.gov.au Free, cloud, scalable API enabled platform for hosting government data. Staged approach 1. Publishing (2013 – mid 2014) Improving the functionality and ease of publishing for agencies with training and documentation 2. Value realisation (2014-2015) Providing useful front end tools for data.gov.au including data visualisation and analysis tools. Publishing quality data a pre-requisite. 3. Data quality (2014-2015) Looking at ways to provide agencies the ability to accept iterative data improvements in a verifiable way Features • Support for tabular, spatial and data models • Options for hosting, linking or catalogue harvesting • Manual and automated publishing options • API access to government data • Easy to publish, download & interact • Use cases and site|data|org analytics • Data Request Site • Metadata harvesting from gov data gateways • National Map integration • Federated search for discoverability In Planning • 5 star quality plugin • Selective crowdsourcing for updates • League Table
  • 13. 1313 Open Data Portals Council Portals: • City of Melbourne • City of Brisbane
  • 14. 1414 Some Case Studies • Publishing Budget 2014 Data Report • Open data – Transforming the Provider / Stakeholder Paradigm • On the Value of Open Roof Prints • 100 years of patent and IP data released on data.gov.au More available along with tech support at http://toolkit.data.gov.au Other Australian case studies/documentation • SA Open Data Toolkit • QLD Government Case Studies • Victorian Government Showcase • NSW Apps Showcase • ACT examples
  • 15. 1515 The future is here.... And it is already widely distributed http://www.flickr.com/photos/mr_matt/35688926 22/ Challenge #1: Collaborate Challenge #2: Share Challenge #3: Measure Challenge #4: Play Questions? @piawaugh @datagovau data.gov.au toolkit.data.gov.au