Andrea Volpini
@cyberandy
@multilingweb - Dipartimento di Informatica, Sapienza Università di Roma
6th July 2015
WordLift for Digital Publishers
This fine event is hosted
by:
@multilingweb // LIDER
future of journalism
opendata
@wordliftit v3
@mico_project
Hello, I am:
@cyberandy
No.8 - MARK ROTHKO
This workshop is about:
Meet Your
Audience
Some are
humans
and some
…are not.
Astro Boy Comic
“Hi Stacey! Would you like me to read
your favourite news?”
“ok Hound, When will the
sun rise in Japan two days
before Christmas in 2021?”
Friendly, helpful and intelligent
a completely new class of voice-enabled
assistants has just arrived
Beta Testing the Apocalypse - TOM KACZYNSKI
BANKS & INVESTORS: ANTI-MONEY LAUNDERING COMPLIANCE AND INVESTMENT STRATEGIES
LAW FIRMS: CHECKING IF THERE ARE ON-GOING OR PAST LEGAL PROCEEDINGS
POLICY MAKERS: NEWS AS VALUABLE INPUT IN THE LAW-MAKING PROCESS
BUSINESS: CREATING BUSINESS VALUE AND TAKING DECISIONS BY READING NEWS
(Humans)…creating value with News
Meet Your
New Colleagues
can interpret your data and turn
it into meaningful, personalised
content.
Associated Press announced last year
that corporate earnings stories and
sport stories are written
automatically.
Text Generation Algorithms
Logan Ingalls / Flickr
Analysts expect higher profit for
Paychex when the company reports
its fourth quarter results on Tuesday,
July 1, 2014. The consensus
estimate is calling for profit of 40
cents a share, reflecting a rise from
38 cents per share a year ago.
Your New Colleague…the Algorithm
has just written a new piece.
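A minimal sketch of how a template-based generator could produce a preview like the one above from structured earnings data. The template, function name and parameters are illustrative assumptions, not the system used by any news agency.

```python
# Hypothetical template; the wording mirrors the sample preview on the slide.
TEMPLATE = (
    "Analysts expect {direction} profit for {company} when the company reports "
    "its {quarter} quarter results on {report_date}. The consensus estimate is "
    "calling for profit of {estimate} cents a share, reflecting {change} "
    "{previous} cents per share a year ago."
)


def earnings_preview(company, quarter, report_date, estimate, previous):
    """Fill the template from structured data (per-share amounts in cents)."""
    if estimate > previous:
        direction, change = "higher", "a rise from"
    elif estimate < previous:
        direction, change = "lower", "a decline from"
    else:
        direction, change = "unchanged", "no change from"
    return TEMPLATE.format(
        direction=direction,
        company=company,
        quarter=quarter,
        report_date=report_date,
        estimate=estimate,
        change=change,
        previous=previous,
    )


if __name__ == "__main__":
    print(earnings_preview("Paychex", "fourth", "Tuesday, July 1, 2014", 40, 38))
```

The only "decision" the sketch makes is comparing two numbers; everything else is fixed wording, which is why the human factor stays with the journalist.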
but remember…
you still are
“Uniquely Human”
Pay a visit to http://nextdraft.com/
“If our role as journalists is to
help communities better
organize their knowledge and
themselves, then it is apparent
that we are in the service
business and that we must draw
on many tools, including
content, and place value on the
relationships we build with
members of our communities,
which will also take many forms.
Thus we are in the relationship
business.”
Jeff Jarvis
Human Factor is key!
Introducing
MEANINGFULLY ORGANISE YOUR CONTENT
A Semantic Editor for WordPress
for journalists and bloggers to:
ASSIST THE WRITING PROCESS
WITH CONTEXTUAL INFORMATION
ADD STRUCTURED METADATA
ENRICH CONTENT SUGGESTING
IMAGES, LINKS AND WIDGETS
RECOMMEND RELEVANT CONTENT
TO READERS
BUILD AN OPEN DATASET
(ENTITIES + ANNOTATIONS + CONTENT)
ASSIST THE WRITING PROCESS
WITH CONTEXTUAL INFORMATION
Fact-based information is derived
from open datasets and are
contextually relevant to the article.
Editors can choose what datasets
will be used for the enrichment.
ENRICH CONTENT SUGGESTING
IMAGES, LINKS AND WIDGETS
Relevant and free to use
photos and illustrations from
the Commons community
meaningful
navigation
systems
for internal
interlinking
Bringing to the audience an
overview of all the content
being written around a
specific topic using the chord
widget.
RECOMMEND RELEVANT CONTENT
content evolution over time
INTRODUCING THE NAVIGATOR WIDGET
WHERE /entity/earth
WHO /entity/michael-caine
schema:Person
schema:Place
schema:Organization
WHO /entity/nasa
type: /BlogPosting
/2015/07/04/coopers-endurance-crew/
Creates links to entity
pages and related
articles by using the
WHO, WHERE,
WHAT and WHEN
classifications.
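A minimal sketch of how related articles could be found from WHO/WHERE annotations: posts that share entities are linked to one another. The data layout, the extra post URLs and the ranking rule are assumptions made for illustration, not the widget's actual implementation.

```python
# Hypothetical annotations: post URL -> {classification: set of entity paths}.
# Only the first post and its entities come from the slide; the rest is made up.
ANNOTATIONS = {
    "/2015/07/04/coopers-endurance-crew/": {
        "WHO": {"/entity/michael-caine", "/entity/nasa"},
        "WHERE": {"/entity/earth"},
    },
    "/2015/06/20/nasa-mission-update/": {
        "WHO": {"/entity/nasa"},
    },
    "/2015/05/02/garden-notes/": {
        "WHERE": {"/entity/rome"},
    },
}


def entities_of(classes):
    """Flatten the per-classification sets into one set of entities."""
    return set().union(*classes.values())


def related_posts(post, annotations):
    """Return other posts sharing at least one entity, most overlap first."""
    own = entities_of(annotations[post])
    scores = {}
    for other, classes in annotations.items():
        if other == post:
            continue
        shared = own & entities_of(classes)
        if shared:
            scores[other] = len(shared)
    # Sort by descending overlap, then by URL for a stable order.
    return sorted(scores, key=lambda p: (-scores[p], p))


if __name__ == "__main__":
    print(related_posts("/2015/07/04/coopers-endurance-crew/", ANNOTATIONS))
```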
ADD STRUCTURED METADATA
The blog post, entities (dct:references),
publishing information (schema:datePublished
and schema:dateModified), the author
(schema:author), and the number of comments
(schema:interactionCount) are published as
Linked Open Data and printed using schema.org
for on-page SEO.
http://data.redlink.io/91/be2/post/Interstellar.html
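A minimal sketch of what such metadata could look like when serialised as JSON-LD. The property names are the ones listed above; the function, the example URL, the headline field, the sample values and the "UserComments:N" text format for schema:interactionCount are assumptions for illustration, not WordLift's actual output.

```python
import json


def blog_post_jsonld(url, headline, author, published, modified, comments, entities):
    """Build a JSON-LD description of a blog post (illustrative layout)."""
    return {
        "@context": {
            "schema": "http://schema.org/",
            "dct": "http://purl.org/dc/terms/",
        },
        "@id": url,
        "@type": "schema:BlogPosting",
        "schema:headline": headline,
        "schema:author": {"@type": "schema:Person", "schema:name": author},
        "schema:datePublished": published,
        "schema:dateModified": modified,
        # Assumed text format for the comment count.
        "schema:interactionCount": f"UserComments:{comments}",
        # One link per entity annotated in the post.
        "dct:references": [{"@id": entity} for entity in entities],
    }


if __name__ == "__main__":
    doc = blog_post_jsonld(
        url="http://example.org/2015/07/04/coopers-endurance-crew/",  # hypothetical
        headline="Cooper's Endurance crew",                            # hypothetical
        author="Jane Doe",                                             # hypothetical
        published="2015-07-04",
        modified="2015-07-05",
        comments=3,
        entities=["http://example.org/entity/nasa"],
    )
    print(json.dumps(doc, indent=2))
```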
Editors identify the basic 'WHO, WHAT, WHEN and WHERE' of an
article and structure information around it by creating new
entities in their custom vocabulary.
Content, vocabulary and annotations constitute the
publisher’s knowledge graph and can be queried via SPARQL.
BUILD AN OPEN DATASET
(ENTITIES + ANNOTATIONS + CONTENT)
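A minimal sketch of the idea behind querying such a graph. It is a stand-in written in plain Python rather than real SPARQL: triples are tuples, and a pattern with `None` in a position plays the role of a variable. All triples and helper names are hypothetical.

```python
# Hypothetical triples; prefixed names are kept as plain strings for brevity.
POST = "/2015/07/04/coopers-endurance-crew/"
TRIPLES = [
    (POST, "rdf:type", "schema:BlogPosting"),
    (POST, "dct:references", "/entity/michael-caine"),
    (POST, "dct:references", "/entity/earth"),
    ("/entity/michael-caine", "rdf:type", "schema:Person"),
    ("/entity/earth", "rdf:type", "schema:Place"),
]


def match(triples, s=None, p=None, o=None):
    """Return the triples matching one pattern; None acts as a variable."""
    return [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


def referenced_of_type(triples, post, rdf_type):
    """Entities the post references that carry the given rdf:type.

    Roughly the graph pattern:
      SELECT ?e WHERE { <post> dct:references ?e . ?e rdf:type <rdf_type> }
    """
    return [
        o for _, _, o in match(triples, s=post, p="dct:references")
        if match(triples, s=o, p="rdf:type", o=rdf_type)
    ]


if __name__ == "__main__":
    print(referenced_of_type(TRIPLES, POST, "schema:Person"))
```

A real deployment would send the SPARQL text to the publisher's endpoint instead; the sketch only shows how the WHO and WHERE annotations become answerable questions.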
(using and )
How does a blog post look in the knowledge graph?
Special thanks to @dvcama :)
owl:sameAs connects entities detected in the blog post,
such as Wormhole, with the same entity
on DBpedia and Freebase.
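A minimal sketch of how owl:sameAs links can be followed to group identifiers that denote the same thing. It uses a small union-find over pairs of URIs; the local paths and the Freebase identifier are placeholders, not real data from the knowledge graph.

```python
def same_as_clusters(links):
    """Group URIs connected by owl:sameAs pairs into equivalence clusters."""
    parent = {}

    def find(x):
        # Find the representative of x, compressing the path as we go.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in links:
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_b] = root_a

    clusters = {}
    for node in list(parent):
        clusters.setdefault(find(node), set()).add(node)
    return list(clusters.values())


# Hypothetical owl:sameAs statements (subject, object).
LINKS = [
    ("/entity/wormhole", "http://dbpedia.org/resource/Wormhole"),
    ("/entity/wormhole", "http://rdf.freebase.com/ns/m.EXAMPLE"),  # placeholder id
    ("/entity/nasa", "http://dbpedia.org/resource/NASA"),
]

if __name__ == "__main__":
    for cluster in same_as_clusters(LINKS):
        print(sorted(cluster))
```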
Starting this coming September WordLift and the technologies of MICO (for
cross-media analysis) are going to be used and validated by Greenpeace Italy
on their subscribers' magazine website (magazine.greenpeace.it).
Let’s move now to a real-world use case
where ecologists, journalists and visionaries
stand to defend the natural world and to
promote peace.
CONTENT ANALYSIS
LINKED DATA PUBLISHING
1
3
Technology Stack
Text
Legacy Data
Audio/Images
CONTENT DISCOVERY2
MICO is a three-year EU-funded
research project
(grant no. 610480) that
brings to the platform
Cross-Media Extraction
Cross-Media Metadata Publishing
Cross-Media Querying
Cross-Media Recommendation
• Enterprise Linked
Data
• Content Analysis
• Semantic Search
• Semantic Media
Analysis and
Search
Media extractors available in MICO today:
Animal detection, video quality, temporal segmentation,
automatic speech recognition, speech-music discrimination,
face detection and audio tampering detection.
Multimedia Retrieval
Cross-Media Querying:
Introducing the SPARQL extension SPARQL-MM, which adds
multimedia specific features to the standard query
language for the Semantic Web.
How can we help
Greenpeace Italy?
•Connect videos with text using
cross-media recommendations
•Provide compact contextual
information for media assets
•Create new discovery paths for
their readers and subscribers
Spatio-Temporal Object Model in SPARQL-MM
“Point me to scenes within
videos where Barack
Obama is standing to the left
of the MD of Greenpeace
while talking about whale
hunting”
Find out more on the SPARQL extension SPARQL-MM by reading this presentation by Thomas Kurz
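A minimal sketch of the spatio-temporal reasoning behind a request like the one quoted above, written in plain Python rather than SPARQL-MM. Each detected object is a video fragment with a time interval and a horizontal extent; a scene matches when two fragments overlap in time and one lies entirely to the left of the other. The class, field names, labels and sample values are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fragment:
    label: str    # who or what was detected
    start: float  # seconds from the start of the video
    end: float
    x: float      # left edge of the bounding box, in pixels
    width: float


def left_of(a, b):
    """True when a's box ends before b's box begins on the x axis."""
    return a.x + a.width <= b.x


def overlaps_in_time(a, b):
    """True when the two time intervals share at least one instant."""
    return a.start < b.end and b.start < a.end


def scenes(fragments, left_label, right_label):
    """Time spans where left_label appears to the left of right_label."""
    return [
        (max(a.start, b.start), min(a.end, b.end))
        for a in fragments
        for b in fragments
        if a.label == left_label
        and b.label == right_label
        and overlaps_in_time(a, b)
        and left_of(a, b)
    ]


# Hypothetical detections from one video.
FRAGMENTS = [
    Fragment("speaker", 10, 20, x=100, width=80),
    Fragment("director", 15, 30, x=300, width=80),
    Fragment("director", 40, 50, x=300, width=80),
    Fragment("speaker", 45, 48, x=500, width=80),
]

if __name__ == "__main__":
    print(scenes(FRAGMENTS, "speaker", "director"))
```

The topic filter ("while talking about whale hunting") would come from a further join with speech-recognition output and is left out of the sketch.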
Lessons learned so far…
• The bond between data and journalism is growing stronger, and even for
independent news organisations like Greenpeace, providing context and clarity
and building relationships (and knowledge graphs) are vital
• Algorithms are great and AI has entered the newsrooms but journalists
shall preserve their authorship and role when crafting content - always
leave the control in the hands of humans
• Providing immediate added value in the UX of semantic apps like
WordLift is key to engage journalists and not only marketers and
management
• Tags don’t help organise content; named entities are much better
• Linked Data is a service NOT a technology: users want to see images,
meaningful links, recommendation and interactive widgets - they don’t
care about underlying technologies like RDF and SPARQL
• Creating datasets as a side effect while editing contents helps journalists
make an impact and connect with policy makers, business and other
communities.
JOIN.WORDLIFT.IT
Thank you!
“[SLIDES] Creating an open database of
knowledge by tagging the WHO, WHAT,
WHERE, WHEN of your contents #journalism”
Click to share it on Twitter!
mico-project.eu wordlift.it insideout.io
CREDITS
This presentation is the result of many inspiring ideas and amazing work from
media experts, journalists and technologists. Here is the list:
Wilfried Runde of Deutsche Welle, “In Praise of Robots and Humans”
Justin Kosslyn from Google Ideas, on thinking about how journalists'
work gets used
Luca Rosati from News to Experience
BBC News Labs, “A manifesto for structured journalism”
Any idea, graphic or meme belonging to us is available
for sharing, copying and re-mixing under
Creative Commons license 3.0.
This presentation and the work behind it were partially developed within the
MICO project (Media in Context - European Commission 7th Framework Programme
grant agreement no: 610480).
FIND OUT MORE ABOUT OUR PRODUCTS
Video Hosting Platform Semantic Editor Semantic Search

WordLift for Digital Publishers and how to create an Open Database of Knowledge
