Identifying the business cases for automatic
metadata in the Finnish broadcasting company
Kim Viljanen, Elina Selkälä
The Finnish Broadcasting Company
Get to know Yle in 120 seconds
Yle – Your Story
• Founded 1926
• CEO Merja Ylä-Anttila
• Four television channels and three channel slots
• Six nationwide radio channels, three radio
services and varied web services
• Present throughout Finland, in 25 areas
• 18 Finnish-, 5 Swedish- and one Sami-speaking
newsrooms
• 10 regional television news broadcasts
• 2,811 permanent employees (12/2018)
• 90 % of employees create content and
programmes
• The company is 99.9 percent state-owned and
operates under the Act on Yleisradio Oy.
• Yle is financed by a tax paid by both individuals
and companies.
Yle - The AI Company
Planning Production Publication Consume
- Robot Journalist Voitto:
Automatic news content
generation IN PRODUCTION
- Automated tagging for
online texts IN PRODUCTION
- Image analysis and tagging,
stills library POC*
- On demand speech
recognition tool POC*
- Automatically created
transcriptions as an aid in
archive database searches
POC*
- Automatic metadata
generation: speech & image
recognition, text analysis &
automatic annotation etc.
POC*
- Automatic content analysis of
Yle Areena publications POC
- Speech recognition in
subtitling POC
- Audio description POC
- Music identification POC
- ASR based content
recommendation, Yle Areena IN
PRODUCTION
- News Watch App content
recommendation, Yle News IN
PRODUCTION
- Deduction of customer
demographics IN PRODUCTION
- Automatic moderation of web
discussions POC
- 360 view to content
and users with
metadata
- Editor’s assistant
Onnibot predicts
article’s performance
IN PRODUCTION
Functions done or assisted by AI (* Done at the Archives)
content analysis Metadata
new functionalities
for the end user
What’s next?
Company-wide automatic metadata processor
Analyse every content item at the right phase of the process
Steps in progress
The Metadata Machine to analyse content
with the help of AI/ML
Vision: The Metadata Machine
All audiovisual content is automatically analysed as early as possible
- Content creation (raw material)
- Procurement (ready-made content)
- Publishing (published content)
- Archiving (what do we have?)
Automatic content analysis engine:
- Speech recognition
- Image recognition
- Person identifier
- Fingerprinting
- Sound identifying
- Video frame color analysis
- Music identifier
- Text analysis
- Language identifier
- ...
Company-wide metadata database on all content items
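The vision above (several extractors feeding one company-wide metadata database) can be illustrated with a minimal Python sketch. Everything in it is hypothetical: the `ContentItem` shape, the two toy extractors and the in-memory `database` are stand-ins chosen for illustration, not Yle's or Graymeta Curio's actual design.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class ContentItem:
    """One piece of content entering the process (hypothetical shape)."""
    item_id: str
    phase: str            # e.g. "creation", "procurement", "publishing", "archiving"
    transcript: str = ""  # stand-in for the real audio/video payload


# An extractor takes a content item and returns a list of metadata values.
Extractor = Callable[[ContentItem], List[str]]


def capitalised_words(item: ContentItem) -> List[str]:
    """Toy 'text analysis': the distinct capitalised words, sorted."""
    words = (w.strip(".,") for w in item.transcript.split())
    return sorted({w for w in words if w[:1].isupper()})


def word_count(item: ContentItem) -> List[str]:
    """Toy extractor: number of whitespace-separated words."""
    return [str(len(item.transcript.split()))]


@dataclass
class MetadataMachine:
    """Runs every registered extractor and stores results per content item."""
    extractors: Dict[str, Extractor] = field(default_factory=dict)
    database: Dict[str, Dict[str, List[str]]] = field(default_factory=dict)

    def register(self, name: str, extractor: Extractor) -> None:
        self.extractors[name] = extractor

    def analyse(self, item: ContentItem) -> Dict[str, List[str]]:
        results = {name: fn(item) for name, fn in self.extractors.items()}
        self.database[item.item_id] = results  # one shared store for all items
        return results


machine = MetadataMachine()
machine.register("keywords", capitalised_words)
machine.register("word_count", word_count)
result = machine.analyse(
    ContentItem("clip-1", "publishing", "Iivo Niskanen won in Seefeld.")
)
```

Keeping extractors behind one small callable interface is one way to swap vendors or add new analysis types without touching the database side.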
A growing and fast-moving market of
automatic metadata extractors (AME)
- Cloud companies
- Companies focusing on one or several extractors
- AME orchestration companies
- Media product vendors (e.g. MAMs) that
incorporate AME as part of the service
- ...
Differentiators between service providers:
- Which metadata extractors does each provide?
- Focus only on media business?
- Onsite vs. cloud
- Ready-made vs. tailored
- Pricing model
- Ease of integration into supply chain
- Quality of metadata results
- Ability to train Machine Learning models
- Speed of developing their products and services
- ...
- Speech recognition
- Face recognition
- Optical character
recognition (OCR)
- Sound detection
- Language detection
- Visual object detection
- Landmark detection
- Logo detection
- Automatic translation
- ...
Do we have use cases
for metadata that
contains errors?
Metadata machine - Proof of concept project spring
2019
1. Buy a metadata machine (Graymeta Curio)
2. Involve as many teams as possible around the company to identify and test their
business cases for the Metadata machine.
3. Run lots of Yle content through the machine, extract as much metadata as possible
4. Run a wide variety of Yle content to test the capabilities of the extractors
5. Collect the results from the teams, identify the most prominent business cases
6. Final verdict: are the combined benefits bigger than the required investment?
test round 1 → test round 2 → test round 3 → analysis & next steps
A production ready platform that
powers automated metadata collection
using best of breed and custom
machine learning services.
Identifying the business cases for automatic metadata
100+ ideas → 10+ proofs of concept → 1+ to production
How to evaluate individual ideas?
What kind of metadata
does it require?
Does it improve existing
processes?
Does it enable something
completely new?
How much money or time
does it save?
How much does it increase
customer satisfaction?
Is the technology solution
available today?
What are the direct and
indirect costs involved?
How to optimize the costs?
How does it affect the
surrounding production
process / way of working?
How to combine human work
with automation?
What are the success criteria
/ KPIs … ?
...
Use case: sport (editing)
- The need: speed up making a video
compilation around a specific topic.
- Test case: Make a compilation about the
Finnish athlete Iivo Niskanen.
- Material: All Yle content about the Seefeld
competitions 2019
- Extractors: faces, OCR, speech recognition,
…
- Possible next step: EU leader identification
(to speed up editing of reports from EU
meetings in Helsinki later this year)
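As a rough illustration of how time-coded extractor output could speed up such a compilation, the sketch below filters detections by a name and merges nearby hits into candidate segments. The `Detection` record and the `find_segments` helper are hypothetical; real extractor output will have a different shape.

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple


@dataclass(frozen=True)
class Detection:
    """One time-coded hit from an extractor (hypothetical record shape)."""
    clip_id: str
    start: float    # seconds from the start of the clip
    end: float
    extractor: str  # e.g. "face", "ocr", "speech"
    label: str      # recognised name or text


def find_segments(
    detections: Iterable[Detection],
    name: str,
    padding: float = 0.0,
    max_gap: float = 2.0,
) -> List[Tuple[str, float, float]]:
    """Return (clip_id, start, end) segments whose label mentions `name`.

    Hits in the same clip that are at most `max_gap` seconds apart are
    merged, so face, OCR and speech hits on the same moment become one
    candidate segment for the editor.
    """
    hits = sorted(
        (d for d in detections if name.lower() in d.label.lower()),
        key=lambda d: (d.clip_id, d.start),
    )
    segments: List[Tuple[str, float, float]] = []
    for d in hits:
        start = max(0.0, d.start - padding)
        end = d.end + padding
        if segments and segments[-1][0] == d.clip_id and start - segments[-1][2] <= max_gap:
            clip, prev_start, prev_end = segments[-1]
            segments[-1] = (clip, prev_start, max(prev_end, end))
        else:
            segments.append((d.clip_id, start, end))
    return segments


detections = [
    Detection("a", 10, 12, "face", "Iivo Niskanen"),
    Detection("a", 13, 15, "speech", "Niskanen takes the lead"),
    Detection("a", 40, 42, "ocr", "NISKANEN Iivo"),
    Detection("b", 5, 6, "face", "Someone Else"),
]
segments = find_segments(detections, "niskanen")
```

The result is only a list of candidates; a human editor would still choose and trim the final shots.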
Use case: Content ingest and processing
- automatic slate identification
- black and silence detection
- end credit detection
- …
- Tools to automate quality
checking of incoming
material. Is this the media we
ordered?
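Silence detection is one of the simpler checks in this list. A minimal sketch, assuming the audio is already decoded into normalised samples between -1.0 and 1.0 (the function name, threshold and minimum duration are illustrative assumptions); black detection could follow the same pattern with mean frame luminance in place of sample amplitude.

```python
from typing import List, Sequence, Tuple


def find_silence(
    samples: Sequence[float],
    sample_rate: int,
    threshold: float = 0.01,
    min_duration: float = 0.5,
) -> List[Tuple[float, float]]:
    """Return (start, end) times in seconds of runs quieter than `threshold`.

    Only runs lasting at least `min_duration` seconds are reported.
    """
    min_len = int(min_duration * sample_rate)
    runs: List[Tuple[float, float]] = []
    run_start = None  # index where the current quiet run began, if any
    for i, sample in enumerate(samples):
        if abs(sample) < threshold:
            if run_start is None:
                run_start = i
        else:
            if run_start is not None and i - run_start >= min_len:
                runs.append((run_start / sample_rate, i / sample_rate))
            run_start = None
    # A quiet run may extend to the very end of the material.
    if run_start is not None and len(samples) - run_start >= min_len:
        runs.append((run_start / sample_rate, len(samples) / sample_rate))
    return runs


# Toy signal at 10 samples per second: 1 s loud, 1 s silent, 0.5 s loud,
# 0.2 s silent (too short to report), 0.3 s loud.
signal = [0.5] * 10 + [0.0] * 10 + [0.5] * 5 + [0.0] * 2 + [0.5] * 3
silences = find_silence(signal, sample_rate=10)
```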
Use case: Understanding the content for Archive
and Analytics
- “All” metadata potentially useful
for archive and analytics use.
- face recognition
- speech recognition
- visual description
- main topics
- natural language processing
- contains music?
- …
- In addition to having lots of
metadata, it is important to make
the data easy to use and view.
Use case: Spoken language detection
- The need: Identification of what languages are spoken in which parts of a program.
Important information for multiple teams inside Yle, e.g. the translation
department.
- Current commercial services (to our knowledge) can identify the main language of
the whole media, but not individual language segments inside the media.
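If a classifier could label short, fixed-length windows of audio with a language code (an assumption; the slide notes that the commercial services known to the authors do not offer this), turning those labels into language segments is straightforward. The function name and the window length below are hypothetical.

```python
from itertools import groupby
from typing import List, Tuple


def language_segments(
    window_labels: List[str], window_seconds: float
) -> List[Tuple[str, float, float]]:
    """Collapse consecutive identical window labels into (language, start, end)."""
    segments: List[Tuple[str, float, float]] = []
    position = 0  # index of the first window in the current run
    for language, run in groupby(window_labels):
        length = len(list(run))
        segments.append(
            (language, position * window_seconds, (position + length) * window_seconds)
        )
        position += length
    return segments


# Six 10-second windows: Finnish, then Swedish, then Finnish again.
labels = ["fi", "fi", "fi", "sv", "sv", "fi"]
segments = language_segments(labels, window_seconds=10.0)
```

A translation department could then jump straight to the Swedish segment instead of scanning the whole programme.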
Three horizons of automatic metadata
Horizon 1: Improve core business
- The possible and in production at Yle.
- Finnish speech recognition (in limited production from 2017).
- How to improve the existing?
Horizon 2: New opportunities
- The possible, but not yet in production at Yle.
- The Metadata machine project (and other projects)
- What can/should we buy now? What are the business cases?
Horizon 3: Visionaries
- The impossible. Not near to production yet.
- The MeMAD project
- What does the future look like? How to co-operate with ML researchers/visionaries?
Axis: Time
The MeMAD project in brief
- Three-year research project in the Horizon 2020 programme.
- Started in 2018, ends at the end of 2020.
- Four universities, four companies: Aalto University, University of
Helsinki, University of Surrey, Eurecom, INA, Limecraft, Lingsoft, Yle
- MeMAD is about:
- methods for efficient re-use and re-purposing of multilingual audiovisual content for
video management and digital storytelling in broadcasting and media production.
- combining automatic efficiency with human accuracy.
- producing a rich description of moving images, speech and audio.
- www.memad.eu
The MeMAD project has received funding from the European Union’s Horizon 2020 research and innovation
programme under grant agreement No 780069. This presentation has been produced by the MeMAD
project. The content in this presentation represents the views of the authors, and the European
Commission has no liability in respect of the content.
Example: Multimodal translation
Image caption translation
● WMT 2018 multimodal translation task
● EN text + image → DE/FR text (skipped Czech)
Speech-to-text translation
● IWSLT 2018, English audio (TED talks) to
German text
● Is the translation more accurate if it is part of the
speech recognition system itself?
Example: Automatic captioning of video image
● Currently: create a human readable
natural language description of what
is happening in each shot.
● Towards automatic recognition of the
narrative structure of a shot (and
the whole program).
● MeMAD project / Aalto University
● Based on deep neural network
features and LSTM language model
Potential use case: Automatic Audio Description
● “Steven” is a producer in charge of
delivering audio descriptions for
documentaries.
Thanks to automatically generated audio
descriptions, that are reviewed and
corrected manually, Steven can deliver
audio descriptions to end users for a
smaller budget, enabling more content to
be audio described. (UC4)
Outcome so far (work in progress)
- Better understanding of:
- our needs
- what can be solved with current ML and AME technologies
- the limitations of the technology - and how to work around the problems
- (what should not be solved with ML and AME technologies)
- what is available in the market and what is not; different ways to buy services
- how to work with automatic metadata companies and the academics
- how to share data for ML research (legal, technical, process, …)
- how to integrate a Metadata machine (Graymeta Curio) to Yle systems
- how to combine human metadata work with automated metadata
- …
- We are starting to understand the impacts and new requirements on our processes,
human skills, ...
- Company wide commitment and involvement ⇒ AME relevant to many departments!
- A more practical approach to AME than ever! We are moving from visions to reality.
Lessons learned
- The technology is tempting, but identifying the business cases is difficult and
requires lots of work. ⇒ Learning what the realistic expectations for AME are.
- Current off the shelf automatic metadata extraction services work and provide
value ⇒ Work in progress to estimate the business value for individual cases.
- Tuning the settings of “ready to use” ML services requires time and skills. ⇒ Impact
on future skill requirements for Yle personnel.
- Machine learning requires a lot of training data, preferably paired data (e.g. pairs
in two languages). ⇒ The “huge” Yle archive turns out to be relatively small and limited
from an ML point of view.
- Taking a company wide perspective on automatic metadata seems to work for the
time being ― instead of solving each business case as an individual project.
- Accept the imperfect! Start now!
The future
- Analyse the whole archive? Where to start? How to optimize the costs?
- Legal and privacy impacts. Can our material become too easy to find? (e.g. face
recognition)
- Continue following the markets and technologies
- Deciding on the next steps:
- What should we implement right away?
- What should we test more?
- What should we research more?
- Who should we co-operate with in this area?
Kim.Viljanen@yle.fi, Elina.Selkala@yle.fi
The Finnish Broadcasting Company

꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 

Automatic metadata generation in the Finnish broadcasting company

  • 1. Identifying the business cases for automatic metadata in the Finnish broadcasting company Kim Viljanen, Elina Selkälä The Finnish Broadcasting Company
  • 2. Get to know Yle in 120 seconds Yle – Your Story • Founded 1926 • CEO Merja Ylä-Anttila • Four television channels and three channel slots • Six nationwide radio channels, three radio services and varied web services • Present throughout Finland, in 25 areas • 18 Finnish-, 5 Swedish- and 1 Sami-speaking newsrooms • 10 regional television news broadcasts • 2,811 permanent employees (12/2018) • 90% of employees create content and programmes • The company is 99.9 percent state-owned and operates under the Act on Yleisradio Oy. • Yle is financed by a tax paid by both individuals and companies.
  • 3. Yle - The AI Company. Functions done or assisted by AI (* done at the Archives), shown across the phases Planning, Production, Publication and Consume: - Robot Journalist Voitto: automatic news content generation IN PRODUCTION - Automated tagging for online texts IN PRODUCTION - Image analysis and tagging, stills library POC* - On-demand speech recognition tool POC* - Automatically created transcriptions as an aid in archive database searches POC* - Automatic metadata generation: speech & image recognition, text analysis & automatic annotation etc. POC* - Automatic content analysis of Yle Areena publications POC - Speech recognition in subtitling POC - Audio description POC - Music identification POC - ASR-based content recommendation, Yle Areena IN PRODUCTION - News Watch App content recommendation, Yle News IN PRODUCTION - Deduction of customer demographics IN PRODUCTION - Automatic moderation of web discussions POC - 360-degree view of content and users with metadata - Editor’s assistant Onnibot predicts an article’s performance IN PRODUCTION
  • 4. What’s next? Company-wide automatic metadata processor: analyse every content item at the right phase of the process. Diagram labels: content analysis, metadata, new functionalities for the end user.
  • 5. Steps in progress: the Metadata Machine, to analyse content with the help of AI/ML
  • 6. Vision: The Metadata Machine. All audiovisual content is automatically analysed as early as possible. Inputs: content creation (raw material), procurement (ready-made content), publishing (published content), archiving (what do we have?). Automatic content analysis engine: speech recognition, image recognition, person identifier, fingerprinting, sound identification, video frame color analysis, music identifier, text analysis, language identifier, ... Result: a company-wide metadata database on all content items.
  • 7. A growing and fast-moving market of automatic metadata extractors (AME). Providers: - Cloud companies - Companies focusing on one or several extractors - AME orchestration companies - Media product vendors (e.g. MAMs) that incorporate AME as part of the service - ... Example extractors: - Speech recognition - Face recognition - Optical character recognition (OCR) - Sound detection - Language detection - Visual object detection - Landmark detection - Logo detection - Automatic translation - ... Differentiators between service providers: - Which metadata extractors does each provide? - Focus only on the media business? - On-site vs. cloud - Ready-made vs. tailored - Pricing model - Ease of integration into the supply chain - Quality of metadata results - Ability to train machine learning models - Speed of developing their products and services - ...
  • 8. Do we have use cases for metadata that contains errors?
  • 9. Metadata machine - Proof of concept project, spring 2019 1. Buy a metadata machine (Graymeta Curio). 2. Involve as many teams as possible around the company to identify and test their business cases for the Metadata machine. 3. Run lots of Yle content through the machine and extract as much metadata as possible. 4. Run a wide variety of Yle content to test the capabilities of the extractors. 5. Collect the results from the teams and identify the most prominent business cases. 6. Final verdict: are the combined benefits greater than the required investment? Timeline: test round 1, test round 2, test round 3, analysis & next steps.
  • 10. A production ready platform that powers automated metadata collection using best of breed and custom machine learning services.
  • 11. Identifying the business cases for automatic metadata: 100+ ideas, 10+ proofs of concept, 1+ to production
  • 12. How to evaluate individual ideas? What kind of metadata does it require? Does it improve existing processes? Does it enable something completely new? How much money or time does it save? How much does it increase customer satisfaction? Is the technology solution available today? What are the direct and indirect costs involved? How to optimize the costs? How does it affect the surrounding production process / way of working? How to combine human work with automation? What are the success criteria / KPIs … ? ...
  • 13. Use case: sport (editing) - The need: speed up making a video compilation around a specific topic. - Test case: Make a compilation about the Finnish athlete Iivo Niskanen. - Material: All Yle content about the Seefeld competitions 2019 - Extractors: faces, OCR, speech recognition, … - Possible next step: EU leader identification (to speed up editing of reports from EU meetings in Helsinki later this year)
  • 14. Use case: Content ingest and processing - automatic slate identification - black and silence detection - end credit detection - … - Tools to automate quality checking of incoming material. Is this the media we ordered?
  • 15. Use case: Understanding the content for Archive and Analytics - “All” metadata potentially useful for archive and analytics use. - face recognition - speech recognition - visual description - main topics - natural language processing - contains music? - … - In addition to having lots of metadata, it is important to make the data easy to use and view.
  • 16. Use case: Spoken language detection - The need: Identification of what languages are spoken in which parts of a program. Important information for multiple teams inside Yle, e.g. the translation department. - Current commercial services (to our knowledge) can identify the main language of the whole media, but not individual language segments inside the media.
  • 17. Three horizons of automatic metadata (over time). Horizon 1, Improve core business: the possible and in production at Yle. Example: Finnish speech recognition (in limited production from 2017). Question: How to improve the existing? Horizon 2, New opportunities: the possible, but not yet in production at Yle. Example: the Metadata machine project (and other projects). Questions: What can/should we buy now? What are the business cases? Horizon 3, Visionaries: the impossible, not near to production yet. Example: the MeMAD project. Questions: What does the future look like? How to co-operate with ML researchers/visionaries?
  • 18. The MeMAD project in brief - Three-year research project in the Horizon 2020 programme. - Started in 2018, ends at the end of 2020. - Four universities, four companies: Aalto University, University of Helsinki, University of Surrey, Eurecom, INA, Limecraft, Lingsoft, Yle - MeMAD is about: - methods for efficient re-use and re-purposing of multilingual audiovisual content for video management and digital storytelling in broadcasting and media production. - combining automatic efficiency with human accuracy. - producing a rich description of moving images, speech and audio. - www.memad.eu The MeMAD project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 780069. This presentation has been produced by the MeMAD project. The content in this presentation represents the views of the authors, and the European Commission has no liability in respect of the content.
  • 19. Example: Multimodal translation Image caption translation ● WMT 2018 multimodal translation task ● EN text + image → DE/FR text (skipped Czech) Speech-to-text translation ● IWSLT 2018, English audio (TED talks) to German text ● Is the translation more accurate if it is part of the speech recognition system itself?
  • 20. Example: Automatic captioning of video image ● Currently: create a human-readable natural language description of what is happening in each shot. ● Towards automatic recognition of the narrative structure of a shot (and the whole program). ● MeMAD project / Aalto University ● Based on deep neural network features and an LSTM language model
  • 21. Potential use case: Automatic Audio Description ● “Steven” is a producer in charge of delivering audio descriptions for documentaries. Thanks to automatically generated audio descriptions that are reviewed and corrected manually, Steven can deliver audio descriptions to end users for a smaller budget, enabling more content to be audio described. (UC4)
  • 22. Outcome so far (work in progress) - Better understanding of: - our needs - what can be solved with current ML and AME technologies - the limitations of the technology - and how to work around the problems - (what should not be solved with ML and AME technologies) - what is available in the market and what is not; different ways to buy services - how to work with automatic metadata companies and the academics - how to share data for ML research (legal, technical, process, …) - how to integrate a Metadata machine (Graymeta Curio) into Yle systems - how to combine human metadata work with automated metadata - … - We are starting to understand the impacts and new requirements on our processes, human skills, ... - Company-wide commitment and involvement ⇒ AME is relevant to many departments! - A more practical attempt at AME than ever! We are moving from visions to reality.
  • 23. Lessons learned - The technology is tempting, but identifying the business cases is difficult and requires lots of work. ⇒ Learning what the realistic expectations for AME are. - Current off-the-shelf automatic metadata extraction services work and provide value ⇒ Work in progress to estimate the business value for individual cases. - Tuning the settings of “ready to use” ML services requires time and skills. ⇒ Impact on future skill requirements for Yle personnel. - Machine learning requires a lot of training data, preferably pairs of data (e.g. two language pairs) ⇒ The “huge” Yle archive turns out to be relatively small and limited from an ML point of view. - Taking a company-wide perspective on automatic metadata seems to work for the time being, instead of solving each business case as an individual project. - Accept the imperfect! Start now!
  • 24. The future - Analyse the whole archive? Where to start? How to optimize the costs? - Legal and privacy impacts. Can our material become too easy to find? (e.g. face recognition) - Continue following the markets and technologies - Deciding on the next steps: - What should we implement right away? - What should we test more? - What should we research more? - Who should we co-operate with in this area?
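
The "black and silence detection" item in the content ingest use case (slide 14) can be illustrated with a minimal sketch. This is not the method used by Yle or Graymeta Curio. It assumes that per-frame mean luminance and per-frame audio RMS values have already been computed by some other tool; the function name `find_low_runs`, the thresholds and the sample values are all hypothetical.

```python
from typing import List, Tuple


def find_low_runs(values: List[float], threshold: float, min_len: int) -> List[Tuple[int, int]]:
    """Return (start, end) index pairs, end exclusive, for every run in which
    all values are below `threshold` and the run is at least `min_len` long."""
    runs: List[Tuple[int, int]] = []
    start = None  # index where the current low run began, if any
    for i, v in enumerate(values):
        if v < threshold:
            if start is None:
                start = i
        else:
            # A non-low value closes the current run; keep it if long enough.
            if start is not None and i - start >= min_len:
                runs.append((start, i))
            start = None
    # Close a run that extends to the end of the input.
    if start is not None and len(values) - start >= min_len:
        runs.append((start, len(values)))
    return runs


# Hypothetical measurements: mean luminance per frame (0-255) and audio RMS per frame (0-1).
luma = [120, 118, 3, 2, 1, 2, 119, 121, 2, 117]
rms = [0.30, 0.28, 0.001, 0.002, 0.001, 0.001, 0.25, 0.27, 0.26, 0.29]

black = find_low_runs(luma, threshold=10, min_len=3)
silence = find_low_runs(rms, threshold=0.01, min_len=3)
print(black)    # [(2, 6)]
print(silence)  # [(2, 6)]
```

The single dark frame at index 8 is ignored because it is shorter than `min_len`. In a real quality check the thresholds and minimum duration would need tuning per content type, which echoes the lesson on slide 23 that tuning ready-made services takes time and skill.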