Automated metadata generation projects at Yle
Elina Selkälä
Manager, archive publishing and metadata
Yle Archives
elina.selkala@yle.fi
FIAT/IFTA Media Management Seminar
Lugano
8.-9.6.2017
Agenda
• Yle in a nutshell
• Yle Archives, collections and materials
• Production of metadata at Yle
• What we experimented on: examples of automatic content analysis projects
• What we learned
• What is happening next
• What is the role of the information professional in the age of AI
This is Yle
• Public service broadcasting company
• 3 nationwide television & 6 radio channels, 24 regional radio stations
• Extensive online presence: yle.fi, svenska.yle.fi, Yle Areena, Yle Elävä arkisto
• In addition to Finnish and Swedish, has broadcasts in 11 languages, e.g. Sami,
English and Russian
• National programming hours per year:
50,000 hours of radio programming
20,000 hours of TV programming
5,000 hours of audio content online
15,000 hours of video content online
Yle Archives
• Archives and catalogues radio and TV programmes produced and co-produced by Yle
• Fosters and curates the archive collections of Yle
• Offers information services and training for Yle staff
• Publishes archive material online
Collections
• TV and radio materials, photographs, sound effects and music
• Archived in Media Asset Management System ”Metro” (Avid)
• Represents an important part of Finnish cultural heritage
• The archive also holds sheet music, books and online resources, e.g. papers, magazines,
databases
Radio and TV Archive collections
TV materials
• TV programmes and raw material from
1957 onwards & film materials from 1906
onward
• Collection consists of around 700,000
programmes and clips
• All Yle productions / co-productions have
been systematically archived since 1984
• Archiving in native digital form since 2009
• Around 10,000 hours of video content is
archived / year
• Relatively good metadata
Radio materials
• Yle produced programmes and raw
material, oldest surviving clip from 1935
• The collection consists of around 2 million
programmes and clips
• Currently around 10% of radio
transmissions are archived (e.g. News and
works of art)
• Archiving in native digital form from the
beginning of the 2000s
• Around 20,000 hours of audio content is
archived / year
• Metadata of varying quality
Metadata production at Yle
Archived radio and TV programmes
• Yle’s archive materials are widely used as whole programmes (reruns) and clips
• Metadata incomplete or insufficient for many reasons → hinders findability and safe
re-use
• Alongside the digitization of tape collections, the related programme metadata is
updated and improved
• Huge endeavour, therefore prioritization is needed (most used, customer orders)
• Descriptive metadata is done manually
• Done by Archives’ information specialists (about 15 people)
Metadata production at Yle
New audio and video content
• Metadata production decentralized
Metadata added and stored throughout the production and publishing process
Some metadata from production and publishing systems, descriptive metadata
filled out manually
Done by Yle staff: production coordinators, editors, producers, etc.
• Company-wide Archiving Policy
Defines the responsibilities, contents to be archived, metadata and formats
• Growing amount of published content
• Metadata is used for archiving and reuse purposes, as well as reporting
• New needs for metadata: improve discoverability and visibility on
Automated content analysis projects at Yle
Fall 2016
• Automated content analysis (virtual) team with participants from different parts of Yle
• Improve discoverability on web services (Yle Areena)
• Improve discoverability from archive databases
• New ways to subtitle video content
• Management of raw materials and versions
• Team’s goals were to:
• Learn about AI, machine learning and automatic content analysis methods in
theory and practice
• Carry out pilot projects (PoCs) with some companies
• Find solutions for automated metadata production in practice
Case 1
Automatic content analysis of TV programmes (1/2)
Pilot project with Valossa Labs
Goal
• Test and evaluate the quality and
suitability of automatically produced
(descriptive) metadata in Yle’s metadata
production
Tested methods
• Text analysis of subtitles → tagging,
annotation
• Image recognition: object and face
recognition
• OCR of captions
• Automatic segmentation
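The "text analysis of subtitles → tagging" step can be illustrated with a minimal sketch. It is not the method used in the pilot: it simply counts frequent words, and the function name `suggest_tags`, the stop-word list and the sample subtitles are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (illustration only).
STOP_WORDS = {"the", "a", "an", "and", "of", "in", "to", "is", "on", "for", "was", "at"}


def suggest_tags(subtitle_lines, top_n=3):
    """Return the top_n most frequent non-stop-words as candidate tags."""
    words = []
    for line in subtitle_lines:
        # [^\W\d_]+ matches runs of letters, including non-ASCII ones such as ä and ö.
        words.extend(re.findall(r"[^\W\d_]+", line.lower()))
    counts = Counter(w for w in words if w not in STOP_WORDS and len(w) > 2)
    return [word for word, _ in counts.most_common(top_n)]


# Invented sample subtitles, not taken from any Yle programme.
subtitles = [
    "The election results were announced in Helsinki.",
    "Helsinki voters waited for the election results.",
    "The election was close.",
]
tags = suggest_tags(subtitles)
print(tags)
```

A production system would more plausibly use lemmatization and a controlled vocabulary, which matters for an inflected language such as Finnish; the frequency count only shows the general shape of the step.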
Case 1
Automatic content analysis of TV programmes (2/2)
Results
• Face recognition works well, object recognition is somewhat unreliable and too detailed
• Subtitles could also be used for content analysis
• Automatic segmentation (scenes, inserts) works well
• Test period was too short, so no experience was gained of the system's learning capabilities
• Speech recognition alongside image recognition would probably be beneficial, but the
tested application did not support this feature
Case 2
Automatic content analysis of audio content (1/2)
Pilot project with Lingsoft
Goal
• Test and evaluate the quality of speech &
music recognition and automatic
annotation
Tested methods
• Speech recognition → textual data for text
analysis
• Automatic annotation and indexing
• Music recognition (distinguish music from
speech)
Case 2
Automatic content analysis of audio content (2/2)
Results
• Audio quality and the speaker's manner of speaking have a significant impact on the results
• Accuracy of the transcription is sufficient for annotation → relevant keywords, tags
• Music recognition works fairly well
• Speaker recognition would be useful, but the tested service did not support this feature
Case 3
Automatic content analysis of Yle Areena content (1/3)
Pilot project with Qvik, Valossa Labs and Aalto University
Goal
• Improve findability and usability of audio and video content in Yle Areena online service
Three experiments
• Speech recognition: Time-code based transcriptions of audio files
• Image / structure recognition: fast forward opening & closing credits, inserts
• Text analysis: automatic annotation
[Diagram labels: Yle Areena content; Automatic content analysis; Media; Metadata; New functionalities for the end user]
Case 3
Automatic content analysis of Yle Areena content (2/3)
Speech-to-text & text analysis
• Time-coded transcription and
automatic annotation of audio and
video content
Results
• Transcriptions were added to the Yle
Areena web page, so search engines
were able to index the contents →
searching the spoken content
became possible
• Identification of relevant concepts
was successful
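How a time-coded transcription makes spoken content searchable can be sketched with a small inverted index. This is an assumption-laden illustration, not the Yle Areena implementation: the segment format (start time in seconds plus text), the function names `build_index` and `search`, and the sample transcript are all hypothetical.

```python
from collections import defaultdict


def build_index(segments):
    """Map each lower-cased word to the start times of the segments that contain it.

    segments: iterable of (start_seconds, text) pairs, assumed to be in time order.
    """
    index = defaultdict(list)
    for start_seconds, text in segments:
        # Strip simple punctuation first, then de-duplicate words within the segment.
        words = {w.strip(".,!?") for w in text.lower().split()}
        for word in words:
            if word:
                index[word].append(start_seconds)
    return index


def search(index, word):
    """Return the time-codes (seconds) at which the word is spoken, or an empty list."""
    return index.get(word.lower(), [])


# Invented sample transcript segments.
segments = [
    (0.0, "Welcome to the evening news."),
    (12.5, "Today the weather is sunny."),
    (47.0, "More weather news tomorrow."),
]
index = build_index(segments)
print(search(index, "weather"))
```

Each hit is a time-code, so a player could jump straight to the point where the word is spoken.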
Case 3
Automatic content analysis of
Yle Areena content (3/3)
Identifying the structure of the content
• Automatic segmentation and identification
of recurrent elements (opening & closing
credits)
• Object recognition
Results
• Recurring elements (based on images) and
topics (based on subtitling) can be
identified → intelligent fast forward is
possible (Demo)
• Object recognition is somewhat unreliable
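The "intelligent fast forward" idea can be sketched as follows, assuming the segmentation step yields labelled time ranges. The `Segment` class, the label names and the function `next_play_position` are hypothetical and only illustrate how identified credits could drive a skip function.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    start: float  # seconds from the beginning of the programme
    end: float
    label: str    # hypothetical label produced by automatic segmentation


# Hypothetical labels for recurring elements a viewer may want to skip.
SKIPPABLE = {"opening_credits", "closing_credits"}


def next_play_position(segments, position):
    """Return where playback should continue from.

    If position lies inside a skippable segment, jump to the end of that
    segment (and of any skippable segment that follows it directly);
    otherwise return position unchanged.
    """
    for seg in sorted(segments, key=lambda s: s.start):
        if seg.label in SKIPPABLE and seg.start <= position < seg.end:
            position = seg.end
    return position


# Invented segmentation of a 26-minute programme.
programme = [
    Segment(0.0, 45.0, "opening_credits"),
    Segment(45.0, 1500.0, "programme"),
    Segment(1500.0, 1560.0, "closing_credits"),
]
print(next_play_position(programme, 10.0))
```

The analysis only has to supply the labelled ranges; the skip logic itself stays trivial and can live in the player.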
Lessons learned
Define needs, requirements, and goals
• What is needed and who needs it
• Costs and benefits
Define how success is measured
• Evaluation criteria
Plan lead-through of projects
• Time and other resources
Cooperation with outside partners
• Ready-made test material packages
Contract and copyrights issues
Share your information
On-going projects
Production
• Robot journalism, Voitto-robotti (pilot project)
• Automatic annotation of Yle’s web articles (in production)
Publishing
• Automatic metadata production by speech recognition and
image recognition (PoC)
• Speech recognition in subtitling (PoC)
Consumption / use
• Recommendation for Yle Areena content (in production)
• Yle Uutisvahti application, recommendation engine (in
production)
• Automatic moderation of web discussions (PoC)
• Deduction of customer demographics (in production)
Information professionals' changing role
What is the role of information professionals in the age of AI?
• Machine’s teacher
• Quality assessor, quality control manager
• Curator and valuer of metadata
• Customer value assessor
• Publisher of (archived) content
New skills are needed
• Comprehension of the methods to assess the opportunities available
• Technical know-how
Information professional and the machine need to coexist
Automated metadata generation projects at Yle
Automated metadata projects at Yle

Contenu connexe

En vedette

Data Journalism and Social Media, Media Archives as Information Service Provi...
Data Journalism and Social Media, Media Archives as Information Service Provi...Data Journalism and Social Media, Media Archives as Information Service Provi...
Data Journalism and Social Media, Media Archives as Information Service Provi...FIAT/IFTA
 
Anne couteux - Audiovisual archiving at Ina
Anne couteux - Audiovisual archiving at InaAnne couteux - Audiovisual archiving at Ina
Anne couteux - Audiovisual archiving at InaFIAT/IFTA
 
Hernani Heffner - FIAT/IFTA Cinemateca do MAM
Hernani Heffner - FIAT/IFTA Cinemateca do MAMHernani Heffner - FIAT/IFTA Cinemateca do MAM
Hernani Heffner - FIAT/IFTA Cinemateca do MAMFIAT/IFTA
 
Gracia ramirez - Czechoslovakia 1968: U.S. Propaganda at a Turning Point
Gracia ramirez -  Czechoslovakia 1968: U.S. Propaganda at a Turning PointGracia ramirez -  Czechoslovakia 1968: U.S. Propaganda at a Turning Point
Gracia ramirez - Czechoslovakia 1968: U.S. Propaganda at a Turning PointFIAT/IFTA
 
Results of the 2nd fiatifta mam survey - Declercq, Stanz
Results of the 2nd fiatifta mam survey - Declercq, Stanz Results of the 2nd fiatifta mam survey - Declercq, Stanz
Results of the 2nd fiatifta mam survey - Declercq, Stanz FIAT/IFTA
 
Richard Legay - May 1968 in Paris lived and told by peripheral radio stations
Richard Legay - May 1968 in Paris lived and told by peripheral radio stationsRichard Legay - May 1968 in Paris lived and told by peripheral radio stations
Richard Legay - May 1968 in Paris lived and told by peripheral radio stationsFIAT/IFTA
 
Automagically archiving the bbc's tv programmes - 2017 Dent, Allcorn
Automagically archiving the bbc's tv programmes - 2017 Dent, Allcorn Automagically archiving the bbc's tv programmes - 2017 Dent, Allcorn
Automagically archiving the bbc's tv programmes - 2017 Dent, Allcorn FIAT/IFTA
 
Break out: Project Communication and Dissemination - Jeroen Poppe
Break out: Project Communication and Dissemination - Jeroen PoppeBreak out: Project Communication and Dissemination - Jeroen Poppe
Break out: Project Communication and Dissemination - Jeroen Poppeimec.archive
 
FIAT/IFTA MMC Seminar May 2015. MAM and Metadata. David Klee. Univision
FIAT/IFTA MMC Seminar May 2015. MAM and Metadata. David Klee. UnivisionFIAT/IFTA MMC Seminar May 2015. MAM and Metadata. David Klee. Univision
FIAT/IFTA MMC Seminar May 2015. MAM and Metadata. David Klee. UnivisionFIAT/IFTA
 
Europeana Uncensored Keynote at FIAT/IFTA World Conference 2014, Harry Verway...
Europeana Uncensored Keynote at FIAT/IFTA World Conference 2014, Harry Verway...Europeana Uncensored Keynote at FIAT/IFTA World Conference 2014, Harry Verway...
Europeana Uncensored Keynote at FIAT/IFTA World Conference 2014, Harry Verway...FIAT/IFTA
 
Film processing in a digital wold, Jean Varra | Ina
Film processing in a digital wold, Jean Varra | InaFilm processing in a digital wold, Jean Varra | Ina
Film processing in a digital wold, Jean Varra | InaFIAT/IFTA
 
Private broadcast, public access. digitisation and semiautomatic indexation o...
Private broadcast, public access. digitisation and semiautomatic indexation o...Private broadcast, public access. digitisation and semiautomatic indexation o...
Private broadcast, public access. digitisation and semiautomatic indexation o...FIAT/IFTA
 
archives 2020 - Derighetti, Marco
archives 2020 - Derighetti, Marco archives 2020 - Derighetti, Marco
archives 2020 - Derighetti, Marco FIAT/IFTA
 
Todd M. Goehle - Media, Activism, and Democratization: The News Coverage and ...
Todd M. Goehle - Media, Activism, and Democratization: The News Coverage and ...Todd M. Goehle - Media, Activism, and Democratization: The News Coverage and ...
Todd M. Goehle - Media, Activism, and Democratization: The News Coverage and ...FIAT/IFTA
 
TOOLS and SOLUTIONS, Steny Solitude, Perfect Memory
TOOLS and SOLUTIONS, Steny Solitude, Perfect MemoryTOOLS and SOLUTIONS, Steny Solitude, Perfect Memory
TOOLS and SOLUTIONS, Steny Solitude, Perfect MemoryFIAT/IFTA
 
where's the line,the intersection of cloud based and internal mam systems - K...
where's the line,the intersection of cloud based and internal mam systems - K...where's the line,the intersection of cloud based and internal mam systems - K...
where's the line,the intersection of cloud based and internal mam systems - K...FIAT/IFTA
 
FIAT/IFTA MMC Seminar May 2015. The BBC Twitter Archive. Carl Davies. BBC Arc...
FIAT/IFTA MMC Seminar May 2015. The BBC Twitter Archive. Carl Davies. BBC Arc...FIAT/IFTA MMC Seminar May 2015. The BBC Twitter Archive. Carl Davies. BBC Arc...
FIAT/IFTA MMC Seminar May 2015. The BBC Twitter Archive. Carl Davies. BBC Arc...FIAT/IFTA
 
S.O.S The Live Romanian Revolution, Save Your Archive, Irina Negaro, TVR
S.O.S The Live Romanian Revolution, Save Your Archive, Irina Negaro, TVRS.O.S The Live Romanian Revolution, Save Your Archive, Irina Negaro, TVR
S.O.S The Live Romanian Revolution, Save Your Archive, Irina Negaro, TVRFIAT/IFTA
 

En vedette (18)

Data Journalism and Social Media, Media Archives as Information Service Provi...
Data Journalism and Social Media, Media Archives as Information Service Provi...Data Journalism and Social Media, Media Archives as Information Service Provi...
Data Journalism and Social Media, Media Archives as Information Service Provi...
 
Anne couteux - Audiovisual archiving at Ina
Anne couteux - Audiovisual archiving at InaAnne couteux - Audiovisual archiving at Ina
Anne couteux - Audiovisual archiving at Ina
 
Hernani Heffner - FIAT/IFTA Cinemateca do MAM
Hernani Heffner - FIAT/IFTA Cinemateca do MAMHernani Heffner - FIAT/IFTA Cinemateca do MAM
Hernani Heffner - FIAT/IFTA Cinemateca do MAM
 
Gracia ramirez - Czechoslovakia 1968: U.S. Propaganda at a Turning Point
Gracia ramirez -  Czechoslovakia 1968: U.S. Propaganda at a Turning PointGracia ramirez -  Czechoslovakia 1968: U.S. Propaganda at a Turning Point
Gracia ramirez - Czechoslovakia 1968: U.S. Propaganda at a Turning Point
 
Results of the 2nd fiatifta mam survey - Declercq, Stanz
Results of the 2nd fiatifta mam survey - Declercq, Stanz Results of the 2nd fiatifta mam survey - Declercq, Stanz
Results of the 2nd fiatifta mam survey - Declercq, Stanz
 
Richard Legay - May 1968 in Paris lived and told by peripheral radio stations
Richard Legay - May 1968 in Paris lived and told by peripheral radio stationsRichard Legay - May 1968 in Paris lived and told by peripheral radio stations
Richard Legay - May 1968 in Paris lived and told by peripheral radio stations
 
Automagically archiving the bbc's tv programmes - 2017 Dent, Allcorn
Automagically archiving the bbc's tv programmes - 2017 Dent, Allcorn Automagically archiving the bbc's tv programmes - 2017 Dent, Allcorn
Automagically archiving the bbc's tv programmes - 2017 Dent, Allcorn
 
Break out: Project Communication and Dissemination - Jeroen Poppe
Break out: Project Communication and Dissemination - Jeroen PoppeBreak out: Project Communication and Dissemination - Jeroen Poppe
Break out: Project Communication and Dissemination - Jeroen Poppe
 
FIAT/IFTA MMC Seminar May 2015. MAM and Metadata. David Klee. Univision
FIAT/IFTA MMC Seminar May 2015. MAM and Metadata. David Klee. UnivisionFIAT/IFTA MMC Seminar May 2015. MAM and Metadata. David Klee. Univision
FIAT/IFTA MMC Seminar May 2015. MAM and Metadata. David Klee. Univision
 
Europeana Uncensored Keynote at FIAT/IFTA World Conference 2014, Harry Verway...
Europeana Uncensored Keynote at FIAT/IFTA World Conference 2014, Harry Verway...Europeana Uncensored Keynote at FIAT/IFTA World Conference 2014, Harry Verway...
Europeana Uncensored Keynote at FIAT/IFTA World Conference 2014, Harry Verway...
 
Film processing in a digital wold, Jean Varra | Ina
Film processing in a digital wold, Jean Varra | InaFilm processing in a digital wold, Jean Varra | Ina
Film processing in a digital wold, Jean Varra | Ina
 
Private broadcast, public access. digitisation and semiautomatic indexation o...
Private broadcast, public access. digitisation and semiautomatic indexation o...Private broadcast, public access. digitisation and semiautomatic indexation o...
Private broadcast, public access. digitisation and semiautomatic indexation o...
 
archives 2020 - Derighetti, Marco
archives 2020 - Derighetti, Marco archives 2020 - Derighetti, Marco
archives 2020 - Derighetti, Marco
 
Todd M. Goehle - Media, Activism, and Democratization: The News Coverage and ...
Todd M. Goehle - Media, Activism, and Democratization: The News Coverage and ...Todd M. Goehle - Media, Activism, and Democratization: The News Coverage and ...
Todd M. Goehle - Media, Activism, and Democratization: The News Coverage and ...
 
TOOLS and SOLUTIONS, Steny Solitude, Perfect Memory
TOOLS and SOLUTIONS, Steny Solitude, Perfect MemoryTOOLS and SOLUTIONS, Steny Solitude, Perfect Memory
TOOLS and SOLUTIONS, Steny Solitude, Perfect Memory
 
where's the line,the intersection of cloud based and internal mam systems - K...
where's the line,the intersection of cloud based and internal mam systems - K...where's the line,the intersection of cloud based and internal mam systems - K...
where's the line,the intersection of cloud based and internal mam systems - K...
 
FIAT/IFTA MMC Seminar May 2015. The BBC Twitter Archive. Carl Davies. BBC Arc...
FIAT/IFTA MMC Seminar May 2015. The BBC Twitter Archive. Carl Davies. BBC Arc...FIAT/IFTA MMC Seminar May 2015. The BBC Twitter Archive. Carl Davies. BBC Arc...
FIAT/IFTA MMC Seminar May 2015. The BBC Twitter Archive. Carl Davies. BBC Arc...
 
S.O.S The Live Romanian Revolution, Save Your Archive, Irina Negaro, TVR
S.O.S The Live Romanian Revolution, Save Your Archive, Irina Negaro, TVRS.O.S The Live Romanian Revolution, Save Your Archive, Irina Negaro, TVR
S.O.S The Live Romanian Revolution, Save Your Archive, Irina Negaro, TVR
 

Similaire à Automated metadata projects at Yle

The National Library of Australia's New Discovery Service
The National Library of Australia's New Discovery ServiceThe National Library of Australia's New Discovery Service
The National Library of Australia's New Discovery ServiceOCLC Research
 
Prototype Phase Kick-off Event and Ceremony
Prototype Phase Kick-off Event and CeremonyPrototype Phase Kick-off Event and Ceremony
Prototype Phase Kick-off Event and CeremonyArchiver
 
Europeana Network Association Members Council Meeting, Copenhagen by Stephan ...
Europeana Network Association Members Council Meeting, Copenhagen by Stephan ...Europeana Network Association Members Council Meeting, Copenhagen by Stephan ...
Europeana Network Association Members Council Meeting, Copenhagen by Stephan ...Europeana
 
AMIA: Examining AV Enterprise at a Regional Academic Archive
AMIA: Examining AV Enterprise at a Regional Academic ArchiveAMIA: Examining AV Enterprise at a Regional Academic Archive
AMIA: Examining AV Enterprise at a Regional Academic ArchiveJessica Breiman
 
IMPACT Final Event 26-06-2012 - Use of IMPACT tools in the Europeana Newspap...
IMPACT Final Event 26-06-2012  - Use of IMPACT tools in the Europeana Newspap...IMPACT Final Event 26-06-2012  - Use of IMPACT tools in the Europeana Newspap...
IMPACT Final Event 26-06-2012 - Use of IMPACT tools in the Europeana Newspap...IMPACT Centre of Competence
 
The Europeana Newspapers Project at IMPACT Final Event
The Europeana Newspapers Project at IMPACT Final EventThe Europeana Newspapers Project at IMPACT Final Event
The Europeana Newspapers Project at IMPACT Final EventEuropeana Newspapers
 
NeoLibre for the Latvian Society of the Blind
NeoLibre for the Latvian Society of the BlindNeoLibre for the Latvian Society of the Blind
NeoLibre for the Latvian Society of the BlindNeoLibre
 
Dag Hensten - Nasjonalmuseet collections online
Dag Hensten - Nasjonalmuseet collections onlineDag Hensten - Nasjonalmuseet collections online
Dag Hensten - Nasjonalmuseet collections onlinelab_SNG
 
Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...
Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...
Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...Nuno Freire
 
SCAPE Information Day at BL - Some of the SCAPE Outputs Available
SCAPE Information Day at BL - Some of the SCAPE Outputs AvailableSCAPE Information Day at BL - Some of the SCAPE Outputs Available
SCAPE Information Day at BL - Some of the SCAPE Outputs AvailableSCAPE Project
 
2014 06-04-presentation-mdn-2014
2014 06-04-presentation-mdn-20142014 06-04-presentation-mdn-2014
2014 06-04-presentation-mdn-2014Christophe Debruyne
 
RTÉ Content Discovery Project - Christophe Debruyne
RTÉ Content Discovery Project - Christophe DebruyneRTÉ Content Discovery Project - Christophe Debruyne
RTÉ Content Discovery Project - Christophe Debruynedri_ireland
 
VERDOODT Measuring clouds. A large scale acquisition and preservation service...
VERDOODT Measuring clouds. A large scale acquisition and preservation service...VERDOODT Measuring clouds. A large scale acquisition and preservation service...
VERDOODT Measuring clouds. A large scale acquisition and preservation service...FIAT/IFTA
 
SoundSoftware.ac.uk: Sustainable software for audio and music research (DMRN 5+)
SoundSoftware.ac.uk: Sustainable software for audio and music research (DMRN 5+)SoundSoftware.ac.uk: Sustainable software for audio and music research (DMRN 5+)
SoundSoftware.ac.uk: Sustainable software for audio and music research (DMRN 5+)SoundSoftware ac.uk
 
Summary of Day 1
Summary of Day 1Summary of Day 1
Summary of Day 1Europeana
 
OOR Architecture - Towards a Network of Linked Ontology Repositories
OOR Architecture - Towards a Network of Linked Ontology RepositoriesOOR Architecture - Towards a Network of Linked Ontology Repositories
OOR Architecture - Towards a Network of Linked Ontology RepositoriesKim Viljanen
 
Good Enough: Finding what works for processing born-digital archives at the B...
Good Enough: Finding what works for processing born-digital archives at the B...Good Enough: Finding what works for processing born-digital archives at the B...
Good Enough: Finding what works for processing born-digital archives at the B...mikeum
 
The European(a) Newspapers Project
The European(a) Newspapers ProjectThe European(a) Newspapers Project
The European(a) Newspapers ProjectEuropeana Newspapers
 

Similaire à Automated metadata projects at Yle (20)

The National Library of Australia's New Discovery Service
The National Library of Australia's New Discovery ServiceThe National Library of Australia's New Discovery Service
The National Library of Australia's New Discovery Service
 
Prototype Phase Kick-off Event and Ceremony
Prototype Phase Kick-off Event and CeremonyPrototype Phase Kick-off Event and Ceremony
Prototype Phase Kick-off Event and Ceremony
 
Europeana Network Association Members Council Meeting, Copenhagen by Stephan ...
Europeana Network Association Members Council Meeting, Copenhagen by Stephan ...Europeana Network Association Members Council Meeting, Copenhagen by Stephan ...
Europeana Network Association Members Council Meeting, Copenhagen by Stephan ...
 
AMIA: Examining AV Enterprise at a Regional Academic Archive
AMIA: Examining AV Enterprise at a Regional Academic ArchiveAMIA: Examining AV Enterprise at a Regional Academic Archive
AMIA: Examining AV Enterprise at a Regional Academic Archive
 
IMPACT Final Event 26-06-2012 - Use of IMPACT tools in the Europeana Newspap...
IMPACT Final Event 26-06-2012  - Use of IMPACT tools in the Europeana Newspap...IMPACT Final Event 26-06-2012  - Use of IMPACT tools in the Europeana Newspap...
IMPACT Final Event 26-06-2012 - Use of IMPACT tools in the Europeana Newspap...
 
The Europeana Newspapers Project at IMPACT Final Event
The Europeana Newspapers Project at IMPACT Final EventThe Europeana Newspapers Project at IMPACT Final Event
The Europeana Newspapers Project at IMPACT Final Event
 
NeoLibre for the Latvian Society of the Blind
NeoLibre for the Latvian Society of the BlindNeoLibre for the Latvian Society of the Blind
NeoLibre for the Latvian Society of the Blind
 
Hamooya
HamooyaHamooya
Hamooya
 
Dag Hensten - Nasjonalmuseet collections online
Dag Hensten - Nasjonalmuseet collections onlineDag Hensten - Nasjonalmuseet collections online
Dag Hensten - Nasjonalmuseet collections online
 
Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...
Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...
Metadata Aggregation: Assessing the Application of IIIF and Sitemaps within C...
 
SCAPE Information Day at BL - Some of the SCAPE Outputs Available
SCAPE Information Day at BL - Some of the SCAPE Outputs AvailableSCAPE Information Day at BL - Some of the SCAPE Outputs Available
SCAPE Information Day at BL - Some of the SCAPE Outputs Available
 
2014 06-04-presentation-mdn-2014
2014 06-04-presentation-mdn-20142014 06-04-presentation-mdn-2014
2014 06-04-presentation-mdn-2014
 
RTÉ Content Discovery Project - Christophe Debruyne
RTÉ Content Discovery Project - Christophe DebruyneRTÉ Content Discovery Project - Christophe Debruyne
RTÉ Content Discovery Project - Christophe Debruyne
 
VERDOODT Measuring clouds. A large scale acquisition and preservation service...
VERDOODT Measuring clouds. A large scale acquisition and preservation service...VERDOODT Measuring clouds. A large scale acquisition and preservation service...
VERDOODT Measuring clouds. A large scale acquisition and preservation service...
 
SoundSoftware.ac.uk: Sustainable software for audio and music research (DMRN 5+)
SoundSoftware.ac.uk: Sustainable software for audio and music research (DMRN 5+)SoundSoftware.ac.uk: Sustainable software for audio and music research (DMRN 5+)
SoundSoftware.ac.uk: Sustainable software for audio and music research (DMRN 5+)
 
Summary of Day 1
Summary of Day 1Summary of Day 1
Summary of Day 1
 
OOR Architecture - Towards a Network of Linked Ontology Repositories
OOR Architecture - Towards a Network of Linked Ontology RepositoriesOOR Architecture - Towards a Network of Linked Ontology Repositories
OOR Architecture - Towards a Network of Linked Ontology Repositories
 
Good Enough: Finding what works for processing born-digital archives at the B...
Good Enough: Finding what works for processing born-digital archives at the B...Good Enough: Finding what works for processing born-digital archives at the B...
Good Enough: Finding what works for processing born-digital archives at the B...
 
The European(a) Newspapers Project
The European(a) Newspapers ProjectThe European(a) Newspapers Project
The European(a) Newspapers Project
 
Ukgs2013 dave pattern
Ukgs2013 dave patternUkgs2013 dave pattern
Ukgs2013 dave pattern
 

Plus de FIAT/IFTA

2021 FIAT/IFTA Timeline Survey
2021 FIAT/IFTA Timeline Survey2021 FIAT/IFTA Timeline Survey
2021 FIAT/IFTA Timeline SurveyFIAT/IFTA
 
20211021 FIAT/IFTA Most Wanted List
20211021 FIAT/IFTA Most Wanted List20211021 FIAT/IFTA Most Wanted List
20211021 FIAT/IFTA Most Wanted ListFIAT/IFTA
 
WARBURTON FIAT/IFTA Timeline Survey results 2020
WARBURTON FIAT/IFTA Timeline Survey results 2020WARBURTON FIAT/IFTA Timeline Survey results 2020
WARBURTON FIAT/IFTA Timeline Survey results 2020FIAT/IFTA
 
OOMEN MEZARIS ReTV
OOMEN MEZARIS ReTVOOMEN MEZARIS ReTV
OOMEN MEZARIS ReTVFIAT/IFTA
 
BUCHMAN Digitisation of quarter inch audio tapes at DR (FRAME Expert)
BUCHMAN Digitisation of quarter inch audio tapes at DR (FRAME Expert)BUCHMAN Digitisation of quarter inch audio tapes at DR (FRAME Expert)
BUCHMAN Digitisation of quarter inch audio tapes at DR (FRAME Expert)FIAT/IFTA
 
CULJAT (FRAME Expert) Public procurement in audiovisual digitisation at RTÉ
CULJAT (FRAME Expert) Public procurement in audiovisual digitisation at RTÉCULJAT (FRAME Expert) Public procurement in audiovisual digitisation at RTÉ
CULJAT (FRAME Expert) Public procurement in audiovisual digitisation at RTÉFIAT/IFTA
 
HULSENBECK Value Use and Copyright Comission initiatives
HULSENBECK Value Use and Copyright Comission initiativesHULSENBECK Value Use and Copyright Comission initiatives
HULSENBECK Value Use and Copyright Comission initiativesFIAT/IFTA
 
WILSON Film digitisation at BBC Scotland
WILSON Film digitisation at BBC ScotlandWILSON Film digitisation at BBC Scotland
WILSON Film digitisation at BBC ScotlandFIAT/IFTA
 
GOLODNOFF We need to make our past accessible!
GOLODNOFF We need to make our past accessible!GOLODNOFF We need to make our past accessible!
GOLODNOFF We need to make our past accessible!FIAT/IFTA
 
LORENZ Building an integrated digital media archive and legal deposit
Automated metadata projects at Yle

  • 1. Automated metadata generation projects at Yle Elina Selkälä Manager, archive publishing and metadata Yle Archives elina.selkala@yle.fi FIAT/IFTA Media Management Seminar Lugano 8.-9.6.2017
  • 2. Agenda • Yle in a nutshell • Yle Archives, collections and materials • Production of metadata at Yle • What we experimented on: examples of automatic content analysis projects • What we learned • What is happening next • What is the role of the information professional in the age of AI?
  • 3. This is Yle • Public service broadcasting company • 3 nationwide television channels and 6 radio channels, 24 regional radio stations • Extensive online presence: yle.fi, svenska.yle.fi, Yle Areena, Yle Elävä arkisto • In addition to Finnish and Swedish, broadcasts in 11 languages, e.g. Sami, English and Russian • National programming hours per year: 50,000 hours of radio programming; 20,000 hours of TV programming; 5,000 hours of audio content online; 15,000 hours of video content online
  • 4. Yle Archives • Archives and catalogues radio and TV programmes produced and co-produced by Yle • Fosters and curates the archive collections of Yle • Offers information services and training for Yle staff • Publishes archive material online. Collections: • TV and radio materials, photographs, sound effects and music • Archived in the Media Asset Management System "Metro" (Avid) • Represent an important part of Finnish cultural heritage • The archive also holds sheet music, books and online resources, e.g. papers, magazines, databases
  • 5. Radio and TV Archive collections. TV materials: • TV programmes and raw material from 1957 onwards and film materials from 1906 onwards • Collection consists of around 700,000 programmes and clips • All Yle productions / co-productions have been systematically archived since 1984 • Archiving in native digital form since 2009 • Around 10,000 hours of video content are archived per year • Relatively good metadata. Radio materials: • Yle-produced programmes and raw material, oldest surviving clip from 1935 • The collection consists of around 2 million programmes and clips • Currently around 10% of radio transmissions are archived (e.g. news and works of art) • Archiving in native digital form from the beginning of the 2000s • Around 20,000 hours of audio content are archived per year • Metadata of varying quality
  • 6. Metadata production at Yle: archived radio and TV programmes • Yle's archive materials are widely used as whole programmes (reruns) and clips • Metadata is incomplete or insufficient for many reasons → hinders findability and safe re-use • Alongside the digitization of tape collections, the related programme metadata is updated and improved • A huge endeavour, so prioritization is needed (most used material, customer orders) • Descriptive metadata is produced manually • Done by the Archives' information specialists (about 15 people)
  • 7. Metadata production at Yle: new audio and video content • Metadata production is decentralized: metadata is added and stored throughout the production and publishing process; some metadata comes from production and publishing systems, while descriptive metadata is filled out manually; done by Yle staff (production coordinators, editors, producers, etc.) • Company-wide Archiving Policy: defines the responsibilities, contents to be archived, metadata and formats • Growing amount of published content • Metadata is used for archiving and reuse purposes, as well as reporting • New needs for metadata: improve discoverability and visibility on
  • 8. Automated content analysis projects at Yle, fall 2016 • Automated content analysis (virtual) team with participants from different parts of Yle • Improve discoverability on web services (Yle Areena) • Improve discoverability from archive databases • New ways to subtitle video content • Management of raw materials and versions • Team's goals were to: • Learn about AI, machine learning and automatic content analysis methods in theory and practice • Carry out pilot projects (PoCs) with some companies • Find solutions for automated metadata production in practice
  • 9. Case 1: Automatic content analysis of TV programmes (1/2). Pilot project with Valossa Labs. Goal: • Test and evaluate the quality and suitability of automatically produced (descriptive) metadata in Yle's metadata production. Tested methods: • Text analysis of subtitles → tagging, annotation • Image recognition: object and face recognition • OCR of captions • Automatic segmentation
  • 10. Case 1: Automatic content analysis of TV programmes (2/2). Results: • Face recognition works well; object recognition is somewhat unreliable and too detailed • Subtitles could also be used for content analysis • Automatic segmentation (scenes, inserts) works well • The test period was too short to gain experience of the system's learning capabilities • Speech recognition alongside image recognition would probably be beneficial, but the tested application did not support this feature
  • 11. Case 2: Automatic content analysis of audio content (1/2). Pilot project with Lingsoft. Goal: • Test and evaluate the quality of speech and music recognition and automatic annotation. Tested methods: • Speech recognition → textual data for text analysis • Automatic annotation and indexing • Music recognition (distinguish music from speech)
  • 12. Case 2: Automatic content analysis of audio content (2/2). Results: • The quality of the audio and the speaker's manner of speaking have a significant impact • Accuracy of the transcription is sufficient for annotation → relevant keywords, tags • Music recognition works fairly well • Speaker recognition would be useful, but the tested service did not support this feature
  • 13. Case 3: Automatic content analysis of Yle Areena content (1/3). Pilot project with Qvik, Valossa Labs and Aalto University. Goal: • Improve findability and usability of audio and video content in the Yle Areena online service. Three experiments: • Speech recognition: time-code based transcriptions of audio files • Image / structure recognition: fast forward over opening and closing credits, inserts • Text analysis: automatic annotation. (Diagram labels: Yle Areena content; Media; Metadata; Automatic content analysis; New functionalities for the end user)
  • 14. Case 3: Automatic content analysis of Yle Areena content (2/3). Speech-to-text and text analysis: • Time-coded transcription and automatic annotation of audio and video content. Results: • Transcriptions were added to the Yle Areena web page and search engines were able to index the contents → searching spoken content became possible • Identification of relevant concepts was successful
  • 15. Case 3: Automatic content analysis of Yle Areena content (3/3). Identifying the structure of the content: • Automatic segmentation and identification of recurrent elements (opening and closing credits) • Object recognition. Results: • Recurring elements (based on images) and topics (based on subtitling) can be identified → intelligent fast forward is possible (Demo) • Object recognition is somewhat unreliable
  • 16. Lessons learned • Define needs, requirements, and goals (what is needed and who needs it; costs and benefits) • Define how success is measured (evaluation criteria) • Plan how projects are carried through (time and other resources) • Cooperation with outside partners (ready-made test material packages) • Contract and copyright issues • Share your information
  • 17. Ongoing projects. Production: • Robot journalism, Voitto-robotti (pilot project) • Automatic annotation of Yle's web articles (in production). Publishing: • Automatic metadata production by speech recognition and image recognition (PoC) • Speech recognition in subtitling (PoC). Consumption / use: • Recommendation for Yle Areena content (in production) • Yle Uutisvahti application, recommendation engine (in production) • Automatic moderation of web discussions (PoC) • Deduction of customer demographics (in production)
  • 18. The information professional's changing role. What is the role of information professionals in the age of AI? • The machine's teacher • Quality assessor, quality control manager • Curator and valuer of metadata • Customer value assessor • Publisher of (archived) content. New skills are needed: • Understanding of the methods, in order to assess the opportunities available • Technical know-how. The information professional and the machine need to coexist
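
The time-coded transcription and automatic annotation tested in Case 3 can be illustrated with a small sketch. It is not the method used by Yle or its pilot partners, whose tools the slides do not describe. It assumes a speech recognizer has already produced segments with start and end times, and it uses plain word counts as a stand-in for real text analysis. The names `Segment`, `suggest_tags` and `find_mentions`, and the tiny stop-word list, are hypothetical.

```python
import re
from collections import Counter
from dataclasses import dataclass


@dataclass
class Segment:
    """One time-coded piece of a transcript (times in seconds)."""
    start: float
    end: float
    text: str


# Illustrative stop-word list only; a real system would use a
# language-specific list or a proper linguistic analyser.
STOPWORDS = {"the", "a", "an", "and", "of", "in", "on", "to", "is", "was", "for", "with"}


def tokenize(text):
    """Lower-case the text and keep alphabetic words that are not stop words."""
    words = re.findall(r"[^\W\d_]+", text.lower())
    return [w for w in words if w not in STOPWORDS and len(w) > 2]


def suggest_tags(segments, top_n=3):
    """Return (word, count, first_start_time) for the most frequent words."""
    counts = Counter()
    first_seen = {}
    for seg in segments:
        for word in tokenize(seg.text):
            counts[word] += 1
            first_seen.setdefault(word, seg.start)
    # Sort by descending count, then alphabetically, so output is deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [(word, n, first_seen[word]) for word, n in ranked[:top_n]]


def find_mentions(segments, term):
    """Return the start times of all segments that mention the term."""
    term = term.lower()
    return [seg.start for seg in segments if term in tokenize(seg.text)]


if __name__ == "__main__":
    demo = [
        Segment(0.0, 4.0, "The election results in Helsinki"),
        Segment(4.0, 9.0, "Helsinki voters and the election"),
        Segment(9.0, 12.0, "Weather in Lapland"),
    ]
    print(suggest_tags(demo, top_n=2))
    print(find_mentions(demo, "Helsinki"))
```

Because each tag keeps the time of its first mention, the same data could support both search over spoken content and jumping to the relevant point in a programme.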