Recent Advances in Computer Vision

         Ming-Hsuan Yang
Computer vision
• Holy grail – tell a story from an image
History
• “In the 1960s, almost no one realized that
  machine vision was difficult.” – David Marr,
  1982
• Marvin Minsky asked Gerald Jay Sussman
  to “spend the summer linking a camera to
  a computer and getting the computer to
  describe what it saw” – Crevier, 1993
• 40+ years later, we are still working on this
1970s
1980s
1990s


• Face detection
• Particle filter
• Pfinder
• Normalized cut
2000s
• SIFT
  –   Mosaicing, panorama
  –   Object recognition
  –   Photo tourism, Photosynth
  –   Human detection

• AdaBoost-based face detector
Related topics
Conferences
• CVPR – Computer Vision and Pattern
  Recognition, since 1983
  – Annual, held in US
• ICCV – International Conference on
  Computer Vision, since 1987
  – Every other year, alternates among 3 continents
• ECCV – European Conference on
  Computer Vision, since 1990
  – Every other year, held in Europe
Conferences (cont’d)
• ACCV – Asian Conference on Computer
  Vision
• BMVC – British Machine Vision
  Conference
• ICPR – International Conference on
  Pattern Recognition
• SIGGRAPH
• NIPS – Neural Information Processing
  Systems
Conferences (cont’d)
• MICCAI – Medical Image Computing and
  Computer-Assisted Intervention
• ISBI – International Symposium on Biomedical
  Imaging
• FG – IEEE Conference on Automatic Face and
  Gesture Recognition
• ICCP, ICDR, ICVS, DAGM, CAIP, MVA, AAAI,
  IJCAI, ICML, ICRA, ICASSP, ICIP, SPIE, DCC,
  WACV, 3DPVT, ACM Multimedia, ICME, …
Conference organization
• General chairs: administration
• Program chairs: handling papers
• Area chairs:
  –   Assign reviewers
  –   Read reviews and rebuttals
  –   Consolidation reports
  –   Recommendation
• Reviewers
• Authors
Review process
• Submission
• CVPR/ECCV/ICCV
  – Double blind review
  – Program chairs: assign papers to area chairs
  – Area chairs: assign papers to reviewers
• Rebuttal
Area chair meetings
• 2-day meetings
• Several panels
• Each paper is reviewed by at least 2 area
  chairs
• Buddy system
• Area chairs make recommendations
• Program chairs make final decisions
Conference acceptance rates
•   ICCV/CVPR/ECCV: ~ 30%
•   ACCV (2009): ~ 30%
•   NIPS: ~ 30%
•   BMVC: ~ 40%
•   ICIP: ~ 45%
•   ICPR: ~ 55%

• Disclaimer
    – low acceptance rate = high quality?
CVPR
[Charts: Submission, Oral, Overall]
ICCV
[Charts: Submission, Oral, Overall]
ECCV
[Charts: Submission, Oral, Overall]
Journals
• PAMI – IEEE Transactions on Pattern
  Analysis and Machine Intelligence, since
  1979 (impact factor: 5.96, #1 in all engineering
  and AI, top-ranked IEEE and CS journal)
• IJCV – International Journal of Computer
  Vision, since 1988 (impact factor: 5.36, #2 in
  all engineering and AI)
• CVIU – Computer Vision and Image
  Understanding, since 1972 (impact factor:
  2.20)
Journals (cont’d)
• IVC – Image and Vision Computing
• IEEE Transactions on Medical Imaging
• TIP – IEEE Transactions on Image
  Processing
• MVA – Machine Vision and Applications
• PR – Pattern Recognition
• TM – IEEE Transactions on Multimedia
• …
PAMI review process
• Editor-in-chief (EIC) assigns papers to
  associate editors (AE)
• AE assigns reviewers
• First-round review: 3-6 months
  –   Accept as is
  –   Accept with minor revision
  –   Major revision
  –   Resubmit as new
  –   Reject
PAMI review process (cont’d)
• Second-round review: 2-4 months
  – Accept as is
  – Accept with minor revision
  – Reject
• EIC makes final decision
• Overall turn-around time: 6 to 12 months
• Rule of thumb: 30% additional work
  beyond a CVPR/ICCV/ECCV paper
IJCV/CVIU review process
• Similar formats
• CVIU has roughly the same turn-around
  time as PAMI
• IJCV tends to have longer turn-around
  time
Journal acceptance rate
• PAMI, IJCV: ~ 30%
• CVIU: ~ 30%
Tools
• Google Scholar, CiteSeer
• h-index
• Software: Publish or Perish

• Disclaimer:
  – h-index = significance?
  – # of citations = significance?
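For reference, the h-index is the largest number h such that at least h papers have h or more citations each. A minimal sketch of that definition follows; the function name `h_index` and the sample citation counts are made up for illustration and are not taken from the slides or from any particular tool.

```python
def h_index(citations):
    """Return the largest h such that at least h papers have >= h citations each."""
    # Rank papers from most cited to least cited.
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank  # the top `rank` papers all have at least `rank` citations
        else:
            break
    return h


# Hypothetical citation counts for five papers.
print(h_index([10, 8, 5, 4, 3]))  # → 4
```

A single highly cited paper does not raise the value much: `[25, 8, 5, 3, 3]` gives 3, which is one reason the disclaimer above applies.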
How to get your papers rejected?
• Refer to Jim Kajiya (SIGGRAPH 93 papers
  chair): How to get your SIGGRAPH paper
  rejected?
• Do not
  –   Pay attention to review process
  –   Put yourself in a reviewer's perspective
  –   Put the work in the right context
  –   Carry out a sufficient number of experiments
  –   Compare with state-of-the-art algorithms
  –   Pay attention to writing
Pay attention to review process
• Learn how others/you can pick apart a
  paper
• Learn from others’ mistakes
• Learn how to write good papers
• Learn what it takes to get a paper
  published
Put yourself in the reviewer's position
•   What are the contributions?
•   Does it advance the science in the field?
•   Why should you accept this paper?
•   Is this paper a case study?
•   Is this paper interesting?
•   What is the audience?
•   Does anyone care about this work?
Experimental validation
•   Common data set
•   Killer data set
•   Large scale experiment
•   Evaluation metric
Compare with state of the art
• Do your homework
• Need to know what is out there
• Need to show why one’s method
  outperforms others, and in what way?
  – speed?
  – accuracy?
  – easy to implement?
  – general application?
Writing
•   Clear presentation
•   Terse
•   Careful about wording
•   Make claims with strong evidence
Review form
• Summary
• Overall Rating
  – Definite accept, weakly accept, borderline, weakly reject, definite
    reject
• Novelty
  – Very original, original, minor originality, has been done before
• Importance/relevance
  – Of broad interest, interesting to a subarea, interesting only to a
    small number of attendees, out of CVPR scope
Review form (cont’d)
• Clarity of presentation
   – Reads very well, is clear enough, difficult to read, unreadable
• Technical correctness
   – Definitely correct, probably correct but did not check completely,
     contains rectifiable errors, has major problems
• Experimental validation
   – Excellent validation or N/A (a theoretical paper), limited but
     convincing, lacking in some aspects, insufficient validation
• Additional comments
• Reviewer’s name
Challenging issues
•   Large scale
•   Unconstrained
•   Real-time
•   Robustness
•   Recover from failure – graceful degradation
Some hot topics
•   Object recognition, categorization
•   Internet scale image search
•   Video search
•   Human detection
•   3D human pose estimation
•   Computational photography
•   Scene understanding
Some hot tools
•   Prior
•   Context
•   Sparse representation
•   Multiple instance learning
•   Online learning
•   Convex optimization
•   Constraint
•   Hashing
Prior




        Torralba and Sinha ICCV 01
Prior




        Heitz and Koller ECCV 08
Prior




   Jia CVPR 08   He et al. CVPR 09
Scene understanding




          Leibe et al. CVPR 07
Image search




          Wu et al. CVPR 09
Computational photography




        Johnson and Adelson CVPR 09
Computational photography




           Ahuja et al.

Contenu connexe

En vedette

How to Read Academic Papers
How to Read Academic PapersHow to Read Academic Papers
How to Read Academic PapersJia-Bin Huang
 
Top 10 ways to get your paper rejected at #CAGJournal
Top 10 ways to get your paper rejected at #CAGJournalTop 10 ways to get your paper rejected at #CAGJournal
Top 10 ways to get your paper rejected at #CAGJournalJoaquim Jorge
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash CourseJia-Bin Huang
 
Research 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeXResearch 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeXJia-Bin Huang
 
About DevFactory
About DevFactory About DevFactory
About DevFactory HackerEarth
 
Richard Matthew Stallman - A Brief Biography
Richard Matthew Stallman - A Brief BiographyRichard Matthew Stallman - A Brief Biography
Richard Matthew Stallman - A Brief BiographyHackerEarth
 
Intel RealSense technology : Overview and demos
Intel RealSense technology : Overview and demosIntel RealSense technology : Overview and demos
Intel RealSense technology : Overview and demosZayen Chagra
 
"Computer Vision 2.0: Where We Are and Where We're Going," a Presentation fro...
"Computer Vision 2.0: Where We Are and Where We're Going," a Presentation fro..."Computer Vision 2.0: Where We Are and Where We're Going," a Presentation fro...
"Computer Vision 2.0: Where We Are and Where We're Going," a Presentation fro...Edge AI and Vision Alliance
 
미래교육을 위한 인터페이스 NUI & education
미래교육을 위한 인터페이스 NUI & education미래교육을 위한 인터페이스 NUI & education
미래교육을 위한 인터페이스 NUI & educationChoohan Cho
 
Computer vision, machine, and deep learning
Computer vision, machine, and deep learningComputer vision, machine, and deep learning
Computer vision, machine, and deep learningIgi Ardiyanto
 
Augmented reality as next UI Challenge
Augmented reality as next UI ChallengeAugmented reality as next UI Challenge
Augmented reality as next UI ChallengeZühlke
 
Evolution or Innovation
Evolution or InnovationEvolution or Innovation
Evolution or InnovationLifelog Health
 
How Computer Vision is Reshaping Real Estate Search - Andrew Flachner
How Computer Vision is Reshaping Real Estate Search - Andrew FlachnerHow Computer Vision is Reshaping Real Estate Search - Andrew Flachner
How Computer Vision is Reshaping Real Estate Search - Andrew FlachnerInman News
 
How to become a Data Scientist?
How to become a Data Scientist? How to become a Data Scientist?
How to become a Data Scientist? HackerEarth
 
Matrix Factorisation (and Dimensionality Reduction)
Matrix Factorisation (and Dimensionality Reduction)Matrix Factorisation (and Dimensionality Reduction)
Matrix Factorisation (and Dimensionality Reduction)HJ van Veen
 
Augment my reality info press-montreal2010
Augment my reality info press-montreal2010Augment my reality info press-montreal2010
Augment my reality info press-montreal2010MobileMoxie
 
"How Deep Learning Is Enabling Computer Vision Markets," a Presentation from ...
"How Deep Learning Is Enabling Computer Vision Markets," a Presentation from ..."How Deep Learning Is Enabling Computer Vision Markets," a Presentation from ...
"How Deep Learning Is Enabling Computer Vision Markets," a Presentation from ...Edge AI and Vision Alliance
 

En vedette (20)

How to Read Academic Papers
How to Read Academic PapersHow to Read Academic Papers
How to Read Academic Papers
 
Top 10 ways to get your paper rejected at #CAGJournal
Top 10 ways to get your paper rejected at #CAGJournalTop 10 ways to get your paper rejected at #CAGJournal
Top 10 ways to get your paper rejected at #CAGJournal
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash Course
 
Research 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeXResearch 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeX
 
Raskar 2012, Idea Hexagon
Raskar 2012, Idea HexagonRaskar 2012, Idea Hexagon
Raskar 2012, Idea Hexagon
 
About DevFactory
About DevFactory About DevFactory
About DevFactory
 
Richard Matthew Stallman - A Brief Biography
Richard Matthew Stallman - A Brief BiographyRichard Matthew Stallman - A Brief Biography
Richard Matthew Stallman - A Brief Biography
 
UnityでAR
UnityでARUnityでAR
UnityでAR
 
Intel RealSense technology : Overview and demos
Intel RealSense technology : Overview and demosIntel RealSense technology : Overview and demos
Intel RealSense technology : Overview and demos
 
Towards Embedded Computer Vision - New @ 2013
Towards Embedded Computer Vision - New @ 2013Towards Embedded Computer Vision - New @ 2013
Towards Embedded Computer Vision - New @ 2013
 
"Computer Vision 2.0: Where We Are and Where We're Going," a Presentation fro...
"Computer Vision 2.0: Where We Are and Where We're Going," a Presentation fro..."Computer Vision 2.0: Where We Are and Where We're Going," a Presentation fro...
"Computer Vision 2.0: Where We Are and Where We're Going," a Presentation fro...
 
미래교육을 위한 인터페이스 NUI & education
미래교육을 위한 인터페이스 NUI & education미래교육을 위한 인터페이스 NUI & education
미래교육을 위한 인터페이스 NUI & education
 
Computer vision, machine, and deep learning
Computer vision, machine, and deep learningComputer vision, machine, and deep learning
Computer vision, machine, and deep learning
 
Augmented reality as next UI Challenge
Augmented reality as next UI ChallengeAugmented reality as next UI Challenge
Augmented reality as next UI Challenge
 
Evolution or Innovation
Evolution or InnovationEvolution or Innovation
Evolution or Innovation
 
How Computer Vision is Reshaping Real Estate Search - Andrew Flachner
How Computer Vision is Reshaping Real Estate Search - Andrew FlachnerHow Computer Vision is Reshaping Real Estate Search - Andrew Flachner
How Computer Vision is Reshaping Real Estate Search - Andrew Flachner
 
How to become a Data Scientist?
How to become a Data Scientist? How to become a Data Scientist?
How to become a Data Scientist?
 
Matrix Factorisation (and Dimensionality Reduction)
Matrix Factorisation (and Dimensionality Reduction)Matrix Factorisation (and Dimensionality Reduction)
Matrix Factorisation (and Dimensionality Reduction)
 
Augment my reality info press-montreal2010
Augment my reality info press-montreal2010Augment my reality info press-montreal2010
Augment my reality info press-montreal2010
 
"How Deep Learning Is Enabling Computer Vision Markets," a Presentation from ...
"How Deep Learning Is Enabling Computer Vision Markets," a Presentation from ..."How Deep Learning Is Enabling Computer Vision Markets," a Presentation from ...
"How Deep Learning Is Enabling Computer Vision Markets," a Presentation from ...
 

Similaire à Recent Advances in Computer Vision

Artifact Evaluation Experience CGO'15 / PPoPP'15
Artifact Evaluation Experience CGO'15 / PPoPP'15Artifact Evaluation Experience CGO'15 / PPoPP'15
Artifact Evaluation Experience CGO'15 / PPoPP'15Grigori Fursin
 
e3-chap-09.ppt
e3-chap-09.ppte3-chap-09.ppt
e3-chap-09.pptKingSh2
 
Peeking behind the test: insights and innovations from the Medical Council of...
Peeking behind the test: insights and innovations from the Medical Council of...Peeking behind the test: insights and innovations from the Medical Council of...
Peeking behind the test: insights and innovations from the Medical Council of...MedCouncilCan
 
Overview of Computer Vision For Footwear Industry
Overview of Computer Vision For Footwear IndustryOverview of Computer Vision For Footwear Industry
Overview of Computer Vision For Footwear IndustryTanvir Moin
 
Operationalization of Lean thinking through Value Stream Mapping with Simulat...
Operationalization of Lean thinking through Value Stream Mapping with Simulat...Operationalization of Lean thinking through Value Stream Mapping with Simulat...
Operationalization of Lean thinking through Value Stream Mapping with Simulat...Letizia Jaccheri
 
Evaluation and User Study in HCI
Evaluation and User Study in HCIEvaluation and User Study in HCI
Evaluation and User Study in HCIByungkyu (Jay) Kang
 
lect1.pdf
lect1.pdflect1.pdf
lect1.pdfAtkaAli
 
Klaus Dieter Rossade online assessment
Klaus Dieter Rossade online assessmentKlaus Dieter Rossade online assessment
Klaus Dieter Rossade online assessmentEADTU
 
Best Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesBest Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesAlan Said
 
Human computation, crowdsourcing and social: An industrial perspective
Human computation, crowdsourcing and social: An industrial perspectiveHuman computation, crowdsourcing and social: An industrial perspective
Human computation, crowdsourcing and social: An industrial perspectiveoralonso
 
Usability Lab within Agile (by Ian Franklin at NUX Leeds January 2018)
Usability Lab within Agile (by Ian Franklin at NUX Leeds January 2018)Usability Lab within Agile (by Ian Franklin at NUX Leeds January 2018)
Usability Lab within Agile (by Ian Franklin at NUX Leeds January 2018)Northern User Experience
 

Similaire à Recent Advances in Computer Vision (20)

Dip
DipDip
Dip
 
Artifact Evaluation Experience CGO'15 / PPoPP'15
Artifact Evaluation Experience CGO'15 / PPoPP'15Artifact Evaluation Experience CGO'15 / PPoPP'15
Artifact Evaluation Experience CGO'15 / PPoPP'15
 
e3-chap-09.ppt
e3-chap-09.ppte3-chap-09.ppt
e3-chap-09.ppt
 
Evaluation techniques
Evaluation techniquesEvaluation techniques
Evaluation techniques
 
Chapter 8 Evaluation Techniques
Chapter 8 Evaluation  TechniquesChapter 8 Evaluation  Techniques
Chapter 8 Evaluation Techniques
 
Challenges in medical imaging and the VISCERAL model
Challenges in medical imaging and the VISCERAL modelChallenges in medical imaging and the VISCERAL model
Challenges in medical imaging and the VISCERAL model
 
E3 chap-09
E3 chap-09E3 chap-09
E3 chap-09
 
Peeking behind the test: insights and innovations from the Medical Council of...
Peeking behind the test: insights and innovations from the Medical Council of...Peeking behind the test: insights and innovations from the Medical Council of...
Peeking behind the test: insights and innovations from the Medical Council of...
 
Overview of Computer Vision For Footwear Industry
Overview of Computer Vision For Footwear IndustryOverview of Computer Vision For Footwear Industry
Overview of Computer Vision For Footwear Industry
 
Systematic Literature Review & Mapping
Systematic Literature Review & MappingSystematic Literature Review & Mapping
Systematic Literature Review & Mapping
 
Operationalization of Lean thinking through Value Stream Mapping with Simulat...
Operationalization of Lean thinking through Value Stream Mapping with Simulat...Operationalization of Lean thinking through Value Stream Mapping with Simulat...
Operationalization of Lean thinking through Value Stream Mapping with Simulat...
 
Evaluation and User Study in HCI
Evaluation and User Study in HCIEvaluation and User Study in HCI
Evaluation and User Study in HCI
 
Human Computer Interaction Evaluation
Human Computer Interaction EvaluationHuman Computer Interaction Evaluation
Human Computer Interaction Evaluation
 
lect1.pdf
lect1.pdflect1.pdf
lect1.pdf
 
Klaus Dieter Rossade online assessment
Klaus Dieter Rossade online assessmentKlaus Dieter Rossade online assessment
Klaus Dieter Rossade online assessment
 
Best Practices in Recommender System Challenges
Best Practices in Recommender System ChallengesBest Practices in Recommender System Challenges
Best Practices in Recommender System Challenges
 
Human computation, crowdsourcing and social: An industrial perspective
Human computation, crowdsourcing and social: An industrial perspectiveHuman computation, crowdsourcing and social: An industrial perspective
Human computation, crowdsourcing and social: An industrial perspective
 
Usability Lab within Agile (by Ian Franklin at NUX Leeds January 2018)
Usability Lab within Agile (by Ian Franklin at NUX Leeds January 2018)Usability Lab within Agile (by Ian Franklin at NUX Leeds January 2018)
Usability Lab within Agile (by Ian Franklin at NUX Leeds January 2018)
 
HCI-Lecture-1
HCI-Lecture-1HCI-Lecture-1
HCI-Lecture-1
 
E3 chap-09
E3 chap-09E3 chap-09
E3 chap-09
 

Plus de antiw

Cvpr2010 open source vision software, intro and training part viii point clou...
Cvpr2010 open source vision software, intro and training part viii point clou...Cvpr2010 open source vision software, intro and training part viii point clou...
Cvpr2010 open source vision software, intro and training part viii point clou...antiw
 
Cvpr2010 open source vision software, intro and training part vii point cloud...
Cvpr2010 open source vision software, intro and training part vii point cloud...Cvpr2010 open source vision software, intro and training part vii point cloud...
Cvpr2010 open source vision software, intro and training part vii point cloud...antiw
 
graphical models for the Internet
graphical models for the Internetgraphical models for the Internet
graphical models for the Internetantiw
 
15 pieces of advice i wish my ph d advisor had given me
15 pieces of advice i wish my ph d advisor had given me15 pieces of advice i wish my ph d advisor had given me
15 pieces of advice i wish my ph d advisor had given meantiw
 
Randy pauschtimemanagement2007
Randy pauschtimemanagement2007Randy pauschtimemanagement2007
Randy pauschtimemanagement2007antiw
 
Write a research paper howto - good presentation
Write a research paper   howto - good presentationWrite a research paper   howto - good presentation
Write a research paper howto - good presentationantiw
 
15 pieces of advice i wish my ph d advisor had given me
15 pieces of advice i wish my ph d advisor had given me15 pieces of advice i wish my ph d advisor had given me
15 pieces of advice i wish my ph d advisor had given meantiw
 
Note beamer
Note beamerNote beamer
Note beamerantiw
 
Open Cv 2005 Q4 Tutorial
Open Cv 2005 Q4 TutorialOpen Cv 2005 Q4 Tutorial
Open Cv 2005 Q4 Tutorialantiw
 

Plus de antiw (9)

Cvpr2010 open source vision software, intro and training part viii point clou...
Cvpr2010 open source vision software, intro and training part viii point clou...Cvpr2010 open source vision software, intro and training part viii point clou...
Cvpr2010 open source vision software, intro and training part viii point clou...
 
Cvpr2010 open source vision software, intro and training part vii point cloud...
Cvpr2010 open source vision software, intro and training part vii point cloud...Cvpr2010 open source vision software, intro and training part vii point cloud...
Cvpr2010 open source vision software, intro and training part vii point cloud...
 
graphical models for the Internet
graphical models for the Internetgraphical models for the Internet
graphical models for the Internet
 
15 pieces of advice i wish my ph d advisor had given me
15 pieces of advice i wish my ph d advisor had given me15 pieces of advice i wish my ph d advisor had given me
15 pieces of advice i wish my ph d advisor had given me
 
Randy pauschtimemanagement2007
Randy pauschtimemanagement2007Randy pauschtimemanagement2007
Randy pauschtimemanagement2007
 
Write a research paper howto - good presentation
Write a research paper   howto - good presentationWrite a research paper   howto - good presentation
Write a research paper howto - good presentation
 
15 pieces of advice i wish my ph d advisor had given me
15 pieces of advice i wish my ph d advisor had given me15 pieces of advice i wish my ph d advisor had given me
15 pieces of advice i wish my ph d advisor had given me
 
Note beamer
Note beamerNote beamer
Note beamer
 
Open Cv 2005 Q4 Tutorial
Open Cv 2005 Q4 TutorialOpen Cv 2005 Q4 Tutorial
Open Cv 2005 Q4 Tutorial
 

Dernier

Scientific Writing :Research Discourse
Scientific  Writing :Research  DiscourseScientific  Writing :Research  Discourse
Scientific Writing :Research DiscourseAnita GoswamiGiri
 
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...DhatriParmar
 
Mental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsMental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsPooky Knightsmith
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationdeepaannamalai16
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Projectjordimapav
 
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...DhatriParmar
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operationalssuser3e220a
 
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxBIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxSayali Powar
 
Measures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataMeasures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataBabyAnnMotar
 
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Association for Project Management
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxGrade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxkarenfajardo43
 
Multi Domain Alias In the Odoo 17 ERP Module
Multi Domain Alias In the Odoo 17 ERP ModuleMulti Domain Alias In the Odoo 17 ERP Module
Multi Domain Alias In the Odoo 17 ERP ModuleCeline George
 
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDhatriParmar
 
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptxMan or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptxDhatriParmar
 

Dernier (20)

Scientific Writing :Research Discourse
Scientific  Writing :Research  DiscourseScientific  Writing :Research  Discourse
Scientific Writing :Research Discourse
 
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
Blowin' in the Wind of Caste_ Bob Dylan's Song as a Catalyst for Social Justi...
 
Mental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsMental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young minds
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
prashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Professionprashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Profession
 
Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"
 
Congestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentationCongestive Cardiac Failure..presentation
Congestive Cardiac Failure..presentation
 
ClimART Action | eTwinning Project
ClimART Action    |    eTwinning ProjectClimART Action    |    eTwinning Project
ClimART Action | eTwinning Project
 
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
 
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptxINCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
INCLUSIVE EDUCATION PRACTICES FOR TEACHERS AND TRAINERS.pptx
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operational
 
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxBIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
 
Faculty Profile prashantha K EEE dept Sri Sairam college of Engineering
Faculty Profile prashantha K EEE dept Sri Sairam college of EngineeringFaculty Profile prashantha K EEE dept Sri Sairam college of Engineering
Faculty Profile prashantha K EEE dept Sri Sairam college of Engineering
 
Measures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataMeasures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped data
 
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptxGrade Three -ELLNA-REVIEWER-ENGLISH.pptx
Grade Three -ELLNA-REVIEWER-ENGLISH.pptx
 
Multi Domain Alias In the Odoo 17 ERP Module
Multi Domain Alias In the Odoo 17 ERP ModuleMulti Domain Alias In the Odoo 17 ERP Module
Multi Domain Alias In the Odoo 17 ERP Module
 
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
 
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptxMan or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
 

Recent Advances in Computer Vision

  • 1. Recent Advances in Computer Vision Ming-Hsuan Yang
  • 2. Computer vision • Holly grail – tell a story from an image
  • 3. History • “In the 1960s, almost no one realized that machine vision was difficult.” – David Marr, 1982 • Marvin Minsky asked Gerald Jay Sussman to “spend the summer linking a camera to a computer and getting the computer to describe what it saw” – Crevier, 1993 • 40+ years later, we are still working on this
  • 6. 1990s • Face detection • Particle filter • Pfinder • Normalized cut
  • 7. 2000s • SIFT – Mosaicing, panorama – Object recognition – Photo tourism, photosynth – Human detection • Adaboost-based face detector
  • 9. Conferences • CVPR – Computer Vision and Pattern Recognition, since 1983 – Annual, held in the US • ICCV – International Conference on Computer Vision, since 1987 – Every other year, alternating among 3 continents • ECCV – European Conference on Computer Vision, since 1990 – Every other year, held in Europe
  • 10. Conferences (cont’d) • ACCV – Asian Conference on Computer Vision • BMVC – British Machine Vision Conference • ICPR – International Conference on Pattern Recognition • SIGGRAPH • NIPS – Neural Information Processing Systems
  • 11. Conferences (cont’d) • MICCAI – Medical Image Computing and Computer-Assisted Intervention • ISBI – International Symposium on Biomedical Imaging • FG – IEEE Conference on Automatic Face and Gesture Recognition • ICCP, ICDR, ICVS, DAGM, CAIP, MVA, AAAI, IJCAI, ICML, ICRA, ICASSP, ICIP, SPIE, DCC, WACV, 3DPVT, ACM Multimedia, ICME, …
  • 12. Conference organization • General chairs: administration • Program chairs: handling papers • Area chairs: – Assign reviewers – Read reviews and rebuttals – Consolidation reports – Recommendation • Reviewers • Authors
  • 13. Review process • Submission • CVPR/ECCV/ICCV – Double blind review – Program chairs: assign papers to area chairs – Area chairs: assign papers to reviewers • Rebuttal
  • 14. Area chair meetings • 2-day meetings • Several panels • Each paper is reviewed by at least 2 area chairs • Buddy system • Area chairs make recommendations • Program chairs make final decisions
  • 15. Conference acceptance rates • ICCV/CVPR/ECCV: ~ 30% • ACCV (2009): ~ 30% • NIPS: ~ 30% • BMVC: ~ 40% • ICIP: ~ 45% • ICPR: ~ 55% • Disclaimer – low acceptance rate = high quality?
  • 16. CVPR (chart: Submission, Oral, Overall)
  • 17. ICCV (chart: Submission, Oral, Overall)
  • 18. ECCV (chart: Submission, Oral, Overall)
  • 19. Journals • PAMI – IEEE Transactions on Pattern Analysis and Machine Intelligence, since 1979 (impact factor: 5.96, #1 in all engineering and AI, top-ranked IEEE and CS journal) • IJCV – International Journal of Computer Vision, since 1988 (impact factor: 5.36, #2 in all engineering and AI) • CVIU – Computer Vision and Image Understanding, since 1972 (impact factor: 2.20)
  • 20. Journals (cont’d) • IVC – Image and Vision Computing • IEEE Transactions on Medical Imaging • TIP – IEEE Transactions on Image Processing • MVA – Machine Vision and Applications • PR – Pattern Recognition • TM – IEEE Transactions on Multimedia • …
  • 21. PAMI review process • Editor-in-chief (EIC) assigns papers to associate editors (AE) • AE assigns reviewers • First-round review: 3-6 months – Accept as is – Accept with minor revision – Major revision – Resubmit as new – Reject
  • 22. PAMI review process (cont’d) • Second-round review: 2-4 months – Accept as is – Accept with minor revision – Reject • EIC makes final decision • Overall turn-around time: 6 to 12 months • Rule of thumb: 30% additional work beyond a CVPR/ICCV/ECCV paper
  • 23. IJCV/CVIU review process • Similar formats • CVIU has roughly the same turn-around time as PAMI • IJCV tends to have longer turn-around time
  • 24. Journal acceptance rate • PAMI, IJCV: ~ 30% • CVIU: ~ 30%
  • 25. Tools • Google Scholar, CiteSeer • h-index • Software: Publish or Perish • Disclaimer: – h-index = significance? – # of citations = significance?
  • 26. How to get your papers rejected? • Refer to Jim Kajiya (SIGGRAPH 93 papers chair): How to get your SIGGRAPH paper rejected? • Do not – Pay attention to the review process – Put yourself in a reviewer’s perspective – Put the work in the right context – Carry out a sufficient number of experiments – Compare with state-of-the-art algorithms – Pay attention to writing
  • 27. Pay attention to review process • Learn how others/you can pick apart a paper • Learn from others’ mistakes • Learn how to write good papers • Learn what it takes to get a paper published
  • 28. Put yourself as a reviewer • What are the contributions? • Does it advance the science in the field? • Why should you accept this paper? • Is this paper a case study? • Is this paper interesting? • What is the audience? • Does anyone care about this work?
  • 29. Experimental validation • Common data set • Killer data set • Large scale experiment • Evaluation metric
  • 30. Compare with state of the art • Do your homework • Need to know what is out there • Need to show why one’s method outperforms others, and in what way? – speed? – accuracy? – easy to implement? – general application?
  • 31. Writing • Clear presentation • Terse • Careful about wording • Make claims with strong evidence
  • 32. Review form • Summary • Overall Rating – Definite accept, weakly accept, borderline, weakly reject, definite reject • Novelty – Very original, original, minor originality, has been done before • Importance/relevance – Of broad interest, interesting to a subarea, interesting only to a small number of attendees, out of CVPR scope
  • 33. Review form (cont’d) • Clarity of presentation – Reads very well, is clear enough, difficult to read, unreadable • Technical correctness – Definitely correct, probably correct but did not check completely, contains rectifiable errors, has major problems • Experimental validation – Excellent validation or N/A (a theoretical paper), limited but convincing, lacking in some aspects, insufficient validation • Additional comments • Reviewer’s name
  • 34. Challenging issues • Large scale • Unconstrained • Real-time • Robustness • Recover from failure – graceful degradation
  • 35. Some hot topics • Object recognition, categorization • Internet scale image search • Video search • Human detection • 3D human pose estimation • Computational photography • Scene understanding
  • 36. Some hot tools • Prior • Context • Sparse representation • Multiple instance learning • Online learning • Convex optimization • Constraint • Hashing
  • 37. Prior Torralba and Sinha ICCV 01
  • 38. Prior Heitz and Koller ECCV 08
  • 39. Prior Jia CVPR 08 He et al. CVPR 09
  • 40. Scene understanding Leibe et al. CVPR 07
  • 41. Image search Wu et al. CVPR 09
  • 42. Computational photography Johnson and Adelson CVPR 09