SlideShare a Scribd company logo
1 of 43
Toward Accurate and Robust Cross-
Ratio based Gaze Trackers Through
Learning from Simulation
Jia-Bin Huang1, Qin Cai2, Zicheng Liu2,
Narendra Ahuja1, and Zhengyou Zhang2
21
Why?
• Multimodal natural interaction
• Gaze + touch, gesture, speech
If I were an iron man…
Why?
• Understanding user attention and intention
Why?
• Understanding interaction among people
Before sunrise
1995
Sclera
Limbus
Pupil
Iris
Glint
Cornea (like a spherical mirror)
Mike @ Monster University
Geometric Model of an Eye
Gaze Estimation using
Pupil Center and Corneal Reflections
Interpolation-
based
Cross-Ratio
based
Model-based
Model-based Gaze Estimation
• Detailed geometric modeling between light sources, corneal, and
camera [Guestrin and Eizenman, 2006]
• Pros
• Accurate (reported performance < 1o)
• 3D gaze direction
• Head pose invariant
• Cons
• Need careful hardware calibration
Figure from [Guestrin and Eizenman, 2006]
Interpolation-based Gaze Estimation
• Learn polynomial regression from subject-dependent calibration
• Directly map from normalized to Point of Regard (2D PoR)
[Cerrolaza et al., 2008]
• Pros
• Simple to implement
• No need for hardware calibration
• Cons
• Head pose sensitive
Cross-Ratio based Gaze Estimation
• Gaze estimation by exploiting invariance of a plane
projectivity [Yoo et al. 2002]
• Pros
• Simple to implement
• No need for hardware calibration
• Head pose invariant
• Cons
• Large subject dependent bias occur
because simplifying assumptions Figure from [Coutinho and Morimoto 2012]
The Basic Form of Cross-Ratio Method
Image
Corneal
Display
Two Sources of Errors [Kang et al. 2008]
• Angular deviation of visual axis and optical axis
• Virtual image of pupil center is not coplanar with corneal
reflections
Improve Accuracy for Stationary Head
CR [Yoo-2002]
CR-Multi [Yoo-2005]
CR-HOM [Kang-2007]
CR-HOMN [Hansen-2010]
CR-DV [Coutinho-2006]
No correction
Scale correction
Scale and translation correction
Homography correction
Homography correction
+ Residual interpolation
Improve Robustness for Head
Movements
No adaptation Adapt to eye
depth variations
Adapt to eye movements
Assumptions
1) weak perspective
2) fixed eye parameters.
CR [Yoo-2002] CR-DD [Coutinho and
Morimoto 2010]
PL-CR [Coutinho and
Morimoto 2012]
Accuracy of Gaze Prediction for
Stationary Head
Robustness to Head
Movement
No adaptation
CR [Yoo-2002]
CR-Multi [Yoo-2005]
CR-DV [Coutinho-2006]
CR-HOM [Kang-2007]
CR-HOMN [Hansen-2010]
No correction
Scale correction
Scale and translation
correction
Homography correction
Homography correction +
Residual interpolation
CR-DD [Coutinho-2010]
Adapt to eye depth
variations only
PL-CR [Coutinho-2012]
Adapt to eye movements
Assumptions
1) weak perspective
2) fixed eye parameters.
Adapt to eye movements
No assumptions on
1) weak perspective
2) fixed eye parameters
This paper
How? The Main Idea
• Build upon the homography normalization method [Hansen et al
2010]
• Improving accuracy and robustness simultaneously by introducing the
Adaptive Homography Mapping
Adaptive Homograph Mapping
• Two types of predictor variables
• : capture the head movements relative to the calibration position
• Affine transformation between the glints quadrilateral
• : capture gaze direction for spatially-varying mapping
• Pupil center position in the normalized space
• : polynomial regression of degree two with parameter
Training Adaptive Homography Mapping
• Exploit large amount of simulated data
• the set of sampled head position in 3D
• the set of calibration target index in the screen space
• Objective function
Minimizing the Objective Function
• Minimize an algebraic error at each sampled head position
• Use the solution from algebraic error minimization as initialization
Minimize the re-projection errors using the Levenberg-Marquardt
algorithm
Visualize the Training Process
• Eye gaze prediction results using the bias-correcting homography
computed at the calibration position
RMSE Error Comparisons Using
Different Training Models
• Differences are small in
linear regression
• Linear model is not
sufficiently complex
• Compensation using both
predictor variables achieve
the lowest errors
Linear Regression
Linear Regression
Adding the normalized pupil center
corrected spatially-varying errors
Quadratic Regression
Quadratic Regression
Experimental Results – Synthetic data
• Setup
• Screen size 400mm x 300mm
• Four IR lights
• Camera 13mm focal length, placed slighted below the screen border
(FoV~31 degree)
• Calibration position and eye parameters
• Eye parameters from [Guestrin and Eizenman, 2006]
Stationary Head
Varying corneal radius
Stationary Head
Varying pupil-corneal distance
Stationary Head
Varying (horizontal) angle between optical/visual axis
Stationary Head
Varying (vertical) angle between optical/visual axis
Head Movements Parallel to the Screen
Head Movement along Depth Variation
Tested at Another Head Position
Noise Sensitivity Analysis
Effect of Sensor Resolution (at
calibration)
Focal Length = 13 mm Focal Length = 35 mm
Effect of Sensor Resolution (at new
position)
Focal Length = 13 mm Focal Length = 35 mm
Real Data Evaluation –
Programmable Hardware Setup
Off-axis IR light sources
Stereo camera
(We use one only in this work)
On-axis ring light
Real Data Evaluation – Feature Detection
• Detecting glints and pupil center
Averaged Gaze Estimation Error
at calibration position
Averaged Gaze Estimation Error
Calibrated at 600mm from screenCalibrated at 500mm from screen
Conclusions
• A learning-based approach for simultaneously compensating (1)
spatially varying errors and (2) errors induced from head movements
• Generalize previous work on compensating head movements using
glint geometric transformation [Cerroaza et al. 2012] [Coutinho and
Morimoto 2012]
• Leveraging simulated data avoid the tedious data collection
Future Work
• Consider subject-dependent parameters in the learning and inference
the adaptive homography adaptation
• Integrate binocular information, please see poster
Zhengyou Zhang, Qin Cai, Improving Cross-Ratio-Based Eye Tracking
Techniques by Leveraging the Binocular Fixation Constraint
• Extensive user study using a physical setup
Comments or questions?
Jia-Bin Huang
jbhuang1@Illinois.edu
Narendra Ahuja
n-ahuja@Illinois.edu
Zhengyou Zhang
zhang@microsoft.com
Qin Cai
qincai@microsoft.com
Zicheng Liu
zliu@microsoft.com

More Related Content

Viewers also liked

Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)Jia-Bin Huang
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Jia-Bin Huang
 
Linear Algebra and Matlab tutorial
Linear Algebra and Matlab tutorialLinear Algebra and Matlab tutorial
Linear Algebra and Matlab tutorialJia-Bin Huang
 
Writing Fast MATLAB Code
Writing Fast MATLAB CodeWriting Fast MATLAB Code
Writing Fast MATLAB CodeJia-Bin Huang
 
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015Jia-Bin Huang
 
Three Reasons to Join FVE at uiuc
Three Reasons to Join FVE at uiucThree Reasons to Join FVE at uiuc
Three Reasons to Join FVE at uiucJia-Bin Huang
 
Applying for Graduate School in S.T.E.M.
Applying for Graduate School in S.T.E.M.Applying for Graduate School in S.T.E.M.
Applying for Graduate School in S.T.E.M.Jia-Bin Huang
 
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)Jia-Bin Huang
 
Jia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum VitaeJia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum VitaeJia-Bin Huang
 
UIUC CS 498 - Computational Photography - Final project presentation
UIUC CS 498 - Computational Photography - Final project presentation UIUC CS 498 - Computational Photography - Final project presentation
UIUC CS 498 - Computational Photography - Final project presentation Jia-Bin Huang
 
美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)
美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)
美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)Jia-Bin Huang
 
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)Jia-Bin Huang
 
What Makes a Creative Photograph?
What Makes a Creative Photograph?What Makes a Creative Photograph?
What Makes a Creative Photograph?Jia-Bin Huang
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash CourseJia-Bin Huang
 
How to come up with new research ideas
How to come up with new research ideasHow to come up with new research ideas
How to come up with new research ideasJia-Bin Huang
 
Research 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeXResearch 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeXJia-Bin Huang
 
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Jia-Bin Huang
 
The future is a cloud hosted retail store
The future is a cloud hosted retail storeThe future is a cloud hosted retail store
The future is a cloud hosted retail store24Symbols
 
Spanish american war
Spanish american warSpanish american war
Spanish american warJaZzY25
 
Caber potensi besar
Caber potensi besarCaber potensi besar
Caber potensi besarbogasi
 

Viewers also liked (20)

Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
Image Completion using Planar Structure Guidance (SIGGRAPH 2014)
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)
 
Linear Algebra and Matlab tutorial
Linear Algebra and Matlab tutorialLinear Algebra and Matlab tutorial
Linear Algebra and Matlab tutorial
 
Writing Fast MATLAB Code
Writing Fast MATLAB CodeWriting Fast MATLAB Code
Writing Fast MATLAB Code
 
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015Lecture 29 Convolutional Neural Networks -  Computer Vision Spring2015
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
 
Three Reasons to Join FVE at uiuc
Three Reasons to Join FVE at uiucThree Reasons to Join FVE at uiuc
Three Reasons to Join FVE at uiuc
 
Applying for Graduate School in S.T.E.M.
Applying for Graduate School in S.T.E.M.Applying for Graduate School in S.T.E.M.
Applying for Graduate School in S.T.E.M.
 
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
A Physical Approach to Moving Cast Shadow Detection (ICASSP 2009)
 
Jia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum VitaeJia-Bin Huang's Curriculum Vitae
Jia-Bin Huang's Curriculum Vitae
 
UIUC CS 498 - Computational Photography - Final project presentation
UIUC CS 498 - Computational Photography - Final project presentation UIUC CS 498 - Computational Photography - Final project presentation
UIUC CS 498 - Computational Photography - Final project presentation
 
美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)
美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)
美國研究所申請流程 (A Guide for Applying Graduate Schools in USA)
 
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
Single Image Super-Resolution from Transformed Self-Exemplars (CVPR 2015)
 
What Makes a Creative Photograph?
What Makes a Creative Photograph?What Makes a Creative Photograph?
What Makes a Creative Photograph?
 
Computer Vision Crash Course
Computer Vision Crash CourseComputer Vision Crash Course
Computer Vision Crash Course
 
How to come up with new research ideas
How to come up with new research ideasHow to come up with new research ideas
How to come up with new research ideas
 
Research 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeXResearch 101 - Paper Writing with LaTeX
Research 101 - Paper Writing with LaTeX
 
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
 
The future is a cloud hosted retail store
The future is a cloud hosted retail storeThe future is a cloud hosted retail store
The future is a cloud hosted retail store
 
Spanish american war
Spanish american warSpanish american war
Spanish american war
 
Caber potensi besar
Caber potensi besarCaber potensi besar
Caber potensi besar
 

Similar to Accurate and Robust Cross-Ratio Gaze Tracking Through Learning from Simulation

Coneal topography instrumentation, techniques, procedures, limitations, advan...
Coneal topography instrumentation, techniques, procedures, limitations, advan...Coneal topography instrumentation, techniques, procedures, limitations, advan...
Coneal topography instrumentation, techniques, procedures, limitations, advan...Raju Kaiti
 
Coutinho A Depth Compensation Method For Cross Ratio Based Eye Tracking
Coutinho A Depth Compensation Method For Cross Ratio Based Eye TrackingCoutinho A Depth Compensation Method For Cross Ratio Based Eye Tracking
Coutinho A Depth Compensation Method For Cross Ratio Based Eye TrackingKalle
 
Magnification and it's clinical uses
Magnification and it's clinical usesMagnification and it's clinical uses
Magnification and it's clinical usesRaju Kaiti
 
Progressive Addition Lens
Progressive Addition LensProgressive Addition Lens
Progressive Addition LensSarmila Acharya
 
Getting More Precision in Videoscope Measurements While Taking Larger Measure...
Getting More Precision in Videoscope Measurements While Taking Larger Measure...Getting More Precision in Videoscope Measurements While Taking Larger Measure...
Getting More Precision in Videoscope Measurements While Taking Larger Measure...Olympus IMS
 
Google Glass, The META and Co. - How to calibrate your Optical See-Through He...
Google Glass, The META and Co. - How to calibrate your Optical See-Through He...Google Glass, The META and Co. - How to calibrate your Optical See-Through He...
Google Glass, The META and Co. - How to calibrate your Optical See-Through He...Jens Grubert
 
Peripheral refraction in myopia
Peripheral refraction in myopiaPeripheral refraction in myopia
Peripheral refraction in myopiaHossein Mirzaie
 
Lec11 single view-converted
Lec11 single view-convertedLec11 single view-converted
Lec11 single view-convertedBaliThorat1
 
Stereoscopic field of view guidelines
Stereoscopic field of view guidelinesStereoscopic field of view guidelines
Stereoscopic field of view guidelinesBoris Greenberg
 
Prism & it's uses
Prism & it's uses Prism & it's uses
Prism & it's uses Raju Kaiti
 

Similar to Accurate and Robust Cross-Ratio Gaze Tracking Through Learning from Simulation (20)

Nidek CEM-530
Nidek CEM-530Nidek CEM-530
Nidek CEM-530
 
Coneal topography instrumentation, techniques, procedures, limitations, advan...
Coneal topography instrumentation, techniques, procedures, limitations, advan...Coneal topography instrumentation, techniques, procedures, limitations, advan...
Coneal topography instrumentation, techniques, procedures, limitations, advan...
 
Schematic eye and cardinal points
Schematic eye and cardinal pointsSchematic eye and cardinal points
Schematic eye and cardinal points
 
Keratometer
KeratometerKeratometer
Keratometer
 
Coutinho A Depth Compensation Method For Cross Ratio Based Eye Tracking
Coutinho A Depth Compensation Method For Cross Ratio Based Eye TrackingCoutinho A Depth Compensation Method For Cross Ratio Based Eye Tracking
Coutinho A Depth Compensation Method For Cross Ratio Based Eye Tracking
 
Magnification and it's clinical uses
Magnification and it's clinical usesMagnification and it's clinical uses
Magnification and it's clinical uses
 
Stereoscopy in Dentistry
Stereoscopy in DentistryStereoscopy in Dentistry
Stereoscopy in Dentistry
 
Corneal topography
Corneal topographyCorneal topography
Corneal topography
 
Evaluation of 3D-Projection Image Capture
Evaluation of 3D-Projection Image CaptureEvaluation of 3D-Projection Image Capture
Evaluation of 3D-Projection Image Capture
 
Aberrometry
AberrometryAberrometry
Aberrometry
 
Progressive Addition Lens
Progressive Addition LensProgressive Addition Lens
Progressive Addition Lens
 
Getting More Precision in Videoscope Measurements While Taking Larger Measure...
Getting More Precision in Videoscope Measurements While Taking Larger Measure...Getting More Precision in Videoscope Measurements While Taking Larger Measure...
Getting More Precision in Videoscope Measurements While Taking Larger Measure...
 
Google Glass, The META and Co. - How to calibrate your Optical See-Through He...
Google Glass, The META and Co. - How to calibrate your Optical See-Through He...Google Glass, The META and Co. - How to calibrate your Optical See-Through He...
Google Glass, The META and Co. - How to calibrate your Optical See-Through He...
 
Peripheral refraction in myopia
Peripheral refraction in myopiaPeripheral refraction in myopia
Peripheral refraction in myopia
 
Lec11 single view-converted
Lec11 single view-convertedLec11 single view-converted
Lec11 single view-converted
 
Ph.D. Research
Ph.D. ResearchPh.D. Research
Ph.D. Research
 
Stereoscopic field of view guidelines
Stereoscopic field of view guidelinesStereoscopic field of view guidelines
Stereoscopic field of view guidelines
 
Presentación Tesis 08022016
Presentación Tesis 08022016Presentación Tesis 08022016
Presentación Tesis 08022016
 
PPT s02-machine vision-s2
PPT s02-machine vision-s2PPT s02-machine vision-s2
PPT s02-machine vision-s2
 
Prism & it's uses
Prism & it's uses Prism & it's uses
Prism & it's uses
 

More from Jia-Bin Huang

How to write a clear paper
How to write a clear paperHow to write a clear paper
How to write a clear paperJia-Bin Huang
 
Real-time Face Detection and Recognition
Real-time Face Detection and RecognitionReal-time Face Detection and Recognition
Real-time Face Detection and RecognitionJia-Bin Huang
 
Pose aware online visual tracking
Pose aware online visual trackingPose aware online visual tracking
Pose aware online visual trackingJia-Bin Huang
 
Face Expression Enhancement
Face Expression EnhancementFace Expression Enhancement
Face Expression EnhancementJia-Bin Huang
 
Image Smoothing for Structure Extraction
Image Smoothing for Structure ExtractionImage Smoothing for Structure Extraction
Image Smoothing for Structure ExtractionJia-Bin Huang
 
Static and Dynamic Hand Gesture Recognition
Static and Dynamic Hand Gesture RecognitionStatic and Dynamic Hand Gesture Recognition
Static and Dynamic Hand Gesture RecognitionJia-Bin Huang
 
Real-Time Face Detection, Tracking, and Attributes Recognition
Real-Time Face Detection, Tracking, and Attributes RecognitionReal-Time Face Detection, Tracking, and Attributes Recognition
Real-Time Face Detection, Tracking, and Attributes RecognitionJia-Bin Huang
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Jia-Bin Huang
 
Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Information Preserving Color Transformation for Protanopia and Deuteranopia (...Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Information Preserving Color Transformation for Protanopia and Deuteranopia (...Jia-Bin Huang
 
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Jia-Bin Huang
 
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Jia-Bin Huang
 

More from Jia-Bin Huang (11)

How to write a clear paper
How to write a clear paperHow to write a clear paper
How to write a clear paper
 
Real-time Face Detection and Recognition
Real-time Face Detection and RecognitionReal-time Face Detection and Recognition
Real-time Face Detection and Recognition
 
Pose aware online visual tracking
Pose aware online visual trackingPose aware online visual tracking
Pose aware online visual tracking
 
Face Expression Enhancement
Face Expression EnhancementFace Expression Enhancement
Face Expression Enhancement
 
Image Smoothing for Structure Extraction
Image Smoothing for Structure ExtractionImage Smoothing for Structure Extraction
Image Smoothing for Structure Extraction
 
Static and Dynamic Hand Gesture Recognition
Static and Dynamic Hand Gesture RecognitionStatic and Dynamic Hand Gesture Recognition
Static and Dynamic Hand Gesture Recognition
 
Real-Time Face Detection, Tracking, and Attributes Recognition
Real-Time Face Detection, Tracking, and Attributes RecognitionReal-Time Face Detection, Tracking, and Attributes Recognition
Real-Time Face Detection, Tracking, and Attributes Recognition
 
Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)Estimating Human Pose from Occluded Images (ACCV 2009)
Estimating Human Pose from Occluded Images (ACCV 2009)
 
Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Information Preserving Color Transformation for Protanopia and Deuteranopia (...Information Preserving Color Transformation for Protanopia and Deuteranopia (...
Information Preserving Color Transformation for Protanopia and Deuteranopia (...
 
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
Enhancing Color Representation for the Color Vision Impaired (CVAVI 2008)
 
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)Learning Moving Cast Shadows for Foreground Detection (VS 2008)
Learning Moving Cast Shadows for Foreground Detection (VS 2008)
 

Recently uploaded

Electronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfElectronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfme23b1001
 
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsyncWhy does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsyncssuser2ae721
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxbritheesh05
 
Comparative Analysis of Text Summarization Techniques
Comparative Analysis of Text Summarization TechniquesComparative Analysis of Text Summarization Techniques
Comparative Analysis of Text Summarization Techniquesugginaramesh
 
Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleAlluxio, Inc.
 
An introduction to Semiconductor and its types.pptx
An introduction to Semiconductor and its types.pptxAn introduction to Semiconductor and its types.pptx
An introduction to Semiconductor and its types.pptxPurva Nikam
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxKartikeyaDwivedi3
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfAsst.prof M.Gokilavani
 
Risk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfRisk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfROCENODodongVILLACER
 
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)dollysharma2066
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
8251 universal synchronous asynchronous receiver transmitter
8251 universal synchronous asynchronous receiver transmitter8251 universal synchronous asynchronous receiver transmitter
8251 universal synchronous asynchronous receiver transmitterShivangiSharma879191
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptSAURABHKUMAR892774
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort servicejennyeacort
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxPoojaBan
 
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfCCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfAsst.prof M.Gokilavani
 

Recently uploaded (20)

Electronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfElectronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdf
 
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsyncWhy does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
 
Artificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptxArtificial-Intelligence-in-Electronics (K).pptx
Artificial-Intelligence-in-Electronics (K).pptx
 
Comparative Analysis of Text Summarization Techniques
Comparative Analysis of Text Summarization TechniquesComparative Analysis of Text Summarization Techniques
Comparative Analysis of Text Summarization Techniques
 
Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at Scale
 
An introduction to Semiconductor and its types.pptx
An introduction to Semiconductor and its types.pptxAn introduction to Semiconductor and its types.pptx
An introduction to Semiconductor and its types.pptx
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptx
 
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
 
Risk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfRisk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdf
 
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
8251 universal synchronous asynchronous receiver transmitter
8251 universal synchronous asynchronous receiver transmitter8251 universal synchronous asynchronous receiver transmitter
8251 universal synchronous asynchronous receiver transmitter
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.ppt
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
 
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptx
 
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfCCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
 
Design and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdfDesign and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdf
 

Accurate and Robust Cross-Ratio Gaze Tracking Through Learning from Simulation

  • 1. Toward Accurate and Robust Cross- Ratio based Gaze Trackers Through Learning from Simulation Jia-Bin Huang1, Qin Cai2, Zicheng Liu2, Narendra Ahuja1, and Zhengyou Zhang2 21
  • 2. Why? • Multimodal natural interaction • Gaze + touch, gesture, speech If I were an iron man…
  • 3. Why? • Understanding user attention and intention
  • 4. Why? • Understanding interaction among people Before sunrise 1995
  • 5. Sclera Limbus Pupil Iris Glint Cornea (like a spherical mirror) Mike @ Monster University
  • 7. Gaze Estimation using Pupil Center and Corneal Reflections Interpolation- based Cross-Ratio based Model-based
  • 8. Model-based Gaze Estimation • Detailed geometric modeling between light sources, corneal, and camera [Guestrin and Eizenman, 2006] • Pros • Accurate (reported performance < 1o) • 3D gaze direction • Head pose invariant • Cons • Need careful hardware calibration Figure from [Guestrin and Eizenman, 2006]
  • 9. Interpolation-based Gaze Estimation • Learn polynomial regression from subject-dependent calibration • Directly map from normalized to Point of Regard (2D PoR) [Cerrolaza et al., 2008] • Pros • Simple to implement • No need for hardware calibration • Cons • Head pose sensitive
  • 10. Cross-Ratio based Gaze Estimation • Gaze estimation by exploiting invariance of a plane projectivity [Yoo et al. 2002] • Pros • Simple to implement • No need for hardware calibration • Head pose invariant • Cons • Large subject dependent bias occur because simplifying assumptions Figure from [Coutinho and Morimoto 2012]
  • 11. The Basic Form of Cross-Ratio Method Image Corneal Display
  • 12. Two Sources of Errors [Kang et al. 2008] • Angular deviation of visual axis and optical axis • Virtual image of pupil center is not coplanar with corneal reflections
  • 13. Improve Accuracy for Stationary Head CR [Yoo-2002] CR-Multi [Yoo-2005] CR-HOM [Kang-2007] CR-HOMN [Hansen-2010] CR-DV [Coutinho-2006] No correction Scale correction Scale and translation correction Homography correction Homography correction + Residual interpolation
  • 14. Improve Robustness for Head Movements No adaptation Adapt to eye depth variations Adapt to eye movements Assumptions 1) weak perspective 2) fixed eye parameters. CR [Yoo-2002] CR-DD [Coutinho and Morimoto 2010] PL-CR [Coutinho and Morimoto 2012]
  • 15. Accuracy of Gaze Prediction for Stationary Head Robustness to Head Movement No adaptation CR [Yoo-2002] CR-Multi [Yoo-2005] CR-DV [Coutinho-2006] CR-HOM [Kang-2007] CR-HOMN [Hansen-2010] No correction Scale correction Scale and translation correction Homography correction Homography correction + Residual interpolation CR-DD [Coutinho-2010] Adapt to eye depth variations only PL-CR [Coutinho-2012] Adapt to eye movements Assumptions 1) weak perspective 2) fixed eye parameters. Adapt to eye movements No assumptions on 1) weak perspective 2) fixed eye parameters This paper
  • 16. How? The Main Idea • Build upon the homography normalization method [Hansen et al 2010] • Improving accuracy and robustness simultaneously by introducing the Adaptive Homography Mapping
  • 17. Adaptive Homograph Mapping • Two types of predictor variables • : capture the head movements relative to the calibration position • Affine transformation between the glints quadrilateral • : capture gaze direction for spatially-varying mapping • Pupil center position in the normalized space • : polynomial regression of degree two with parameter
  • 18. Training Adaptive Homography Mapping • Exploit large amount of simulated data • the set of sampled head position in 3D • the set of calibration target index in the screen space • Objective function
  • 19. Minimizing the Objective Function • Minimize an algebraic error at each sampled head position • Use the solution from algebraic error minimization as initialization Minimize the re-projection errors using the Levenberg-Marquardt algorithm
  • 20. Visualize the Training Process • Eye gaze prediction results using the bias-correcting homography computed at the calibration position
  • 21. RMSE Error Comparisons Using Different Training Models • Differences are small in linear regression • Linear model is not sufficiently complex • Compensation using both predictor variables achieve the lowest errors
  • 23. Linear Regression Adding the normalized pupil center corrected spatially-varying errors
  • 26. Experimental Results – Synthetic data • Setup • Screen size 400mm x 300mm • Four IR lights • Camera 13mm focal length, placed slighted below the screen border (FoV~31 degree) • Calibration position and eye parameters • Eye parameters from [Guestrin and Eizenman, 2006]
  • 29. Stationary Head Varying (horizontal) angle between optical/visual axis
  • 30. Stationary Head Varying (vertical) angle between optical/visual axis
  • 31. Head Movements Parallel to the Screen
  • 32. Head Movement along Depth Variation
  • 33. Tested at Another Head Position
  • 35. Effect of Sensor Resolution (at calibration) Focal Length = 13 mm Focal Length = 35 mm
  • 36. Effect of Sensor Resolution (at new position) Focal Length = 13 mm Focal Length = 35 mm
  • 37. Real Data Evaluation – Programmable Hardware Setup Off-axis IR light sources Stereo camera (We use one only in this work) On-axis ring light
  • 38. Real Data Evaluation – Feature Detection • Detecting glints and pupil center
  • 39. Averaged Gaze Estimation Error at calibration position
  • 40. Averaged Gaze Estimation Error Calibrated at 600mm from screenCalibrated at 500mm from screen
  • 41. Conclusions • A learning-based approach for simultaneously compensating (1) spatially varying errors and (2) errors induced from head movements • Generalize previous work on compensating head movements using glint geometric transformation [Cerroaza et al. 2012] [Coutinho and Morimoto 2012] • Leveraging simulated data avoid the tedious data collection
  • 42. Future Work • Consider subject-dependent parameters in the learning and inference the adaptive homography adaptation • Integrate binocular information, please see poster Zhengyou Zhang, Qin Cai, Improving Cross-Ratio-Based Eye Tracking Techniques by Leveraging the Binocular Fixation Constraint • Extensive user study using a physical setup
  • 43. Comments or questions? Jia-Bin Huang jbhuang1@Illinois.edu Narendra Ahuja n-ahuja@Illinois.edu Zhengyou Zhang zhang@microsoft.com Qin Cai qincai@microsoft.com Zicheng Liu zliu@microsoft.com