A Benchmark of Computational Models of Saliency to Predict Human Fixations in Videos

Shoaib Azam, Syed Omer Gilani, Moongu Jeon, Rehan Yousaf, Jeong Bae Kim

2016

Abstract

In many applications of computer graphics and design, robotics and computer vision, there is always a need to predict where human looks in the scene. However this is still a challenging task that how human visual system certainly works. A number of computational models have been designed using different approaches to estimate the human visual system. Most of these models have been tested on images and performance is calculated on this basis. A benchmark is made using images to see the immediate comparison between the models. Apart from that there is no benchmark on videos, to alleviate this problem we have a created a benchmark of six computational models implemented on 12 videos which have been viewed by 15 observers in a free viewing task. Further a weighted theory (both manual and automatic) is designed and implemented on videos using these six models which improved Area under the ROC. We have found that Graph Based Visual Saliency (GBVS) and Random Centre Surround Models have outperformed the other models.

Download


Paper Citation


in Harvard Style

Azam S., Gilani S., Jeon M., Yousaf R. and Kim J. (2016). A Benchmark of Computational Models of Saliency to Predict Human Fixations in Videos . In Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, (VISIGRAPP 2016) ISBN 978-989-758-175-5, pages 134-142. DOI: 10.5220/0005678701340142

in Bibtex Style

@conference{visapp16,
author={Shoaib Azam and Syed Omer Gilani and Moongu Jeon and Rehan Yousaf and Jeong Bae Kim},
title={A Benchmark of Computational Models of Saliency to Predict Human Fixations in Videos},
booktitle={Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, (VISIGRAPP 2016)},
year={2016},
pages={134-142},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005678701340142},
isbn={978-989-758-175-5},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 11th Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 4: VISAPP, (VISIGRAPP 2016)
TI - A Benchmark of Computational Models of Saliency to Predict Human Fixations in Videos
SN - 978-989-758-175-5
AU - Azam S.
AU - Gilani S.
AU - Jeon M.
AU - Yousaf R.
AU - Kim J.
PY - 2016
SP - 134
EP - 142
DO - 10.5220/0005678701340142