Dynamic Subtitle Placement Considering the Region of Interest and Speaker Location

Wataru Akahori, Tatsunori Hirai, Shigeo Morishima

2017

Abstract

This paper presents a subtitle placement method that reduces unnecessary eye movements. Although methods that vary the position of subtitles have been discussed in a previous study, subtitles may overlap the region of interest (ROI). Therefore, we propose a dynamic subtitling method that utilizes eye-tracking data to avoid the subtitles from overlapping with important regions. The proposed method calculates the ROI based on the eye-tracking data of multiple viewers. By positioning subtitles immediately under the ROI, the subtitles do not overlap the ROI. Furthermore, we detect speakers in a scene based on audio and visual information to help viewers recognize the speaker by positioning subtitles near the speaker. Experimental results show that the proposed method enables viewers to watch the ROI and the subtitle in longer duration than traditional subtitles, and is effective in terms of enhancing the comfort and utility of the viewing experience.

Download


Paper Citation


in Harvard Style

Akahori W., Hirai T. and Morishima S. (2017). Dynamic Subtitle Placement Considering the Region of Interest and Speaker Location . In Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 6: VISAPP, (VISIGRAPP 2017) ISBN 978-989-758-227-1, pages 102-109. DOI: 10.5220/0006262201020109

in Bibtex Style

@conference{visapp17,
author={Wataru Akahori and Tatsunori Hirai and Shigeo Morishima},
title={Dynamic Subtitle Placement Considering the Region of Interest and Speaker Location},
booktitle={Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 6: VISAPP, (VISIGRAPP 2017)},
year={2017},
pages={102-109},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006262201020109},
isbn={978-989-758-227-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 12th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 6: VISAPP, (VISIGRAPP 2017)
TI - Dynamic Subtitle Placement Considering the Region of Interest and Speaker Location
SN - 978-989-758-227-1
AU - Akahori W.
AU - Hirai T.
AU - Morishima S.
PY - 2017
SP - 102
EP - 109
DO - 10.5220/0006262201020109