SVM-based Video Segmentation and Annotation of Lectures and Conferences

Stefano Masneri, Oliver Schreer

2014

Abstract

This paper presents a classification system for video lectures and conferences based on Support Vector Machines (SVM). The aim is to classify videos into four different classes (talk, presentation, blackboard, mix). On top of this, the system further analyses presentation segments to detect slide transitions, animations and dynamic content such as video inside the presentation. The developed approach uses various colour and facial features from two different datasets of several hundred hours of video to train an SVM classifier. The system performs the classification on frame-by-frame basis and does not require pre-computed shotcut information. To avoid over-segmentation and to take advantage of the temporal correlation of succeeding frames, the results are merged every 50 frames into a single class. The presented results prove the robustness and accuracy of the algorithm. Given the generality of the approach, the system can be easily adapted to other lecture datasets.

Download


Paper Citation


in Harvard Style

Masneri S. and Schreer O. (2014). SVM-based Video Segmentation and Annotation of Lectures and Conferences . In Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2014) ISBN 978-989-758-004-8, pages 425-432. DOI: 10.5220/0004686004250432

in Bibtex Style

@conference{visapp14,
author={Stefano Masneri and Oliver Schreer},
title={SVM-based Video Segmentation and Annotation of Lectures and Conferences},
booktitle={Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2014)},
year={2014},
pages={425-432},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004686004250432},
isbn={978-989-758-004-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 9th International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2014)
TI - SVM-based Video Segmentation and Annotation of Lectures and Conferences
SN - 978-989-758-004-8
AU - Masneri S.
AU - Schreer O.
PY - 2014
SP - 425
EP - 432
DO - 10.5220/0004686004250432