A Generic Probabilistic Graphical Model for Region-based Scene Interpretation

Michael Ying Yang

2015

Abstract

The task of semantic scene interpretation is to label the regions of an image and their relations into meaningful classes. Such task is a key ingredient to many computer vision applications, including object recognition, 3D reconstruction and robotic perception. The images of man-made scenes exhibit strong contextual dependencies in the form of the spatial and hierarchical structures. Modeling these structures is central for such interpretation task. Graphical models provide a consistent framework for the statistical modeling. Bayesian networks and random fields are two popular types of the graphical models, which are frequently used for capturing such contextual information. Our key contribution is the development of a generic statistical graphical model for scene interpretation, which seamlessly integrates different types of the image features, and the spatial structural information and the hierarchical structural information defined over the multi-scale image segmentation. It unifies the ideas of existing approaches, e. g. conditional random field and Bayesian network, which has a clear statistical interpretation as the MAP estimate of a multi-class labeling problem. We demonstrate experimentally the application of the proposed graphical model on the task of multi-class classification of building facade image regions.

Download


Paper Citation


in Harvard Style

Yang M. (2015). A Generic Probabilistic Graphical Model for Region-based Scene Interpretation . In Proceedings of the 10th International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2015) ISBN 978-989-758-090-1, pages 486-491. DOI: 10.5220/0005341004860491

in Bibtex Style

@conference{visapp15,
author={Michael Ying Yang},
title={A Generic Probabilistic Graphical Model for Region-based Scene Interpretation},
booktitle={Proceedings of the 10th International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2015)},
year={2015},
pages={486-491},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005341004860491},
isbn={978-989-758-090-1},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 10th International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2015)
TI - A Generic Probabilistic Graphical Model for Region-based Scene Interpretation
SN - 978-989-758-090-1
AU - Yang M.
PY - 2015
SP - 486
EP - 491
DO - 10.5220/0005341004860491