REAL-TIME ROAD SCENE CLASSIFICATION USING INFRARED IMAGES
David Forslund, Per Cronvall, Jacob Roll
2010
Abstract
This paper applies real-time scene classification to the two-class problem of separating city and rural scenes in images constructed from an infrared sensor mounted at the front of a vehicle. The 'Bag of Words' algorithm for image representation is evaluated and compared with two low-level methods, 'Edge Direction Histograms' and 'Invariant Moments'. A method for fast scene classification with the Bag of Words algorithm is proposed, using a grey-patch-based algorithm for image element representation and a modified floating search for visual word selection. It is also shown empirically that floating search for visual word selection outperforms the currently popular k-means clustering for small vocabulary sizes.
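As a rough illustration of the Bag of Words representation with grey patches mentioned in the abstract, the sketch below cuts a grey-level image into patches, assigns each patch to its nearest visual word, and returns a normalised word histogram. It is only a minimal sketch under stated assumptions, not the authors' implementation: the non-overlapping patch grid, the patch size, the squared Euclidean distance, and the function names (extract_patches, nearest_word, bow_histogram) are all hypothetical choices, and the vocabulary is taken as given rather than selected by floating search or k-means.

```python
from typing import List, Sequence

Image = List[List[float]]   # row-major grey-level image (assumed layout)
Patch = List[float]         # flattened grey patch


def extract_patches(image: Image, size: int) -> List[Patch]:
    """Cut the image into non-overlapping size x size grey patches."""
    rows, cols = len(image), len(image[0])
    patches = []
    for r in range(0, rows - size + 1, size):
        for c in range(0, cols - size + 1, size):
            patches.append([image[r + i][c + j]
                            for i in range(size) for j in range(size)])
    return patches


def nearest_word(patch: Patch, vocabulary: Sequence[Patch]) -> int:
    """Return the index of the visual word closest to the patch.

    Squared Euclidean distance is an assumption made for this sketch.
    """
    def dist(word: Patch) -> float:
        return sum((p - w) ** 2 for p, w in zip(patch, word))
    return min(range(len(vocabulary)), key=lambda k: dist(vocabulary[k]))


def bow_histogram(image: Image, vocabulary: Sequence[Patch],
                  size: int) -> List[float]:
    """Normalised histogram of visual-word occurrences for one image."""
    counts = [0] * len(vocabulary)
    patches = extract_patches(image, size)
    for patch in patches:
        counts[nearest_word(patch, vocabulary)] += 1
    total = len(patches)
    if total == 0:
        return [0.0] * len(vocabulary)
    return [c / total for c in counts]


if __name__ == "__main__":
    # Toy 4x4 image: left half dark, right half bright.
    image = [[0, 0, 10, 10] for _ in range(4)]
    # Hypothetical two-word vocabulary of 2x2 patches.
    vocabulary = [[0, 0, 0, 0], [10, 10, 10, 10]]
    print(bow_histogram(image, vocabulary, size=2))
```

The resulting histogram would then be passed to a classifier to separate city from rural scenes. With a small vocabulary the per-patch cost is one distance computation per word, which is why vocabulary size matters for real-time use.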
Paper Citation
in Harvard Style
Forslund D., Cronvall P. and Roll J. (2010). REAL-TIME ROAD SCENE CLASSIFICATION USING INFRARED IMAGES. In Proceedings of the International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2010) ISBN 978-989-674-029-0, pages 351-356. DOI: 10.5220/0002821503510356
in Bibtex Style
@conference{visapp10,
author={David Forslund and Per Cronvall and Jacob Roll},
title={REAL-TIME ROAD SCENE CLASSIFICATION USING INFRARED IMAGES},
booktitle={Proceedings of the International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2010)},
year={2010},
pages={351-356},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002821503510356},
isbn={978-989-674-029-0},
}
in EndNote Style
TY  - CONF 
JO  - Proceedings of the International Conference on Computer Vision Theory and Applications - Volume 2: VISAPP, (VISIGRAPP 2010)
TI  - REAL-TIME ROAD SCENE CLASSIFICATION USING INFRARED IMAGES
SN  - 978-989-674-029-0
AU  - Forslund D. 
AU  - Cronvall P. 
AU  - Roll J. 
PY  - 2010
SP  - 351
EP  - 356
DO  - 10.5220/0002821503510356