Gaussian Process for Regression in Business Intelligence: A Fraud Detection Application

Bruno H. A. Pilon, Juan J. Murillo-Fuentes, João Paulo C. L. da Costa, Rafael T. de Sousa Júnior, Antonio M. R. Serrano

2015

Abstract

Business Intelligence (BI) systems are designed to provide information to support the decision making process in companies and governmental institutions. In this scenario, future events depend on the decisions and on the previous events. Therefore, the mathematical analysis of past data can be an important tool for the decision making process and to detect anomalies. Depending on the amount and the type of data to be analyzed, techniques from statistics, Machine Learning (ML), data mining and signal processing can be used to automate all or part of the system. In this paper, we propose to incorporate Gaussian Process for Regression (GPR) in BI systems in order to predict the data. As presented in this work, fraud detection is one important application of BI systems. We show that such application is possible with the use of GPR in the predictive stage, considering that GPR natively returns a full statistical description of the estimated variable, which can be used as a trigger measure to classify trusted and untrusted data. We validate our proposal with real world BI data provided by the Brazilian Federal Patrimony Department (SPU), regarding the monthly collection of federal taxes. In order to take into account the multidimensional structure of this specific data, we propose a pre-processing stage for reshaping the original time series into a bidimensional structure. The resulting algorithm, with GPR at its core, outperforms classical predictive schemes such as Artificial Neural Network (ANN).

Download


Paper Citation


in Harvard Style

Pilon B., J. Murillo-Fuentes J., Paulo C. L. da Costa J., T. de Sousa Júnior R. and M. R. Serrano A. (2015). Gaussian Process for Regression in Business Intelligence: A Fraud Detection Application . In Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 3: KMIS, (IC3K 2015) ISBN 978-989-758-158-8, pages 39-49. DOI: 10.5220/0005593000390049

in Bibtex Style

@conference{kmis15,
author={Bruno H. A. Pilon and Juan J. Murillo-Fuentes and João Paulo C. L. da Costa and Rafael T. de Sousa Júnior and Antonio M. R. Serrano},
title={Gaussian Process for Regression in Business Intelligence: A Fraud Detection Application},
booktitle={Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 3: KMIS, (IC3K 2015)},
year={2015},
pages={39-49},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0005593000390049},
isbn={978-989-758-158-8},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 7th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management - Volume 3: KMIS, (IC3K 2015)
TI - Gaussian Process for Regression in Business Intelligence: A Fraud Detection Application
SN - 978-989-758-158-8
AU - Pilon B.
AU - J. Murillo-Fuentes J.
AU - Paulo C. L. da Costa J.
AU - T. de Sousa Júnior R.
AU - M. R. Serrano A.
PY - 2015
SP - 39
EP - 49
DO - 10.5220/0005593000390049