# SEMI INTERACTIVE METHOD FOR DATA MINING

### Lydia Boudjeloud-Assala, François Poulet

#### 2006

#### Abstract

Common visualization techniques for multidimensional data sets, such as parallel coordinates and scatter-plot matrices, do not scale well to large numbers of dimensions. A common approach to this problem is dimensionality selection. We present a new semi-interactive method for dimensionality selection that selects pertinent dimension subsets without losing information. Our cooperative approach combines automatic algorithms, interactive algorithms, and visualization methods. An evolutionary algorithm, guided by a new validity criterion, searches for optimal dimension subsets that represent the original data set without losing information for unsupervised tasks (clustering or outlier detection). A visualization method presents the results of the interactive evolutionary algorithm to the user, who can then take an active part in the search, making it more efficient and speeding up convergence. We have implemented our approach and applied it to real data to confirm that it effectively supports the user in exploring high-dimensional data sets, and to evaluate the visual data representation.
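To make the automatic part of such a pipeline concrete, the following is a minimal sketch of an evolutionary search over fixed-size dimension subsets. It is not the authors' implementation: the abstract does not define the validity criterion, so `fitness` uses summed per-dimension variance purely as a stand-in, and all function names, parameters (`pop_size`, `generations`, `seed`), and the crossover and mutation operators are assumptions chosen for illustration. The interactive step, in which the user inspects and steers candidate subsets, is omitted.

```python
import random


def variance(column):
    """Population variance of a list of numbers."""
    mean = sum(column) / len(column)
    return sum((x - mean) ** 2 for x in column) / len(column)


def fitness(subset, data):
    # Stand-in criterion (assumption): summed variance of the selected
    # dimensions. The paper's validity criterion is not given in the abstract.
    return sum(variance([row[d] for row in data]) for d in subset)


def crossover(a, b, k, rng):
    # Draw k dimensions from the union of both parents (hypothetical operator).
    pool = sorted(set(a) | set(b))
    return frozenset(rng.sample(pool, k))


def mutate(subset, n_dims, rng):
    # Drop one selected dimension, then add one that is currently unselected.
    child = set(subset)
    child.remove(rng.choice(sorted(child)))
    candidates = [d for d in range(n_dims) if d not in child]
    child.add(rng.choice(candidates))
    return frozenset(child)


def select_dimensions(data, k, pop_size=20, generations=30, seed=0):
    """Return a k-dimension subset with high stand-in fitness."""
    rng = random.Random(seed)
    n_dims = len(data[0])
    population = [frozenset(rng.sample(range(n_dims), k))
                  for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(population, key=lambda s: fitness(s, data),
                        reverse=True)
        parents = ranked[: pop_size // 2]  # elitism: keep the better half
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            children.append(mutate(crossover(a, b, k, rng), n_dims, rng))
        population = parents + children
    return max(population, key=lambda s: fitness(s, data))
```

Calling `select_dimensions(rows, k=2)` on a list of equal-length numeric rows returns a `frozenset` of column indices. In a semi-interactive setting, the ranked population of each generation could be shown to the user, who would keep or discard subsets before the next generation is bred; that hook is left out here.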


