Caractérisation, construction et dénombrement des ultramétriques supérieures minimales
In this paper, we consider the l₁-clustering problem for a finite data-point set which should be partitioned into k disjoint nonempty subsets. In that case, the objective function does not have to be either convex or differentiable, and generally it may have many local or global minima. Therefore, it becomes a complex global optimization problem. A method of searching for a locally optimal solution is proposed in the paper, the convergence of the corresponding iterative process is proved and the...
The problem of assessing the reliability of clusters patients identified by clustering algorithms is crucial to estimate the significance of subclasses of diseases detectable at bio-molecular level, and more in general to support bio-medical discovery of patterns in gene expression data. In this paper we present an experimental analysis of the reliability of clusters discovered in lung tumor patients using DNA microarray data. In particular we investigate if subclasses of lung adenocarcinoma...
The relations between automatic clustering methods and inferentiel statistical models have mostely been studied when the data involves only one set. We propose to study these relations in the case of data involving two sets. We shall look at cross clustering methods as suggested by Govaert [6]; we show that these methods, like the simple clustering methods, can be considered as a clustering approach of a mixture model. We introduce the notion of crossed mixture from a concret example and...
A new supervised classification algorithm of a heavily distorted pattern (shape) obtained from noisy observations of nonstationary signals is proposed in the paper. Based on the Gabor transform of 1-D non-stationary signals, 2-D shapes of signals are formulated and the classification formula is developed using the pattern matching idea, which is the simplest case of a pattern recognition task. In the pattern matching problem, where a set of known patterns creates predefined classes, classification...
The paper deals with two Mises distributions on the circle with unknown mean directions and a common concentration parameter that is known. The likelihood rule and the plug-in rule are examined. For the statistic of the plug-in rule, the moment generating function is given and a method of obtaining the moments is proposed.
Many image segmentation algorithms have been proposed to partition an image into foreground regions of interest and background regions to be ignored. These algorithms use pixel intensities to partition the image, so it should be good practice to choose an appropriate background color as different as possible from the foreground one. In the case of a unique digitizing operation the user can make the choice of background color by himself in order to obtain a good result in the segmentation process,...
AMS Subj. Classification: 62P10, 62H30, 68T01This study examines different approaches of binary classification applied to the prob- lem of making distinction between former and current smokers. Prediction is based on data collected in national survey performed by the National center for health statistics of America in 2000. The process consists of two essential parts. The first one determines which attributes are relevant to smokers status, by using methods like basic genetic algorithm and different evaluation...
This paper proposes new classifiers under the assumption of multivariate normality for multivariate repeated measures data (doubly multivariate data) with Kronecker product covariance structures. These classifiers are especially useful when the number of observations is not large enough to estimate the covariance matrices, and thus the traditional classifiers fail. The quality of these new classifiers is examined on some real data. Computational schemes for maximum likelihood estimates of required...
In this paper, we illustrate an application of Ascendant Hierarchical Cluster Analysis (AHCA) to complex data taken from the literature (interval data), based on the standardized weighted generalized affinity coefficient, by the method of Wald and Wolfowitz. The probabilistic aggregation criteria used belong to a parametric family of methods under the probabilistic approach of AHCA, named VL methodology. Finally, we compare the results achieved using our approach with those obtained by other authors....