Search
Now showing items 1-10 of 15
A computational environment for data preprocessing in supervised classification
(2004)
In this thesis, a data preprocessing environment has been created, for use in a supervised classification context, with the Windows platform of the R programming language and environment for statistical computing and ...
Componentes principales supervisados para clasificación de datos de expresión genética
(2005)
The gene expression data obtained through the technology of microarrays are characterized by its considerably greater amount of features in comparison to the number of observations. The direct use of traditional statistics ...
Clasificación noparamétrica en datos direccionales
(2004)
In a supervised classification problem, when the vectors of data are direction- al, it means, that they take values on a k-dimensional sphere, the application of the algorithms of pattern recognition as k-nearest-neighbour ...
Clasificadores por redes bayesianas
(2005)
A Bayesian network is a compact representation of joint probability function. Formally, a Bayesian network is an acyclic directed graph in which each node represents a random variable and the relationships of dependence ...
Algorithms for non-parametric classifiers in multi-relational data mining
(2006)
Over the last decades, due to the advances in information technologies, both the industrial and scientific communities have acquired large volumes of data in digital form. Most of these data sets are stored using relational ...
Análisis sobre métodos de pruebas de hipótesis múltiple en la identificación de genes diferencialmente expresados
(2009-07)
The Human Genome Project is the most important reason for the surge of new technologies in the microarray area. These technologies facilitate the experimentation with a large number of genes simultaneously. These experiments ...
On applications of rough sets theory to knowledge discovery
(2007)
Knowledge Discovery in Databases (KDD) is the nontrivial extraction of implicit, previously unknown and potentially useful information from data. Data preprocessing is a step of the KDD process that reduces the complexity ...
Regresión logística con penalidad ridge aplicada a datos de expresión genética
(2005)
Logistic regression analysis is used in classification to find out which group an individual belong from a predictor variables set. In classification sometimes we work with data sets with more variables than observations. ...
Métodos para mejorar la calidad de un conjunto de datos para descubrir conocimiento
(2007)
Today, data generation is growing exponentially in both directions; instances (rows) and features (columns). This causes that many datasets can not be analyzed without preprocessing. The large size of the dataset to be ...
Contributions to parallel and distributed computing in knowledge discovery and data mining
(2006)
Recently databases are increasing continuously without bound, due to new data acquisition technologies. One challenge is how to gain knowledge from these large data sets. In this thesis, we analyze and improve the algorithmic ...