Search
Now showing items 11-20 of 24
Unsupervised classification of text documents
(2007)
The automatic extraction of knowledge from very large document collections is becoming an important issue in order to exploit the increasing available information stored in text form. A significant aspect of this extraction ...
Modelo de clasificación y predicción en dos etapas: utilizando árboles de clasificación y el análisis de regresión multivariada
(2015-06)
Currently there exists a great variety of methods and algorithms attempting to optimize the process of classification. However, these methods do not take into account the internal structure of the classification datasets. ...
A computational environment for data preprocessing in supervised classification
(2004)
In this thesis, a data preprocessing environment has been created, for use in a supervised classification context, with the Windows platform of the R programming language and environment for statistical computing and ...
Regresión logística con penalidad ridge aplicada a datos de expresión genética
(2005)
Logistic regression analysis is used in classification to find out which group an individual belong from a predictor variables set. In classification sometimes we work with data sets with more variables than observations. ...
Métodos para mejorar la calidad de un conjunto de datos para descubrir conocimiento
(2007)
Today, data generation is growing exponentially in both directions; instances (rows) and features (columns). This causes that many datasets can not be analyzed without preprocessing. The large size of the dataset to be ...
Contributions to parallel and distributed computing in knowledge discovery and data mining
(2006)
Recently databases are increasing continuously without bound, due to new data acquisition technologies. One challenge is how to gain knowledge from these large data sets. In this thesis, we analyze and improve the algorithmic ...
Análisis sobre métodos de pruebas de hipótesis múltiple en la identificación de genes diferencialmente expresados
(2009-07)
The Human Genome Project is the most important reason for the surge of new technologies in the microarray area. These technologies facilitate the experimentation with a large number of genes simultaneously. These experiments ...
On applications of rough sets theory to knowledge discovery
(2007)
Knowledge Discovery in Databases (KDD) is the nontrivial extraction of implicit, previously unknown and potentially useful information from data. Data preprocessing is a step of the KDD process that reduces the complexity ...
Un algoritmo para clasificación no supervisada de datos funcionales
(2011-12)
La estadística multivariada ofrece diversas herramientas que permiten realizar un análisis para ciertos conjuntos de datos. Sin embargo surge una rama de la estadística en la cual se dejan de observar conjuntos de datos ...
Reducción de la dimensionalidad para optimizar la clasificación de datos funcionales
(2015-05)
Nowdays throw due to the continuous advance of technology, statisticians have been facing the need to develop new methods to extract meaningful information quickly and efficiently in large data sets, such as functional ...