Search

Now showing items 11-20 of 24

Unsupervised classification of text documents

Aparicio-Carrasco, Roxana K. (2007)

The automatic extraction of knowledge from very large document collections is becoming an important issue in order to exploit the increasing available information stored in text form. A significant aspect of this extraction ...

Modelo de clasificación y predicción en dos etapas: utilizando árboles de clasificación y el análisis de regresión multivariada

Choque-Dextre, Yency E. (2015-06)

Currently there exists a great variety of methods and algorithms attempting to optimize the process of classification. However, these methods do not take into account the internal structure of the classification datasets. ...

A computational environment for data preprocessing in supervised classification

Rodríguez, Caroline K. (2004)

In this thesis, a data preprocessing environment has been created, for use in a supervised classification context, with the Windows platform of the R programming language and environment for statistical computing and ...

Regresión logística con penalidad ridge aplicada a datos de expresión genética

Prieto-Castellanos, Karen A. (2005)

Logistic regression analysis is used in classification to find out which group an individual belong from a predictor variables set. In classification sometimes we work with data sets with more variables than observations. ...

Métodos para mejorar la calidad de un conjunto de datos para descubrir conocimiento

Daza-Portocarrero, Luis A. (2007)

Today, data generation is growing exponentially in both directions; instances (rows) and features (columns). This causes that many datasets can not be analyzed without preprocessing. The large size of the dataset to be ...

Contributions to parallel and distributed computing in knowledge discovery and data mining

Lozano-Inca, Elio (2006)

Recently databases are increasing continuously without bound, due to new data acquisition technologies. One challenge is how to gain knowledge from these large data sets. In this thesis, we analyze and improve the algorithmic ...

Análisis sobre métodos de pruebas de hipótesis múltiple en la identificación de genes diferencialmente expresados

Muñiz-Rivera, Lus M. (2009-07)

The Human Genome Project is the most important reason for the surge of new technologies in the microarray area. These technologies facilitate the experimentation with a large number of genes simultaneously. These experiments ...

On applications of rough sets theory to knowledge discovery

Coaquira-Nina, Frida R. (2007)

Knowledge Discovery in Databases (KDD) is the nontrivial extraction of implicit, previously unknown and potentially useful information from data. Data preprocessing is a step of the KDD process that reduces the complexity ...

Un algoritmo para clasificación no supervisada de datos funcionales

Barreto-González, César A. (2011-12)

La estadística multivariada ofrece diversas herramientas que permiten realizar un análisis para ciertos conjuntos de datos. Sin embargo surge una rama de la estadística en la cual se dejan de observar conjuntos de datos ...

Reducción de la dimensionalidad para optimizar la clasificación de datos funcionales

Huanca-Ochoa, Shirley Y. (2015-05)

Nowdays throw due to the continuous advance of technology, statisticians have been facing the need to develop new methods to extract meaningful information quickly and efficiently in large data sets, such as functional ...