Search
Now showing items 11-20 of 24
Análisis sobre métodos de pruebas de hipótesis múltiple en la identificación de genes diferencialmente expresados
(2009-07)
The Human Genome Project is the most important reason for the surge of new technologies in the microarray area. These technologies facilitate the experimentation with a large number of genes simultaneously. These experiments ...
On applications of rough sets theory to knowledge discovery
(2007)
Knowledge Discovery in Databases (KDD) is the nontrivial extraction of implicit, previously unknown and potentially useful information from data. Data preprocessing is a step of the KDD process that reduces the complexity ...
Un algoritmo para clasificación no supervisada de datos funcionales
(2011-12)
La estadística multivariada ofrece diversas herramientas que permiten realizar un análisis para ciertos conjuntos de datos. Sin embargo surge una rama de la estadística en la cual se dejan de observar conjuntos de datos ...
Regresión logística con penalidad ridge aplicada a datos de expresión genética
(2005)
Logistic regression analysis is used in classification to find out which group an individual belong from a predictor variables set. In classification sometimes we work with data sets with more variables than observations. ...
Métodos para mejorar la calidad de un conjunto de datos para descubrir conocimiento
(2007)
Today, data generation is growing exponentially in both directions; instances (rows) and features (columns). This causes that many datasets can not be analyzed without preprocessing. The large size of the dataset to be ...
Contributions to parallel and distributed computing in knowledge discovery and data mining
(2006)
Recently databases are increasing continuously without bound, due to new data acquisition technologies. One challenge is how to gain knowledge from these large data sets. In this thesis, we analyze and improve the algorithmic ...
Generalizaciones de minimos cuadrados parciales con aplicación en clasificacion supervisada
(2004)
The development of technologies such as microarrays has generated a large amount of data. The main characteristic of this kind of data it is the large number of predictors (genes) and few observations (experiments). Thus, ...
Comparación de algoritmos para clustering de "streams" de series de tiempo
(2012-05)
In recent years, technological advances have resulted in a huge increment in data production as in the evolution of methods that facilitated its collection. The data that arrive continuously and massively with infinite ...
A comparison in cluster validation techniques
(2004)
Clustering may be defined as a process that aims to find partitions of similar objects. It is an unsupervised recognition procedure since there are no predefined classes that indicate grouping properties in the data set. ...
Efecto de casos anómalos en máquinas de vectores de soporte
(2009-03)
Support Vector Machines (SVM) is a new technique of classification that has received much attention in recent years. In many applications, the SVM has shown better performance than machine learning methods, and it has been ...