Now showing items 1-6 of 6
Semi-supervised document classification using ontologies
Many modern applications of automatic document classification require learning accurately with little training data. Addressing the need to reduce the manual labeling process, the semi-supervised classification technique ...
Generalizaciones de minimos cuadrados parciales con aplicación en clasificacion supervisada
The development of technologies such as microarrays has generated a large amount of data. The main characteristic of this kind of data it is the large number of predictors (genes) and few observations (experiments). Thus, ...
Métodos para mejorar la calidad de un conjunto de datos para descubrir conocimiento
Today, data generation is growing exponentially in both directions; instances (rows) and features (columns). This causes that many datasets can not be analyzed without preprocessing. The large size of the dataset to be ...
Contributions to parallel and distributed computing in knowledge discovery and data mining
Recently databases are increasing continuously without bound, due to new data acquisition technologies. One challenge is how to gain knowledge from these large data sets. In this thesis, we analyze and improve the algorithmic ...
On applications of rough sets theory to knowledge discovery
Knowledge Discovery in Databases (KDD) is the nontrivial extraction of implicit, previously unknown and potentially useful information from data. Data preprocessing is a step of the KDD process that reduces the complexity ...
Temporal outlier detection using dynamic Bayesian networks and probabilistic association rules
Temporal datasets provide records of the evolution and dependencies of random variables over time. Recently, there has been an increase in the application of temporal datasets in areas such as intrusion detection, fraud ...