Abstract
Dimensionality reduction provides a compact representation of high-dimensional data: only the vital information is retained, so the reduced data needs no further processing. For this reason, it is an invaluable preprocessing step before applying the many machine learning algorithms that perform poorly on high-dimensional data. In this thesis, the perceptron classification algorithm (an eager learner) and the k-nearest neighbors classification algorithm (a lazy learner) are each applied to three two-class datasets (the Student, Weather and Ionosphere datasets). Each dataset is then reduced using fifteen different dimensionality reduction techniques. Both classifiers are applied to each reduced dataset, and the techniques are compared, using confusion matrices, on how well they preserve the classification that each algorithm produced on the original dataset. The investigation revealed that the dimensionality reduction techniques implemented in this thesis seem to preserve k-nearest neighbors classification much better than they preserve perceptron classification of the original datasets. In general, the techniques prove to be very effective at preserving the classification of both the lazy and the eager learner used in this investigation.
Keywords: classification, confusion matrix, dimensionality reduction, eager learner, k-nearest neighbors, lazy learner, perceptron.
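The comparison described in this abstract can be illustrated with a minimal sketch. The assumptions are loud ones: the data is synthetic (not the Student, Weather or Ionosphere datasets), random projection stands in for a dimensionality reduction technique (the abstract does not name the fifteen techniques used), only k-nearest neighbors is shown, and all function names are hypothetical. The confusion matrix here tabulates the classifier's predictions on the original data against its predictions on the reduced data, which is one plausible reading of "preserving the classification".

```python
import math
import random
from collections import Counter


def knn_predict(train_X, train_y, query, k=3):
    """Majority vote among the k training points closest to query (Euclidean)."""
    dists = sorted(
        (math.dist(x, query), label) for x, label in zip(train_X, train_y)
    )
    votes = Counter(label for _, label in dists[:k])
    return votes.most_common(1)[0][0]


def random_projection(X, out_dim, seed=0):
    """Project each row of X onto out_dim random Gaussian directions.

    Stand-in reduction technique, chosen only for illustration.
    """
    rng = random.Random(seed)
    in_dim = len(X[0])
    R = [
        [rng.gauss(0, 1) / math.sqrt(out_dim) for _ in range(in_dim)]
        for _ in range(out_dim)
    ]
    return [[sum(r[j] * row[j] for j in range(in_dim)) for r in R] for row in X]


def confusion_matrix(reference, predicted, labels=(0, 1)):
    """Rows index the reference label, columns the predicted label."""
    index = {lab: i for i, lab in enumerate(labels)}
    m = [[0] * len(labels) for _ in labels]
    for r, p in zip(reference, predicted):
        m[index[r]][index[p]] += 1
    return m


def loo_predictions(X, y, k=3):
    """Leave-one-out k-NN prediction for every point."""
    return [
        knn_predict(X[:i] + X[i + 1:], y[:i] + y[i + 1:], X[i], k)
        for i in range(len(X))
    ]


# Synthetic two-class data: 20 points per class in 10 dimensions.
rng = random.Random(42)
X = [[rng.gauss(c * 2.0, 1.0) for _ in range(10)] for c in (0, 1) for _ in range(20)]
y = [c for c in (0, 1) for _ in range(20)]

pred_original = loo_predictions(X, y)
pred_reduced = loo_predictions(random_projection(X, 3), y)

# Off-diagonal counts are points whose predicted class changed after reduction.
agreement = confusion_matrix(pred_original, pred_reduced)
print(agreement)
```

A reduction that perfectly preserves the classification would leave both off-diagonal entries of `agreement` at zero; the same comparison could be repeated with a perceptron in place of `knn_predict`.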
BACKGROUND TO THE STUDY
Terrorism and insurgency are becoming household words globally, as there is no nation that is com...
ABSTRACT
This study was motivated by the growing concern about the impact of institutional quality on economic outcomes. Th...
ABSTRACT
This study was carried out to examine agriculture with special reference to Enugu North local...
Abstract
Solid waste management is an established environmental health challenge in most societies. The heterogeneous nat...
ABSTRACT
This study investigated the incidence of Salmonella and E. coli infection among...
ABSTRACT
This study examines the impact of training and development on the effective performance of workers in the public se...
Background
Related Work. I begin with background information on ways of representing temporal networks and data structures for storing (s...
Abstract
Nigeria as a nation has over the years engaged in many developmental activities without actions, which makes a...
Abstract: Innovations in assessing creativity and innovation in technical education are ess...
ABSTRACT
This study explored the effect of performance management on employee performance. The general objective of the study was to dete...