Abstract
Filter-based feature selection methods such as information gain, Gini index, and gain ratio are commonly used in machine learning. It is often assumed that these methods select the most accurate features, but we show this is not true. In this thesis, we study cases when these feature selection metrics and accuracy show “misorderings”: given a pair of features F1 and F2, where F1 has a higher accuracy than F2, the feature selection value is higher for F2 than F1. We first study the frequency of misorderings in randomly-produced synthetic data. Secondly, we study the potential for misordering as two key parameters of the features in a dataset are varied. Finally, we study misorderings in real data and show that misorderings are also prevalent there. Based on our results, we observe that different metrics exhibit different misordering rates, and imposing redundancy-elimination criteria may have the side effect of reducing misordering.
Background of the Study
Curriculum integration has emerged as a transformative educational strategy aimed at bridging the...
Background of the Study
Substance abuse and psychiatric disorders are often comorbid conditions, with s...
Chapter One: Introduction
1.1 Background of the Study
Employee commitment refers to the psycholo...
Background of the Study
Education is a key driver of social and economic development, and effective budgeting practices are...
Background of the Study
Employee retention is a critical challenge for small businesses, particularly in the hospitality...
Chapter One: Introduction
1.1 Background of the Study
Women...
Background of the Study :
The advent of digital technology has revolutionized various aspects of education, including the m...
Background of the Study
Integrated Risk Management (IRM) solutions have emerged as a critical framework for banks aiming to mitigate cred...
ABSTRACT
The purpose of this study was to assess the influence of the program Agriculture for a better tomorrow on farming pract...
Background of the Study
Digital transformation has become a critical driver of innovation and efficiency in human resource management (HR...