Hepatitis C Stage Classification with hybridization of GA and Chi2 Feature Selection

No Thumbnail Available

Date

2022-02-02

Journal Title

Journal ISSN

Volume Title

Publisher

IJCSNS International Journal of Computer Science and Network Security

Abstract

In metaheuristic algorithms such as Genetic Algorithm (GA), initial population has a significant impact as it affects the time such algorithm takes to obtain an optimal solution to the given problem. In addition, it may influence the quality of the solution obtained. In the machine learning field, feature selection is an important process to attaining a good performance model; Genetic algorithm has been utilized for this purpose by scientists. However, the characteristics of Genetic algorithm, namely random initial population generation from a vector of feature elements, may influence solution and execution time. In this paper, the use of a statistical algorithm has been introduced (Chi2) for feature relevant checks where p-values of conditional independence were considered. Features with low p-values were discarded and subject relevant subset of features to Genetic Algorithm. This is to gain a level of certainty of the fitness of features randomly selected. An ensembled-based learning model for Hepatitis has been developed for Hepatitis C stage classification. 1385 samples were used using Egyptian-dataset obtained from UCI repository. The comparative evaluation confirms decreased in execution time and an increase in model performance accuracy from 56% to 63%.

Description

Keywords

Hepatitis, Hepatitis prediction, Feature selection

Citation

Umar, Rukayya et.al. (2022). Hepatitis C Stage Classification with hybridization of GA and Chi2 Feature Selection. IJCSNS International Journal of Computer Science and Network Security, 22(1)

Endorsement

Review

Supplemented By

Referenced By