This paper examines the impact of data normalization on the performance of a classification model. The first part describes the structure of our dataset, covering its features and a basic statistical analysis. The dataset consists of medical data on patients with Parkinson's disease. The second part presents the data normalization process and the effect of scaling on classification performance. We used XGBoost as the classification model, and the task was to classify whether or not a patient has Parkinson's disease. Because the dataset contains several numerical features on different scales, the main aim was to investigate how data normalization (scaling) affects the performance of the classification model.
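
As an illustration of what scaling features with different ranges involves, the following is a minimal sketch of two common normalization schemes, min-max scaling and z-score standardization. The abstract does not state which scheme the paper uses, so both are shown as assumptions; the function names and the two feature columns are hypothetical and are not taken from the paper's dataset.

```python
from statistics import mean, pstdev


def min_max_scale(values):
    """Rescale values linearly to the range [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # A constant column carries no information; map it to zeros.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


def standardize(values):
    """Shift to zero mean and scale to unit (population) standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


# Hypothetical feature columns on very different scales (illustrative only).
feature_a = [100.0, 150.0, 200.0, 250.0]
feature_b = [0.001, 0.002, 0.003, 0.004]

scaled_a = min_max_scale(feature_a)
scaled_b = min_max_scale(feature_b)
standardized_a = standardize(feature_a)
```

After min-max scaling, both hypothetical columns span the same range even though their raw magnitudes differ by several orders of magnitude.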

eISSN: 1338-0532
Language: English
Publication timeframe: 2 times per year
Journal Subjects: Engineering, Introductions and Overviews, other