Performance Evaluation of Multiple Deep Learning Models for Wine Quality Prediction

Dedik Fabiyanto, Yan Rianto

Abstract


Research utilizing a dataset from the UCI repository evaluated the predictive accuracy of nine machine learning models for wine quality. The models employed include Logistic Regression, K-Nearest Neighbor (KNN), Decision Tree, Support Vector Machine (SVM), Random Forest, XGBoost, LightGBM, CatBoost, and Gradient Boosting. The dataset comprises 1,599 samples with 12 chemical parameters. Data preprocessing, including oversampling, normalization, standardization, and seeding, was performed to enhance model performance.

The study's findings indicate that the models with the highest accuracy values were LightGBM (87.80%), CatBoost (86.60%), and Random Forest (85.70%). A voting classifier combining these three models achieved an accuracy of 87.29%. Further analysis using a confusion matrix demonstrated that this combined model effectively predicts the "Good" and "Not Good" classes.

In conclusion, the combination of LightGBM, CatBoost, and Random Forest models proves to be an effective approach for predicting wine quality based on chemical parameters, with an accuracy value of 87.29%.


Keywords


wine quality, voting classifier, model evaluation

Full Text:

PDF

References


R. Zhu, “Chemical Change and Quality Control in Winemaking,” Scientific and Social Research, vol. 4, no. 7, pp. 62-67, 14 Juli 2022.

M. H. Shahrajabian dan W. Sun, “Assessment of Wine Quality, Traceability and Detection of Grapes Wine, Detection of Harmful Substances in Alcohol and Liquor Composition Analysis,” Letters in Drug Design & Discovery, vol. 21 (8), no. Doi: 10.2174/1570180820666230228115450, pp. 1377-1399, Juni 2024.

L. Le, P. N. Hurtado, I. Lawrence, Q. Tian dan B. Chen, “Applying Neural Networks in Wineinformatics with the New Computational Wine Wheel,” Fermentation, vol. 9 (7), no. Doi: 10.3390/fermentation9070629, pp. 629-629., 2023.

J. Dong, “Red Wine Quality Analysis based on Machine Learning Techniques,” Highlights in Science, Engineering and Technology, vol. 49, no. Doi: 10.54097/hset.v49i.8506, pp. 208-213, 2023.

C. Zeng, J. Fang, Q. Yang, C. Xiang, Z. Zhao dan Y. Lei, “Wine quality grade data analysis and prediction based on multiple machine learning algorithms,” dalam Proceedings of the 2nd International Conference on Mechatronics and Smart Systems, 2024.

J. A. Clarin, “Comparison of the Performance of Several Regression Algorithms in Predicting the Quality of White Wine in WEKA,” International Journal of Emerging Technology and Advanced Engineering, vol. 12 (07), no. Doi: 10.46338/ijetae0722_03 , pp. 20-26, 3 Juli 2022.

A. K., “Regression Modeling Approaches for Red Wine Quality Prediction: Individual and Ensemble,” International Journal for Research in Applied Science & Engineering Technology (IJRASET), vol. 11, no. Doi: doi.org/10.22214/ijraset.2023.54363, pp. 3621-3627, Juni 2023.

N. Pourmoradi, “Red Wine Quality,” Kaggle, 2023. [Online]. Available: https://www.kaggle.com/code/nimapourmoradi/red-wine-quality/input. [Diakses 21 Juni 2024].

R. S. Jackson, Wine Science: Principles and Applications (3rd Edition), Burlington: Academic Press, 2008.

A. Géron, Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems (2nd edition), N. Tache, Penyunt., Sebastopol: O’Reilly Media, Inc, 2019.

Ridwan, E. H. Hermaliani dan M. Ernawati, “Penerapan Metode SMOTE Untuk Mengatasi Imbalanced Data Pada,” Computer Science (CO-SCIENCE), vol. 4 (1), no. E-ISSN: 2774-9711, pp. 80-88, Januari 2024.

D. A. Nasution, H. H. Khotimah dan N. Chamidah, “Perbandingan Normalisasi Data untuk Klasifikasi Wine Menggunakan Algoritma K-NN,” CESS (Journal of Computer Engineering System and Science), vol. 4 (1), pp. 78-82, Januari 2019.

S. Ioffe dan C. Szegedy, “Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift,” dalam Proceedings of the 32nd International Conference on Machine Learning, PMLR, 2015.

K. S. Nugroho, “Confusion Matrix untuk Evaluasi Model pada Supervised Learning,” 13 November 2019. [Online]. Available: https://ksnugroho.medium.com/confusion-matrix-untuk-evaluasi-model-pada-unsupervised-machine-learning-bc4b1ae9ae3f. [Diakses 23 Juni 2024].

S. Raschka dan V. Mirjalili, Python Machine Learning, Burningham: Packt Publishing Ltd., 2019.




DOI: https://doi.org/10.31315/telematika.v21i2.13007

DOI (PDF): https://doi.org/10.31315/telematika.v21i2.13007.g6670

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Copyright of :
TELEMATIKA: Jurnal Informatika dan Teknologi Informasi
ISSN 1829-667X (print); ISSN 2460-9021 (online)


Dipublikasi oleh
Jurusan Teknik Informatika, UPN Veteran Yogyakarta
Jl. Babarsari 2 Yogyakarta 55281 (Kampus Unit II)
Telp: +62 274 485786
email: [email protected]

 

Jurnal Telematika sudah diindeks oleh beberapa lembaga berikut:
 

 

 

 

 

Status Kunjungan Jurnal Telematika