Implementation of Feature Selection Information Gain in Support Vector Machine Method for Stroke Disease Classification
(1) Universitas Islam Negeri Sultan Syarif Kasim Riau, Pekanbaru
(2) Universitas Islam Negeri Sultan Syarif Kasim Riau, Pekanbaru
(3) Universitas Islam Negeri Sultan Syarif Kasim Riau, Pekanbaru
(4) Universitas Islam Negeri Sultan Syarif Kasim Riau, Pekanbaru
(*) Corresponding Author
Abstract
Stroke is a disease with a high mortality and disability rate that requires early detection. However, the main challenge in the classification process of this disease is data imbalance and the large number of irrelevant features in the dataset. This study proposes a combination of Support Vector Machine (SVM) method with Information Gain feature selection technique and data balancing using Synthetic Minority Over-sampling Technique (SMOTE) to improve classification accuracy. The dataset used consists of 5,110 data with 10 variables and 1 label. Feature selection was performed with three threshold values (0.04; 0.01; and 0.0005), while SVM classification was tested on three different kernels: Linear, RBF, and Polynomial. Model evaluation was performed using Confusion Matrix and training and test data sharing using k-fold cross validation with k=10. The best results were obtained on the RBF kernel with Cost=100 and Gamma=5 parameters at an Information Gain threshold of 0.0005, with accuracy reaching 90.51%. These results show that the combination of techniques used aims to determine the variables that most affect SVM classification in detecting stroke disease
Keywords
Full Text:
PDFReferences
K. Wirastuti, N. S. Riasari, D. Djannah, dan M. Silviana, “Upaya Pencegahan Stroke melalui Skrining Skor Risiko Stroke dengan Intervensi Penyuluhan dan Pemeriksaan Faktor Risiko Stroke di Kelurahan Bojong Salaman Kecamatan Pusponjolo Selatan Semarang Barat,” J. ABDIMAS-KU J. Pengabdi. Masy. Kedokt., vol. 2, no. 1, hal. 23, 2023, doi: 10.30659/abdimasku.2.1.23-29.
D. Kuriakose dan Z. Xiao, “IMP para qué es el ictus, tipos y causas. También para datos epidemiológicos y tratamientos.,” Int. J. Mol. Sci., vol. 21, no. 20, hal. 1–24, 2020.
D. Majumder, “Ischemic Stroke: Pathophysiology and Evolving Treatment Approaches,” Neurosci. Insights, vol. 19, 2024, doi: 10.1177/26331055241292600.
M. N. Aziz dan A. Supriyadi, “Pengaruh Proprioceptive Neuromuscular Facilitation Techniques Terhadap Penurunan Spastisitas Otot Pasien Stroke: a Critical Review,” 2021, [Daring]. Tersedia pada: http://eprints.ums.ac.id/id/eprint/91145
L. Wang et al., “Remote ischemic conditioning enhances oxygen supply to ischemic brain tissue in a mouse model of stroke: Role of elevated 2,3-biphosphoglycerate in erythrocytes,” J. Cereb. Blood Flow Metab., vol. 41, no. 6, hal. 1277–1290, 2021, doi: 10.1177/0271678X20952264.
Setiawan et al, “Diagnosis Dan Tatalaksana Stroke Hemoragik,” J. Med. Utama, vol. 02, no. 01, hal. 402–406, 2021.
T. G. Rahayu, “Analisis Faktor Risiko Terjadinya Stroke Serta Tipe Stroke,” Faletehan Heal. J., vol. 10, no. 01, hal. 48–53, 2023, doi: 10.33746/fhj.v10i01.410.
Y. A. Utama dan S. S. Nainggolan, “Faktor Resiko yang Mempengaruhi Kejadian Stroke: Sebuah Tinjauan Sistematis,” J. Ilm. Univ. Batanghari Jambi, vol. 22, no. 1, hal. 549, 2022, doi: 10.33087/jiubj.v22i1.1950.
Astannudinsyah, Rusmegawati, dan C. K. Negara, “Jurnal Medika Karya Ilmiah Kesehatan Vol 5, No.2. 2020 ISSN :,” Med. Karya Ilm. Kesehat., vol. 5, no. 2, 2020, [Daring]. Tersedia pada: http://jurnal.itkeswhs.ac.id/index.php/medika/article/download/129/128
Harmawati, Etriyanti, dan S. Hardini, “Deteksi Dini Gejala Awal Stroke,” J. Abdimas Saintika, vol. 3, no. 2, hal. 186–189, 2021, [Daring]. Tersedia pada: https://jurnal.syedzasaintika.ac.id
N. Chafid et al., Kecerdasan Buatan. Batam: Juli 2024, 2024. [Daring]. Tersedia pada: https://www.google.co.id/books/edition/KECERDASAN_BUATAN/A98SEQAAQBAJ?hl=id&gbpv=1&dq=Kecerdasan+Buatan,+Batam:+Yayasan+Cendikia+Mulia+Mandiri&pg=PR2&printsec=frontcover
M. M. Ahsan, S. A. Luna, dan Z. Siddique, “Machine-Learning-Based Disease Diagnosis : A,” Healthcare, hal. 1–30, 2022.
M. W. Sanjaya, Fisika Komputasi Berbasis Machine Learning Dengan Pemrograman Python. Bandung: BolaBot, 2024. [Daring]. Tersedia pada: https://www.google.co.id/books/edition/Fisika_Komputasi_Berbasis_Machine_Learni/-gI-EQAAQBAJ?hl=id&gbpv=1&dq=Fisika+Komputasi+Berbasis+Machine+Learning+Dengan+Pemrograman+Python&pg=PA72&printsec=frontcover
R. Guido, S. Ferrisi, D. Lofaro, dan D. Conforti, “An Overview on the Advancements of Support Vector Machine Models in Healthcare Applications: A Review,” Inf., vol. 15, no. 4, 2024, doi: 10.3390/info15040235.
S. Rahayu dan Y. Yamasari, “Klasifikasi Penyakit Stroke dengan Metode Support Vector Machine (SVM),” J. Informatics Comput. Sci., vol. 5, no. 03, hal. 440–446, 2024, doi: 10.26740/jinacs.v5n03.p440-446.
A. Nazri dan R. A. Panbudi, “Implementasi Algoritma SVM dalam Memprediksi Penyakit Stroke,” vol. 06, no. 02, hal. 4–8, 2024.
L. Pasiolo et al., “PENYAKIT STROKE DENGAN ALGORITMA SUPPORT,” vol. 7, no. 1, hal. 61–73.
D. Ispriyanti, P. A. Octaviani, dan Y. Wilandari, “Penerapan Metode SVM Pada Data Akreditasi Sekolah Dasar Di Kabupaten Magelang,” J. Gaussian, vol. 3, no. 8, hal. 811–820, 2014.
I. Santoso, Windu Gata, dan Atik Budi Paryanti, “Penggunaan Feature Selection di Algoritma Support Vector Machine untuk Sentimen Analisis Komisi Pemilihan Umum,” J. RESTI (Rekayasa Sist. dan Teknol. Informasi), vol. 3, no. 3, hal. 364–370, 2019, doi: 10.29207/resti.v3i3.1084.
N. A. Amri et al., IMAGE PROCESSING. Yogyakarta: PT. Green Pustaka Indonesia, 2025. [Daring]. Tersedia pada: https://www.google.co.id/books/edition/Image_Processing/a-pXEQAAQBAJ?hl=id&gbpv=1&dq=seleksi+fitur+adalah&pg=PA82&printsec=frontcover
A. Angela Sitinjak et al., Matematika Pada Kecerdasan Buatan. Makasar: CV. Tohar Media, 2024. [Daring]. Tersedia pada: https://www.google.co.id/books/edition/Matematika_Pada_Kecerdasan_Buatan/cEYhEQAAQBAJ?hl=id&gbpv=1&dq=Matematika+Pada+Kecerdasan+Buatan&pg=PA223&printsec=frontcover
A. W. Attabi, Lailil Muflikhah, dan Mochammad Ali Fauzi, “Penerapan Analisis Sentimen untuk Menilai Suatu Produk pada Twitter Berbahasa Indonesia dengan Metode Naïve Bayes Classifier dan Information Gain,” J. Pengemb. Teknol. Inf. dan Ilmu Komput., vol. 2, no. 11, hal. 4548–4554, 2018.
A. Kulsumarwati, I. Purnamasari, dan B. A. Darmawan, “Penerapan SVM dan Information Gain Pada Analisis Sentimen Pelaksanaan Pilkada Saat Pandemi,” J. Teknol. Inform. dan Komput., vol. 7, no. 2, hal. 101–109, 2021, doi: 10.37012/jtik.v7i2.641.
D. Apriliani dan O. Somantri, “Support Vector Machine Berbasis Feature Selection Untuk Sentiment Analysis Kepuasan Pelanggan Terhadap Pelayanan Warung dan Restoran Kuliner Kota Tegal,” J. Teknol. Inf. dan Ilmu Komput., vol. 5, no. 5, hal. 537, 2018, doi: 10.25126/jtiik.201855867.
B. Aribowo dan S. Fairuz, Panduan Praktis Machine Learning Klasifikasi Menggunakan Python. Yogyakarta: dandra, 2024. [Daring]. Tersedia pada: https://www.google.co.id/books/edition/Panduan_Praktis_Machine_Learning_Klasifi/Wr5AEQAAQBAJ?hl=id&gbpv=1&dq=Panduan+Praktis+Machine+Learning+Klasifikasi+Menggunakan+Python&pg=PA55&printsec=frontcover
D. W. Lestari, D. A. Lusia, M. Y. Rochayani, dan U. Sa’adah, KUPAS TUNTAS ALGORITMA DATA MINING DAN IMPLEMENTASINYA MENGGUNAKAN R. Malang: UB Press, 2021. [Daring]. Tersedia pada: https://www.google.co.id/books/edition/Kupas_Tuntas_Algoritma_Data_Mining_dan_I/SI1TEAAAQBAJ?hl=id&gbpv=1&dq=normalisasi+dalam+data+mining&pg=PA49&printsec=frontcover
A. Apriyanto et al., DATA MINING (Teori dan Penerapannya dalam Berbagai Bidang). Jambi: PT. Sonpedia Publishing Indonesia, 2024. [Daring]. Tersedia pada: https://www.google.co.id/books/edition/Data_Mining_Teori_dan_Penerapannya_dalam/hTAxEQAAQBAJ?hl=id&gbpv=1&dq=normalisasi+data+dalam+data+mining+adalah&pg=PA113&printsec=frontcover
M. S. Amrullah dan S. F. Pane, Analisis Sentiment Masyarakat Terhadap Kebijakan Polisi Tilang Manual Di Indonesia Dengan Svm(Support Vector Machine). Bandung Barat: Buku Pedia, 2023. [Daring]. Tersedia pada: https://www.google.co.id/books/edition/Analisis_Sentimen_Masyarakat_Terhadap_Ke/_Uq5EAAAQBAJ?hl=id&gbpv=1&dq=Analisis+Sentimen+Masyarakat+Terhadap+Kebijakan+Polisi+Tilang+Manual+di+Indonesia+dengan+SVM&pg=PA9&printsec=frontcover
S. Agustin et al., No Title. Batam: Yayasan Cendikia Mulia Mandiri, 2025. [Daring]. Tersedia pada: https://www.google.co.id/books/edition/PENGOLAHAN_CITRA_DIGITAL/s0pLEQAAQBAJ?hl=id&gbpv=1&dq=Pengolahan+Citra+Digital&pg=PA72&printsec=frontcover
DOI: http://dx.doi.org/10.61944/bids.v4i1.116
Refbacks
- There are currently no refbacks.
Copyright (c) 2025 Anisa Fitri, Iis Afrianty, Elvia Budianita, Siska Kurnia Gusti

This work is licensed under a Creative Commons Attribution 4.0 International License.
Bulletin of Informatics and Data Science
Asosiasi Peneliti Data Science Indonesia
Email: pdsi.bids@gmail.com
This work is licensed under a Creative Commons Attribution 4.0 International License.
