Classification of Banjarese Hulu and Kuala Dialects in Banjarese Prose Texts
DOI:
https://doi.org/10.59188/eduvest.v4i10.39017Keywords:
Banjarese Dialect, Boolean Query, Naïve Bayes, Dialect ClassificationAbstract
This research focuses on classifying the Hulu and Kuala Banjarese dialects in the prose text “Datu Kandangan and Datu Kartamina”. These dialects represent linguistic variations resulting from geographical, social, and cultural differences among language communities, particularly in South Kalimantan, Indonesia. Language analysis methods such as Python Natural Language Toolkit (NLTK), NumPy, and Latent Dirichlet Allocation (LDA) Visualization (LyLDAvis) were employed to classify the dialects, involving data preprocessing steps like tokenization, punctuation removal, stop word normalization, and stemming. The research findings reveal the superiority of the "Naive Bayes" method over the "Boolean Query," achieving high accuracy in identifying positive examples and classifying texts into Upper and Lower Banjar dialects. The "Naive Bayes" method outperforms the "Boolean Query" with precision and recall values of 0.955563 and 0.956098, while the "Boolean Query" only reaches 0.021416 and 0.146341. This study makes a significant scholarly contribution to understanding language and cultural diversity in South Kalimantan, opening opportunities for further exploration in developing Natural Language Processing (NLP) technology for Indonesian regional languages.
References
Tuhenay, D., & Mailoa, E. (2021). Perbandingan Klasifikasi Bahasa Menggunakan Metode Naïve Bayes Classifier (NBC) dan Support Vector Machine (SVM). Fakultas Teknologi Informasi, Program Studi Teknik Informatika, Universitas Kristen Satya Wacana.
Setiawan, A., Santoso, L. W., & Adipranata, R. (2020). Klasifikasi Artikel Berita Bahasa Indonesia Dengan Naive Bayes Classifier. Program Studi Informatika, Fakultas Teknologi Industri, Universitas Kristen Petra.
Hapip, A. D. (1977). Kamus Banjar-Indonesia. (Departemen Pendidikan dan Kebudayaan, Ed.) (1st ed.). Jakarta: Pusat Pembinaan dan Pengembangan Bahasa. Retrieved from http://repositori.kemdikbud.go.id
Hapip, A. D., Kawi, D., & Noor, B. (1981). Struktur Bahasa Banjar Kuala (Seri bb 71). Jakarta: Pusat Pembinaan dan Pengembangan Bahasa Departemen Pendidikan dan Kebudayaan. Retrieved from http://repositori.kemdikbud.go.id/id/eprint/240
Hapsari, R. K., & Juli, Y. (2015). Stemming Artikel Berbahasa Indonesia Dengan Pendekatan Confix-Stripping. Prosiding Seminar Nasional Manajemen Teknologi XXII, 1–8.
Humaidi, A., Kamariah, & Harpriyanti, H. (2017). Infleksi dalam Bahasa Banjar. Jurnal Bahasa, Sastra, Dan Pengajarannya, 2(2), 262–272.
Kamus Bahasa Banjar dialek Hulu-Indonesia. Pusat Pembinaan dan Pengembangan Bahasa. Banjarmasin: Balai Bahasa Banjarmasin, 2008. Deskripsi Fisik: xv, 311 hlm.; 21 cm. ISBN: 978-979-685-776-0.
Sugiyono, & Maryani, Y. (2008). Kamus Bahasa Indonesia. Jakarta: Pusat Bahasa Departemen Pendidikan Nasional.
Sugono, D. (2017). Bahasa dan Peta Bahasa di Indonesia (1st ed.). Jakarta: Kementrian Pendidikan dan Kebudayaan. Retrieved from http://repositori.kemdikbud.go.id/7191/1/
Suseno, Y. S. A. (2012). Jendela Sastra Media Sastra Indonesia. Retrieved April 3, 2019, from http://www.jendelasastra.com/karya/prosa/datu-kandangan-wan-datu-kartamina
Published
Issue
Section
License
Copyright (c) 2024 Muslihul Aqqad, Nova Rijati
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.