Development of an Optimized Bernoulli Naïve Bayes Classifier for Threshold-Based False Onset Rainfall Prediction

DOI: https://doi.org/10.33003/jobasr

Saifullahi Suleiman

Shamsudden Suleiman

Jamilu A. Bashir

Yusuf Bello

Abstract
This study presents the development of an optimized Bernoulli Naive Bayes classifier for predicting threshold-based false onset rainfall, a phenomenon critical to farming. The methodology followed in this research includes data collection, preprocessing, feature selection and threshold analysis, model development, model optimization, and evaluation.The primary focus of this research was the optimization of this model to improve its performance. Leave-one-out cross-validation was employed to systematically validate the model by training it on all but one instance and testing it on the excluded instance, ensuring robust performance evaluation. Grid search was used for hyper parameter tuning to identify the optimal parameters that maximize model accuracy. Alpha smoothing was applied to handle zero probabilities, ensuring the model's generalization to unseen data. The model was evaluated using key performance metrics, such as accuracy, precision, recall, and F1 score. Experimental results revealed that the optimized model achieved significant improvements in predictive accuracy and reliability over baseline implementations. This optimization framework highlights the model's computational efficiency and its suitability for real-time applications. The findings establish the potential of the optimized model as a powerful tool for addressing challenges associated with false onset rainfall prediction. Unlike deterministic models, this research emphasized probabilistic reasoning, introducing a novel approach to rainfall prediction.
References
Abdilah, A., Fitri, R. R., & Hasanah, R. U. (2024). Rainfall prediction using Naive Bayes classification and SMOTE. Indonesian Journal of Electrical Engineering and Computer Science, 23(1), 45–51. https://doi.org/10.11591/ijeecs.v23.i1.pp45-51 Adefolalu, D. O. (1986). Further aspects of Sahelian drought as evident from rainfall regime of Nigeria. Archives for Meteorology, Geophysics and Bioclimatology, 36(2), 123–135. Adeyeri, O. E., Okogbue, E. C., & James, O. (2020). Trends and variability in rainfall onset, cessation and length of growing season over Nigeria. Theoretical and Applied Climatology, 139(1), 571–585. https://doi.org/10.1007/s00704-019-02970-8 Ajayi, A. E., Ogunjobi, K. O., & Adefisan, E. A. (2021). False onset of rainfall and implications for maize farming in the Guinea Savanna of Nigeria. Weather and Climate Extremes, 33, 100357. https://doi.org/10.1016/j.wace.2021.100357 Balogun, E. E. (2000). Rainfall anomalies and onset of rains in Nigeria. Nigerian Journal of Meteorology, 5(2), 13–22. Kohavi, R. (1995). A study of cross-validation and bootstrap for accuracy estimation and model selection. In Proceedings of the 14th International Joint Conference on Artificial Intelligence (IJCAI) (pp. 1137–1143). Kundu, R., & Ahmed, M. R. (2020). A novel rainfall prediction model using machine learning approach. Applied Computing and Informatics. https://doi.org/10.1016/j.aci.2020.03.004 Liyew, A. Y., & Melese, M. M. (2021). Daily rainfall prediction using machine learning techniques: A case of Ethiopia. Journal of Big Data, 8, 1–17. https://doi.org/10.1186/s40537-021-00427-w Manandhar, S., Dev, S., Lee, Y. H., Meng, Y. S., & Winkler, S. (2019). A comparative study of rainfall prediction using artificial neural network and support vector machine. Weather and Climate Extremes, 24, 100201. https://doi.org/10.1016/j.wace.2019.100201 Manning, C. D., Raghavan, P., & Schütze, H. (2008). Introduction to information retrieval. Cambridge University Press. Odekunle, T. O. (2004). Rainfall and the length of the growing season in Nigeria. International Journal of Climatology, 24(4), 467–479. https://doi.org/10.1002/joc.1012 Odekunle, T. O., Balogun, A. A., & Ogunjobi, K. (2019). Rainfall characteristics and false start of rainy season in a tropical city. Environmental Monitoring and Assessment, 191, 512. https://doi.org/10.1007/s10661-019-7690-3 Ojo, O. (1977). The climate of West Africa. Heinemann. Ojo, O., & Ogunjo, S. T. (2022). Machine learning model for predicting rainfall over Nigeria. SN Applied Sciences, 4, 196. https://doi.org/10.1007/s42452-022-04999-7 Olaniran, O. J., & Sumner, G. N. (1989). A study of rainfall trends in Nigeria. Journal of Climatology, 9(1), 253–264. https://doi.org/10.1002/joc.3370090210 Omotosho, J. B., & Abiodun, B. J. (2021). Predictability of false rainfall onset in Nigeria. Theoretical and Applied Climatology, 145, 525–538. https://doi.org/10.1007/s00704-021-03620-7 Oswal, V. (2019). Rainfall prediction using machine learning techniques. International Journal of Engineering Research & Technology (IJERT), 8(7), 317–322. https://doi.org/10.17577/IJERTV8IS070145 Rahman, M. M., Akhand, M. M., & Hossain, M. A. (2022). A hybrid machine learning model for rainfall prediction. Neural Computing and Applications, 34, 7791–7805. https://doi.org/10.1007/s00521-021-06621-9 Rennie, J., Shih, L., Teevan, J., & Karger, D. (2003). Tackling the poor assumptions of Naïve Bayes text classifiers. In Proceedings of the 20th International Conference on Machine Learning (ICML-03) (pp. 616–623). Sandeep, N., & Jahavi, S. S. (2020). Machine learning approach for rainfall prediction: A case study. International Journal of Scientific & Technology Research, 9(1), 2305–2309. Sokolova, M., & Lapalme, G. (2009). A systematic analysis of performance measures for classification tasks. Information Processing & Management, 45(4), 427–437. https://doi.org/10.1016/j.ipm.2009.03.002 Varma, S., & Simon, R. (2006). Bias in error estimation when using cross-validation for model selection. BMC Bioinformatics, 7, 91. https://doi.org/10.1186/1471-2105-7-91 Zhang, H. (2004). The optimality of naive Bayes. In Proceedings of the Seventeenth International Florida Artificial Intelligence Research Society Conference (FLAIRS 2004) (pp. 562–567)
PDF