EVALUATING THE PERFORMANCE OF MACHINE LEARNING ALGORITHMS FOR PREDICTING METEOROLOGICAL PARAMETERS

Yannick MUBAKILAYI(1), Simon Ntumba(2), Pierre Kafunda(3), Gracias Kabulu(4), Theodore kabangu(5), Christian Kabeya(6),


(1) Department of Computer Science, University of Kinshasa, Kinshasa
(2) Department of Computer Science, University of Kinshasa, Kinshasa
(3) Department of Computer Science, University of Kinshasa, Kinshasa
(4) Department of Computer Science, University of Mbujimayi, Mbujimayi
(5) Department of Computer Science, University of Mbujimayi, Mbujimayi
(6) Department of Mechanics, University of Mbujimayi, Mbujimayi
Corresponding Author

Abstract


Artificial learning techniques are currently used for weather and climate forecasting, etc. In this paper, we will evaluate three algorithms for predicting meteorological parameters based on the humidity parameter. Our dataset was taken from Mbujimayi airport in the DRC. So, having the real-world data at our disposal, we used these Machine Learning tools to interpret and understand what happened by training the three models separately, and draw the conclusion as to which was the best model. Then, we used the three models to make some predictions about what our environment will be like tomorrow, and to draw conclusions and make decisions about whether or not our climate is already facing climate change. Three models are used: Decision tree, k-nearest neighbor and neural network, the analysis reveals that of the three models tested, the decision tree scored 81.8% after training with an average prediction of 71.5%, in second place we have the K-nearest neighbor with a score of 70% after training with an average prediction of 70, 8% and the cloture neural network with 64% training and an average prediction of 66.1%. Thus, the decision tree outperforms the other models in terms of training and prediction of meteorological parameters, and is the best model with a very high performance compared to the other models.

Keywords


Machine Learning, prediction, climate change

References


S. Thober, R. Kumar, N. Wanders, A. Marx, M. Pan, O. Rakovec, L. Samaniego, J. Sheffield, E. F. Wood, M. Zink, Multi-model ensemble projections of European river floods and high flows at 1.5, 2, and 3 degrees global warming, Environ. Res. Lett. 13 (2018) 014003.

K. Mohammed, A. K. M. S. Islam, G. M. T. Islam, L. Alfieri, M. J. U. Khan, S. K. Bala, M. K. Das, Future Floods in Bangladesh under 1.5°C, 2°C, and 4°C Global Warming Scenarios, J. Hydrol. Eng. 23 (2018) 04018050.

B. Azari, M. Tabesh, Urban storm water drainage system optimization using a sustainability index and LID/BMPs, Sustain. Cities Soc. 76 (2022) 103500.

K. Azizi, C. I. Meier, Improving the Characterization of Urban Flash Floods through Application of Local Knowledge, in AGU Fall Meeting Abstracts, (2020), 2020, H162-0011.

S. Ebrahimi, M. Khorram, Variability effect of hydrological regime on river quality pattern and its uncertainties: case study of Zarjoob River in Iran, J. Hydroinformatics 23 (2021) 1146–1164.

M. Tabesh, B. Azari, H. Rezaei, Hydraulic performance assessment of stormwater networks, in Proceedings of the International Conference on Civil, Architecture and Disaster Management (ICCADM-16), Hong Kong, (2016), 17–18.

B. Azari, M. Tabesh, Storm Water Drainage Networks Hydraulic Performance Assessment, International Conference On Sustainable Development And Urban Construction, Iran. (2015)

K. Azizi, C. I. Meier, Urban Pluvial Flood Risk Assessment: Challenges and Opportunities for Improvement Using a Community-Based Approach, in World Environmental and Water Resources Congress 2021, (2021), 350–361. doi: 10.1061/9780784483466.033.

D. Absalon, B. Ślesak, Air temperature increase and quality of life in an anthropogenically transformed environment: A case study, Polish J. Environ. Stud. 21 (2012) 235–239.

Q. Song, B. S. Chissom, Forecasting enrollments with fuzzy time series — Part I, Fuzzy Sets Syst. 54 (1993) 1–9.

S.-M. Chen, J.-R. Hwang, Temperature prediction using fuzzy time series, IEEE Trans. Syst. Man, Cybern. Part B 30 (2000) 263–275.

N. Zhong, S. S. Yau, J. Ma, S. Shimojo, M. Just, B. Hu, G. Wang, K. Oiwa, Y. Anzai, Brain Informatics-Based Big Data and the Wisdom Web of Things, IEEE Intell. Syst. 30 (2015) 2–7.

Z. Su, Q. Xu, Q. Qi, Big data in mobile social networks: a QoE-oriented framework, IEEE Netw. 30 (2016) 52–57.

J. Wen, J. Yang, B. Jiang, H. Song, H. Wang, Big Data Driven Marine Environment Information Forecasting: A Time Series Prediction Network, IEEE Trans. Fuzzy Syst. 29 (2021) 4–18.

M. Alizamir, O. Kisi, A. N. Ahmed, C. Mert, C. M. Fai, S. Kim, N. W. Kim, A. El-Shafie, Advanced machine learning model for better prediction accuracy of soil temperature at different depths, PLoS One 15 (2020) 1–25.

B. Azari, M. Tabesh, Optimal design of stormwater collection networks considering hydraulic performance and BMPs, Int. J. Environ. Res. 12 (2018) 585–596.

A. Rahman, A. Khan, A. A. Raza, Parkinson’s disease detection based on signal processing algorithms and machine learning, CRPASE Trans. Electr. Electron. Comput. Eng. 6 (2020) 141–145.

S. R. Moosavia, D. A. Woodb, S. A. Samadanic, Modeling Performance of Foam-CO2 Reservoir Flooding with Hybrid Machine-learning Models Combining a Radial Basis Function and Evolutionary Algorithms, methods 4 (2020) 5.

K. A. Marill, Advanced Statistics: Linear Regression, Part II: Multiple Linear Regression, Acad. Emerg. Med. 11 (2004) 94–102.

M. M. Ahmadi, H. Shahriari, Y. Samimi, A novel robust control chart for monitoring multiple linear profiles in phase II, Commun. Stat. - Simul. Comput. (2020) 1–12.

S. Mehri, M. M. Ahmadi, H. Shahriari, A. Aghaie, Robust process capability indices for multiple linear profiles, Qual. Reliab. Eng. Int. 37 (2021) 3568–3579.

J. Liang, Q. Yang, T. Sun, J. D. Martin, H. Sun, L. Li, MIKE 11 model-based water quality model as a tool for the evaluation of water quality management plans, J. Water Supply Res. Technol. 64 (2015) 708–718.

I. Saini, D. Singh, A. Khosla, QRS detection using K-Nearest Neighbor algorithm (KNN) and evaluation on standard ECG databases, J. Adv. Res. 4 (2013) 331–344.

X. Wang, L. Ma, X. Wang, Apply semi-supervised support vector regression for remote sensing water quality retrieving, in 2010 IEEE International Geoscience and Remote Sensing Symposium, (2010), 2757–2760. doi: 10.1109/IGARSS.2010.5653832.

S. Liu, H. Tai, Q. Ding, D. Li, L. Xu, Y. Wei, A hybrid approach of support vector regression with genetic algorithm optimization for aquaculture water quality prediction, Math. Comput. Model. 58 (2013) 458–465.

A. Najah, A. El-Shafie, O. A. Karim, A. H. El-Shafie, Application of artificial neural networks for water quality prediction, Neural Comput. Appl. 22 (2013) 187–201.

K. P. Singh, A. Basant, A. Malik, G. Jain, Artificial neural network modeling of the river water quality—A case study, Ecol. Modell. 220 (2009) 888–895.

B. Fotovvati, M. Balasubramanian, E. Asadi, Modeling and Optimization Approaches of Laser-Based Powder-Bed Fusion Process for Ti-6Al-4V Alloy, Coatings 10 (2020)

P. Hosseinzadeh Talaee, Daily soil temperature modeling using neuro-fuzzy approach, Theor. Appl. Climatol. 118 (2014) 481–489.

A. Jain, R. W. McClendon, G. Hoogenboom, R. Ramyaa, Prediction of frost for fruit protection using artificial neural networks, Am. Soc. Agric. Eng. St. Joseph, MI, ASAE Pap. (2003) 3–3075.

B. A. Smith, R. W. McClendon, G. Hoogenboom, Improving air temperature prediction with artificial neural networks, Int. J. Comput. Intell. 3 (2006) 179–186.

S. Salcedo-Sanz, R. C. Deo, L. Carro-Calvo, B. Saavedra-Moreno, Monthly prediction of air temperature in Australia and New Zealand with machine learning algorithms, Theor. Appl. Climatol. 125 (2016) 13–25.

A. Azad, H. Kashi, S. Farzin, V. P. Singh, O. Kisi, H. Karami, H. Sanikhani, Novel approaches for air temperature prediction: A comparison of four hybrid evolutionary fuzzy models, Meteorol. Appl. 27 (2020) e1817.

Y. Wang, Y. Bai, L. Yang, H. Li, Short time air temperature prediction using pattern approximate matching, Energy Build. 244 (2021) 111036.

M. Moeini, A. Shojaeizadeh, M. Geza, Supervised Machine Learning for Estimation of Total Suspended Solids in Urban Watersheds, Water 13 (2021) 147.

P. Kumar, S. F. Shah, M. A. Uqaili, L. Kumar, R. F. Zafar, Forecasting of Drought: A Case Study of Water-Stressed Region of Pakistan, Atmosphere (Basel). 12 (2021) 1248.

L.Mukunayi, J. BanokaNsona, Kakule K, Kanda N, Impact of Mbujimayi Airport on Groundfill Cavities in Bipemba Community, The International Journal of Engineering and Science, 8(2019) 53-61

E. Fix and J. Hodges, "Discriminatory Analysis: Nonparametric Discrimination: Consistency Properties," 4, 1951.

T. M. Cover and P. E. Hart, "Nearest Neighbor Pattern Classification," IEEE Trans. Inform. Theory, vol. IT-13, pp. 21-27, 1967.

Y. Hamamoto, S. Uchimura, and S. Tomita, "A Bootstrap Technique for Nearest Neighbor Classifier Design," IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, vol. 19, no. 1, pp. 73-79, 1997.

E. Alpaydin, "Voting Over Multiple Condensed Nearest Neoghbors," Artificial Intelligence Review, vol. 11, pp. 115-132,1997.

K. Q. Weinberger and L. K. Saul, "Distance Metric Learning for Large Margin Nearest Neighbor Classification," Journal of Machine Learning Research, vol. 10, pp. 207-244, 2009.

A. Kataria and M. D. Singh, "A Review of Data Classification Using K-Nearest Neighbour Algorithm," International Journal of Emerging Technology and Advanced Engineering, vol. 3, no. 6,pp. 354-360, 2013.

N. Bhatia and A. Vandana, "Survey of Nearest NeighborTechniques," (IJCSIS) International Journal of Computer Science and Information Security, vol. 8, no. 2, pp. 302-305, 2010.

A. B. A. Hassant, "Visual Speech Recognition," in Speech Technologies, I. Ipsic, Ed. Rijeka: InTech - Open Access Publisher, 2011, vol. 2, ch. 14.

A. B. A. Hassanat, "Visual Passwords Using Automatic Lip Reading," International Journal of Sciences: Basic and Applied Research (IJSBAR), vol. 13, no. 1, pp. 218-231, 2014.

P. Kumar P., Artificial Neural Network Based Numerical Solution of Ordinary Differential Equations, Thesis, National institute of technology, 2012

Unlukara, A.; Kurunc, A.; Cemek, B. Green Long Pepper Growth under Dierent Saline and Water Regime Conditions and Usability ofWater Consumption in Plant Salt Tolerance. J. Agric. Sci. 2015,21,167–176.

Bruin, H.; Trigo, I. A New Method to Estimate Reference Crop Evapotranspiration from Geostationary Satellite Imagery: Practical Considerations. Water 2019, 11, 382. [CrossRef]

Witten, I.H.; Frank, E. Data Mining: Practical Machine Learning Tools and Techniques, 2nd ed.; Morgan Kaufmann Series in Data Management Systems:Washington,DC,USA, 2005.

Quinlan, J. Simplifying decision trees. Int. J. Man-Mach. Stud. 1987, 27, 221–23


Full Text: PDF

Article Metrics

Abstract View : 151 times
PDF Download : 36 times

DOI: 10.56327/ijiscs.v7i2.1526

Refbacks

  • There are currently no refbacks.