Venezuelan Insurance Clusters with Unsupervised Machine Learning

Luis David Lara Rodríguez(1), Jorge Aquino Olmos(2), Elzabeth López Meléndez(3),


(1) Polytechnic University of Puebla
(2) Latin American Technological University Online
(3) Technological University of Huejotzingo
Corresponding Author

Abstract


Insurance companies play an important role in a healthy economy, as they provide a security service to goods and people. The Venezuelan insurance market has faced great challenges in the last decades in a narrow environment, where knowing the closest competitors is of utmost importance. The public governing body that regulates the comparison between insurers has only made use of the premiums charged as a rating factor, however this governing body makes public an additional range of indicators, for this research we have selected five of these indicators from the last three years and contrast whether the premiums charged represent a unique rating characteristic. Several machine learning methods have been applied for this study, from dimensionality reduction techniques, unsupervised clustering and optimal clustering performance criteria; all of them have allowed us to discover the hidden patterns of the data and present a clustering that reaches 100% accuracy, of the same number of classes as the government entity; which reflect the naturalness of the data as opposed to the arbitrary one given by this same entity. From its derived groupings, it is possible to affirm that the parameter of premiums collected does not represent a determining grouping characteristic, allowing the Venezuelan insurance market to compare itself efficiently with its competitors.


Keywords


Machine Learning, PCA, Mixture of Gaussian, K-means, BIC, AIC, Insures.

References


C. Hernández Avendaño, "La industria aseguradora en el PIB de Colombia," Revista Faselcolda, vol. 189, pp. 2-10, 2023.

S. A. Guevara and G. B. Ruegeles, "Importancia de la Industria aseguradora en el encuentron y desarrollo económico del país.," pp. 19-25, 1985.

W. Mayorga, "2013. Un año en transacción," Revista Fasecolda, pp. 18-23, 2014.

G. B. d. Venezuela, 2022. [Online]. Available: https://www.sudeaseg.gob.ve/.

Á. d. C. Aragón Gallegos, S. I. Cerquín Silva, R. O. Escurra Yactayo and A. L. Roncalla Viena, "Segmentación de clientes para mejorar la experiencia de compra de productos electrónicos en Falabella," Creative Commons, Perú, 2023.

M. E. Porter, Estrategia competitiva: Técnicas para el análisis de los sectores industriales y de la competencia., México: Grupo Editorial Patria, 2015, pp. 68-73.

Z.-H. Zhou, Machine Learning, Australia-China: Springer, 2016.

M. Ravil, "Machine learning methods: An overview," Computer modelling and new technologies, vol. 19, pp. 14-29, 2015.

A. Hamza, I. Hamza Awad, N. Sulaiman Mohd, M. Aliyu and B. Abuagla, "Taxonomy of Machine Learning Algorithms to classify real- time Interactive applications," International Journal of Computer Networks and Wireless Communications 2, pp. 69-73, 2012.

T. Ayodele, "Types of machine learning algoritms," New advances in machine learning, vol. 3, pp. 19-48, 2010.

W. Barbakh, "Review of Clustering Algorithms. In: Non-Standard Parameter Adaptation for Exploratory Data Analysis," in Studies in Computational Intelligence, vol. 249, Springer, Berlin, Heidelberg, 2009, pp. 7-28.

F. N. Khan, Q. Fan, C. Lu and A. P. T. Lau, "An optical communication's perspective on machine learning and its applications," Journal of Lightwave Technology, vol. 37, no. 2, pp. 493-516, 2019.

E. Harki and Z. S. Rashid, "Analysis of factors that affect radiation dose level during interventional cardiology procedures using logistic regression," Revista Mexicana de Física, vol. 69, no. 3, pp. 1-8, 2023.

D. Kim, S.-H. Kim, T. Kim, B. B. Kang, M. Lee, W. Park, S. Ku, D. Kim, J. Kwon, H. Lee, J. Bae, Y.-L. Park,

K.-J. Cho and S. Jo, "Review of machine learning methods in soft robotics," Plos One, p. 16, 2021.

J. T. Mc. Coy and L. Auret, "Machine learning applications in minerals processing: A review," Minerals Engineering, vol. 132, pp. 95-109, 2019.

A. Sharma, A. Jain, P. Gupta and V. Chowdary, "Machine learning applications for precision agriculture: A comprehensive review," IEEE Access, vol. 9, pp. 4843-4873, 2020.

L. Patel, T. Shulka, X. Huang, D. W. Ussery and S. Wang, "Machine Learning Methods in Drug Discovery," Molecules, vol. 25 (22), no. 5277, 2020.

M. W. Libbrecth and S. W. Nonble, "Machine learning applications in genetics and genomics," Nature Reviews Genetics volume, vol. 16, pp. 321-332, 2015.

B. Abbasi and D. M. Goldenholz, "Machine learning applications in epilepsy," Epilepsia, vol. 60, no. 10, pp. 2037-2047, 2019.

K. Kourou, T. P. Exarchos, K. P. Exarchos, M. V. Karamouzis and D. I. Fotiadis, "Machine learning applications in cancer prognosis and prediction," Computational and structural biotechnology journal, vol. 13, pp. 8-17, 2015.

M. Field, N. Hardcastle, M. Jameson, N. Aherne and L. Holloway, "Machine learning applications in radiation oncology," Physics and Imaging in Radiation Oncology, vol. 19, pp. 13-24, 24 06 2021.

T. Kurita, "Principal component analysis (PCA).," Computer Vision: A Reference Guide, pp. 1-4, 2020.

J. MacQueen, "Classification and analysis of multivariate observations.," in 5th Berkeley Symp. Math. Statist. Probability., Los Angeles California, 1967.

J. Wang and S. Xialong, "An improved K-Means clustering algorithm," in 2011 IEEE 3rd International Conference on Communication Software and Networks, China, 2011.

J. Wu, "Cluster Analysis and K-means Clustering: An Introduction," in Advances in K-means Clustering, Berlin, Heidelberg, Springer, Berlin, Heidelberg, 2012, pp. 1-26.

E. Patel and D. S. Kushwaha, "Clustering Cloud Workloads: K-Means vs Gaussian Mixture Model," Procedia Computer Science, vol. 171, pp. 158-167, 2020.

A. Alam, M. Muqueem and S. Ahmad, "Comprehensive review on Clustering Techniques and its application on High Dimensional Data," IJCSNS International Journal of Computer Science and Network Security, vol. 21, no. 6, pp. 237-244, 2021.

C. Agiakloglou and A. Tsimpanos , "Evaluating the performance of AIC and BIC for selecting spatial econometric models," Journal of Spatial Econometrics, pp. 1-35, 2023.

X. Wang and Y. Xu, "An improved index for clustering validation based on Silhouette index and Calinski-Harabasz index," IOP Conference Series: Materials Science and Engineering, vol. 569, no. 5, pp. 1-6, 2019.

R. B. Pereira, A. Plastino, B. Zadrozny and L. H. C. Merschmann, "Correlation analysis of performance measures for multi-label classification," Information Processing & Management, vol. 54, no. 3, pp. 359-369, Mayo 2018.

P. Ferrando, U. Lorenzo and J. Muñiz, "Decalogue for the factor analysis of test items," Psicothema, pp. 7-17, 2022.

V. Nateski, "An overvies of the supervised machine learning methods," Horizon, pp. 51-62, 2017.

W. A. Barbakh, Y. Wu and C. Fyfe, "Review of Clustering Algorithms," in Studies in Computational Intelligence book series (SCI,volume 249), vol. 249, Berlin, Springer, Berlin, Heidelberg., 2009.

P. J. Hardin and J. M. Shumway, "Statistical significance and normalized confusion matrices," Photogrammetric engineering and remote sensing, vol. 63, no. 6, pp. 735-739, 1997.


Full Text: PDF

Article Metrics

Abstract View : 89 times
PDF Download : 26 times

DOI: 10.56327/jtksi.v7i1.1634

Refbacks

  • There are currently no refbacks.