Modeling Zero Inflation and Over-Dispersion in Domestic Package Insurance Claims Portfolio: A Case of Madison Insurance Company-Kenya
International Journal of Data Science and Analysis
Volume 6, Issue 5, October 2020, Pages: 137-144
Received: Sep. 29, 2020;
Accepted: Oct. 14, 2020;
Published: Oct. 21, 2020
Views 58 Downloads 39
Polycarp Nyabuto, Department of Statistics and Actuarial Science, Jomo Kenyatta University of Agriculture and Technology, Nairobi, Kenya
Anthony Wanjoya, Department of Statistics and Actuarial Science, Dedan Kimathi University of Technology, Nyeri, Kenya
Antony Ngunyi, Department of Statistics and Actuarial Science, Dedan Kimathi University of Technology, Nyeri, Kenya
The standard Poisson distribution is widely used as a mechanism for regression modeling of count data outcomes. However, the suitability of this modeling technique is only limited to equi-dispersed count data outcomes. This is due to the fact that this modeling technique does not take into account the problems associated with over dispersion and excess zeros in many data sets as with insurance claims data. The study objective is to model domestic package insurance claims frequency using zero inflated and hurdle models since insurance portfolios are characterized by the non-occurrence of claims over a given time interval. This non-occurrence of claims over a given time interval usually leads to the Zero-Inflation and Dispersion associated with insurance claims data. The study consequently evaluates the performance of the Poisson, Zero Inflated Poisson (ZIP) and Hurdle Poisson (HP) models in determining the model that best models the domestic package insurance claims data. This is then used to estimate, predict and determine the heterogeneity of occurrence of the aforementioned insurance claims. The statistical Hosmer-Lemeshow tests is used to define the suitability of the fitted model to estimate the zero-inflation and over-dispersion characteristic of the data. To determine the presence of outliers and the distribution of residuals, the Residual Pearson and Deviance statistics are used. Data on a number of claims for domestic package insurance policy from Madison Insurance ltd, Kenya spanning from 2014 to 2018 (261 weeks) is used in the study.
Modeling Zero Inflation and Over-Dispersion in Domestic Package Insurance Claims Portfolio: A Case of Madison Insurance Company-Kenya, International Journal of Data Science and Analysis.
Vol. 6, No. 5,
2020, pp. 137-144.
Copyright © 2020 Authors retain the copyright of this article.
This article is an open access article distributed under the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/
) which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Alicja, Wolny-Dominiak. (2013). Zero-inflated claim count modeling and testing–a case study. Ekonometria, 1 (39), 144-151. ISSN 1507-3866.
Yogita, S. W., & Kamalja, K. K. (2017). Modeling Auto Insurance Claims in Singapore. Sri Lankan Journal of Applied statistics, 18 (2); 105–118. doi: 10.4038/sljastats.v18i2.7957.
Insurance Regulatory Authority. (2017). Insurance Annual Report. Nairobi: Insurance Regulatory Authority. https://www.ira.go.ke.
Edward, W. F., Gee, Lee. & Lu, Yang. (2016). Multivariate Frequency-Severity Regression models in Insurance. Department of Statistics, University of Wisconsin-Madison, 1300 University Avenue, Madison, WI 53706, USA.
Feldman, J. A., & Robert, L. B. (2005). Risk and Insurance. Education and Examination Committee for the Society of Actuaries. https://www.soa.org/globalassets/assets/files/edu/P-21-05.pdf.
Kwame, G. A., ELVIS, D., & Agbodah, K. (2014). Probability Modeling and Simulation of Insurance Claims in Ghana. Global Journal of Commerce and Management Perspective, 3 (5); 41-49. ISSN 2319-7285.
Yulia, R., Noriszura, I., & Saiful, H. J. (2013). Estimation of Claim Cost Data Using Zero Adjusted Gamma and Inverse Gaussian Regression Models. Journal of Mathematics and Statistics, 9 (3); 186-192. doi: 10.3844/jmssp.2013.186.192.
Vytaras, B., & Andreas, K. (2015). Measuring Severity and Tail Risk of Norwegian Fire Insurance Claims. North American Actuarial Journal, 20 (1); 1-16. doi: 10.1080/10920277.2015.1062784.
Joseph, N. W., & Christophe, P. (2011). On the Zero-Inflated Count Models with Application to Modeling Annual trends in Incidences of some Occupational Allergic Diseases in France. Journal of Data Science 9 (2011); 639-659.
Rotimi, F. A., Oyindamola B. Y., & Ayoola S. A. (2018). Modeling excess zeros in count data with application to Antenatal Care Utilization. International Journal of Statistics and Probability, 7 (3); 22-35. Doi: 105539/ijsp.v7n3p22.
Marjan, Q. (2019). On the Violation of Claims with Excess Zeros in Liability Insurance: A comparative study. Open access journal, 7 (3); 1-17.
Evgenii, V. G., & Elena, A. M. (2017). Modern Claim frequency and severity models: An application to the Russian motor and own damage insurance market. Cogent Economics & Finance, 5 (1). doi: 10.1080/23322039.2017.1311097.
Sakthivel, K. M., & Rajitha, C. S. (2017). A Comparative Study of Zero-Inflated, Hurdle Models with Artificial Neural Network in Claim Count Modeling. International Journal of Statistics and Systems, 12 (2); 265-276. ISSN 0973-2675.
Asmussen, S., & Albrecher, H. (2010). Ruin Probabilities (2Nd Edition). Advanced series on statistical science & applied probability, 14 (2); World Scientific Publishing Co. ISBN-13: 978-9813203617.
Staub, k., & Winkelmann, R. (2013). Consistent estimation of zero inflated count models. Health Economics, (22); 673-686. doi: 10.1002/hec.2844.
Klugman, S. A., Panjer, H. H., & Willmot, G. E. (2008). Loss Models: From Data to Decisions (Thirded.) John Wiley & Sons, Inc. ISBN: 978-1-119-52378-9.
Cheng Tian. (2018). Hurdle Models in Non-Life Insurance. Foculty of Mathematics and Physics, Charles University. Thesis Id: 188550.