This research evaluates the performance of various count data models, including Poisson Regression (PR), Zero-Inflated Poisson Regression (ZIP), Zero-Truncated Poisson Regression (ZTP), Truncated Negative Binomial Poisson Regression (TNBP), and Negative Binomial Poisson Regression (NBP), using immunization coverage data from the National Primary Health Care Development Agency (NPHCDA). The study focuses on children under 12 months, assessing model fit using Likelihood Ratio (LR), Akaike Information Criterion (AIC), and Bayesian Information Criterion (BIC) criteria. Analysis conducted with STATA indicates that the Truncated Negative Binomial Poisson Regression (TNBP) outperformed other models in fit and efficiency. Both the ZTeeP and TNBP models demonstrated the best fit, with lower AIC (1959.107) and BIC (2037.649) values and higher Pseudo R-squared values (0.0677 for ZTP and 0.0590 for TNBP), compared to standard models. Age was identified as a significant predictor, negatively associated with immunization status, implying that older infants in the under-12-month category are less likely to receive all vaccinations. The ZTP model showed significant positive effects for antigens such as HepB0, OPV0, BCG, and Measles, with age having a significant negative association. The findings highlight the importance of selecting appropriate statistical models for accurate public health data analysis, enhancing decision-making in immunization programs.
Published in | International Journal of Statistical Distributions and Applications (Volume 10, Issue 4) |
DOI | 10.11648/j.ijsd.20241004.12 |
Page(s) | 89-100 |
Creative Commons |
This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited. |
Copyright |
Copyright © The Author(s), 2024. Published by Science Publishing Group |
PR, ZIP, TPR, NB, NBR, AIC, BIC, LR
[1] | Abdulkabir, M., Udokang, A. E., Raji, S. T., and Bello L. K. (2015), An empirical study of generalized linear model for count data Applied & Computational Mathematics 2015, 4:5 |
[2] | Agresti, A. (2007), An Introduction to Categorical Data Analysis, Second Edition, Wiley, Inc., New York. |
[3] | Atkins, D. C., Baldwin, S. A., Zheng, C., Gallop, R. J. and Neighbors, C., (2013). "A tutorial on count regression and zero-altered count models for longitudinal substance use data": Correction to Atkins et al. (2012). Psychology of Addictive Behaviors, 27(2), 379. |
[4] | Cameron, A. C., and Trivedi, P. K. (1998). Regression analysis of count data. New York: Cambridge University Press. |
[5] | Cameron, A. C.; Trivedi, P. K. (2013). Regression Analysis of Count Data Book (Second ed.). Cambridge University Press. ISBN 978-1-107-66727-3. |
[6] | Everitt B. S. (2002): The Cambridge Dictionary of Statistics (2nd ed.). New York: Cambridge University Press. |
[7] | Famoye, F. and Singh, K. P., (2006). Zero-inflated generalized Poisson regression model with an application to domestic violence data. Journal of Data Science, 4(1), pp. 117-130. |
[8] | Famoye, F., Wulu, J. T. and Singh, K. P., (2004). On the generalized Poisson regression model with an application to accident data. Journal of Data Science, 2(2004), pp. 287-295. |
[9] | Greene, C. B. (1994). Testing for over dispersion in Poisson and binomial regression models. Journal of the American Statistical Association 87(418): 451-57. |
[10] | Greene, W. H. (2003), Econometric Analysis; New York University, Upper Saddle River, New Jersey, Fifth Edition. |
[11] | Hardin J. W., and Hilbe, J. M. (2007) Generalized Linear Models and Extensions, Second Edition. 2nd ed. Stata Press; 2007. |
[12] | Ijomah, M. A., Biu, E. O., and Mgbeahurike, C. (2018) Assessing Logistic and Poisson Regression Model in Analyzing Count Data. International Journal of Applied Science and Mathematical Theory ISSN 2489-009X Vol. 4 No. 1 2018. |
[13] | Karazsia, B. T. and Van Dulmen, M. H., (2008). Regression models for count data: Illustrations using longitudinal predictors of childhood injury. Journal of pediatric psychology, 33(10), pp. 1076-1084. |
[14] | Lambert, D. (1992). Zero-inflated Poisson regression, with an application to defects in manufacturing. Technometrics, 34(1), pp. 1-14. |
[15] | Yang, S., Harlow, L. I., Puggioni, G., & Redding, C. A. (2017). A comparison of different methods of zero-inflated data analysis and an application in health surveys. Journal of Modern Applied Statistical Methods, 16(1), 518-543. |
[16] | Long, J. S., and Freese, J. (2005) Regression Models for Categorical Dependent Variables Using Stata, Second Edition. 2nd ed. Stata Press; 2005. |
[17] | Mamun, M. A. A., (2014). Zero-inflated regression models for count data: an application to under-5 deaths (Master thesis, Ball State University) Muncie, Indiana. |
[18] | Mwalili, S. M., Lesaffre, E. and Declerck, D., (2008). The zero-inflated negative binomial regression model with correction for misclassification: an example in caries research. Statistical methods in medical research, 17(2), pp. 123-139. |
[19] | Peng, J. (2013). Count Data Models for Injury Data from the National Health Interview Survey (NHIS) (Doctoral dissertation, The Ohio State University). |
[20] | Piza, E. L. (2012): Using Poisson and Negative Binomial Regression Models to Measure the Influence of Risk on Crime incident Counts. Rutgers Center on Public Security. |
[21] | Poston Jr, D. L. and McKibben, S. L., (2003). Using zero-inflated count regression models to estimate the fertility of US women. Journal of Modern Applied Statistical Methods, 2(2), p. 10. |
[22] | Shaw, D. (1988), 'On-site samples' regression problems of non-negative integers, truncation, and endogenous stratification', Journal of Econometrics, 37, 211-223; 2005. |
[23] | Slymen, D., Ayala, G., Arredondo, E., and Elder, J., (2006), A demonstration of modeling count data with an application to physical activity. Epidemiologic Perspectives and Innovations 3: 1-9. |
[24] | Yang, S., (2014). A comparison of different methods of zero-inflated data analysis and its application in health surveys (Master thesis, Rhode Island University). |
[25] | Yang, S., Harlow, L. L., Puggioni, G. and Redding, C. A., 2017. A Comparison of Different Methods of Zero-Inflated Data Analysis and an Application in Health Surveys. Journal of Modern Applied Statistical Methods, 16(1), p. 29. |
[26] | Zorn, C. J. W. (1996). Evaluating zero-inflated and hurdle Poisson specifications. Midwest Political Science Association, 1-16. |
[27] | Zuur, A., Ieno, E. N., Walker, N., Saveliev, A. A. and Smith, G. M., (2009). Mixed effects models and extensions in ecology with R. Springer Science & Business Media. |
APA Style
Aliyu, U., Usman, U., Bashar, A. U., Faruk, D. U. (2024). Performance Assessment of Some Count Data Models to Immunization Coverage Data. International Journal of Statistical Distributions and Applications, 10(4), 89-100. https://doi.org/10.11648/j.ijsd.20241004.12
ACS Style
Aliyu, U.; Usman, U.; Bashar, A. U.; Faruk, D. U. Performance Assessment of Some Count Data Models to Immunization Coverage Data. Int. J. Stat. Distrib. Appl. 2024, 10(4), 89-100. doi: 10.11648/j.ijsd.20241004.12
@article{10.11648/j.ijsd.20241004.12, author = {Usman Aliyu and Umar Usman and Abubakar Umar Bashar and Daha Umar Faruk}, title = {Performance Assessment of Some Count Data Models to Immunization Coverage Data }, journal = {International Journal of Statistical Distributions and Applications}, volume = {10}, number = {4}, pages = {89-100}, doi = {10.11648/j.ijsd.20241004.12}, url = {https://doi.org/10.11648/j.ijsd.20241004.12}, eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.ijsd.20241004.12}, abstract = {This research evaluates the performance of various count data models, including Poisson Regression (PR), Zero-Inflated Poisson Regression (ZIP), Zero-Truncated Poisson Regression (ZTP), Truncated Negative Binomial Poisson Regression (TNBP), and Negative Binomial Poisson Regression (NBP), using immunization coverage data from the National Primary Health Care Development Agency (NPHCDA). The study focuses on children under 12 months, assessing model fit using Likelihood Ratio (LR), Akaike Information Criterion (AIC), and Bayesian Information Criterion (BIC) criteria. Analysis conducted with STATA indicates that the Truncated Negative Binomial Poisson Regression (TNBP) outperformed other models in fit and efficiency. Both the ZTeeP and TNBP models demonstrated the best fit, with lower AIC (1959.107) and BIC (2037.649) values and higher Pseudo R-squared values (0.0677 for ZTP and 0.0590 for TNBP), compared to standard models. Age was identified as a significant predictor, negatively associated with immunization status, implying that older infants in the under-12-month category are less likely to receive all vaccinations. The ZTP model showed significant positive effects for antigens such as HepB0, OPV0, BCG, and Measles, with age having a significant negative association. The findings highlight the importance of selecting appropriate statistical models for accurate public health data analysis, enhancing decision-making in immunization programs. }, year = {2024} }
TY - JOUR T1 - Performance Assessment of Some Count Data Models to Immunization Coverage Data AU - Usman Aliyu AU - Umar Usman AU - Abubakar Umar Bashar AU - Daha Umar Faruk Y1 - 2024/11/22 PY - 2024 N1 - https://doi.org/10.11648/j.ijsd.20241004.12 DO - 10.11648/j.ijsd.20241004.12 T2 - International Journal of Statistical Distributions and Applications JF - International Journal of Statistical Distributions and Applications JO - International Journal of Statistical Distributions and Applications SP - 89 EP - 100 PB - Science Publishing Group SN - 2472-3509 UR - https://doi.org/10.11648/j.ijsd.20241004.12 AB - This research evaluates the performance of various count data models, including Poisson Regression (PR), Zero-Inflated Poisson Regression (ZIP), Zero-Truncated Poisson Regression (ZTP), Truncated Negative Binomial Poisson Regression (TNBP), and Negative Binomial Poisson Regression (NBP), using immunization coverage data from the National Primary Health Care Development Agency (NPHCDA). The study focuses on children under 12 months, assessing model fit using Likelihood Ratio (LR), Akaike Information Criterion (AIC), and Bayesian Information Criterion (BIC) criteria. Analysis conducted with STATA indicates that the Truncated Negative Binomial Poisson Regression (TNBP) outperformed other models in fit and efficiency. Both the ZTeeP and TNBP models demonstrated the best fit, with lower AIC (1959.107) and BIC (2037.649) values and higher Pseudo R-squared values (0.0677 for ZTP and 0.0590 for TNBP), compared to standard models. Age was identified as a significant predictor, negatively associated with immunization status, implying that older infants in the under-12-month category are less likely to receive all vaccinations. The ZTP model showed significant positive effects for antigens such as HepB0, OPV0, BCG, and Measles, with age having a significant negative association. The findings highlight the importance of selecting appropriate statistical models for accurate public health data analysis, enhancing decision-making in immunization programs. VL - 10 IS - 4 ER -