Automatic learning model to predict transparency indicators for effective management of public resources
Introduction: This article is the product of the application of predictive analytical models to measures and indicators of corruption risk, as researched by the Pascual Bravo University Institution and the Francisco José de Caldas District University in 2022 for doctoral research on the risk model for state transparency.
Problem: From measurements of institutional capacities, it is possible to generate anticorruption measurements, such is the case of the National AntiCorruption Index (INAC for its Spanish acronym). However, there are improvements to be made in the indicators and the need to incorporate more and better measurements that support this scourge that has long been manifested in Colombia.
Objective: The objective of this research is to emphasize the need to take advantage of open data, to generate measurements of state institutional corruption and, therefore, metrics that support its transparency and integrity based on predictive analytical models to generate predictions about government indices.
Methodology: First, the importance of generating measurements for the management of corruption cases is pointed out. Then, the application of predictive analytical models to predict scores of the National AntiCorruption Index is evidenced, finding the best model to finally make a forecast based on the identification of the relevant variables.
Results: The implementation of higher levels of digital government (egovernment) can significantly contribute to the fight against corruption and the generation of better public policies that support controls and sanctions. It not only facilitates citizen access to state services, but also allows for more open and agile access to data. This constantly promotes transparency at all levels and at all times. The Huber regression that has been implemented, its smaller penalty function, and its linear rather than quadratic growth, make it more suitable for dealing with outliers. This improves the error meter estimates and provides a good estimate of the National AntiCorruption Index score.
Conclusion: It is essential to establish a framework that anticipates the behavior of INAC and directs public policy efforts towards transparency and the prevention of corruption. In addition, it is necessary to develop objective metrics, indicators, indices and risk models that promote and evaluate transparency in the fight against corruption. This implies generating early warnings, applying sanctions, implementing controls and designing improvement plans to promote recommendations based on data that can trigger actions and take advantage of free access to public information to support citizens and the country.
Originality: A predictive analytical model based on machine learning was trained to predict the future behavior of the National AntiCorruption Index, with the aim of supporting roadmaps for entities and creating improvement actions for national entities, in which it becomes necessary to explore open government data to create new indicators and improve current ones.
Limitations: The regression models on the historical data of free access for the INACs were selected, because in terms of measurement it is what is already consolidated and available for the generation of transparency policies, access to information and the fight against corruption. The challenge for future work is to have more historical data, and to create more indicators that support measurements with the creation of improvement actions per entity that is reflected in numerical measurements.
How to Cite
License
Copyright (c) 2023 Ingeniería Solidaria

This work is licensed under a Creative Commons Attribution 4.0 International License.
Cession of rights and ethical commitment
As the author of the article, I declare that is an original unpublished work exclusively created by me, that it has not been submitted for simultaneous evaluation by another publication and that there is no impediment of any kind for concession of the rights provided for in this contract.
In this sense, I am committed to await the result of the evaluation by the journal Ingeniería Solidaría before considering its submission to another medium; in case the response by that publication is positive, additionally, I am committed to respond for any action involving claims, plagiarism or any other kind of claim that could be made by third parties.
At the same time, as the author or co-author, I declare that I am completely in agreement with the conditions presented in this work and that I cede all patrimonial rights, in other words, regarding reproduction, public communication, distribution, dissemination, transformation, making it available and all forms of exploitation of the work using any medium or procedure, during the term of the legal protection of the work and in every country in the world, to the Universidad Cooperativa de Colombia Press.
F.N.D. Piraquive, O.S. Martínez, E.V. Pérez, R.G. Crespo, “Knowledge management model for project management: KM+PMTIC,” in Construction projects: improvement strategies, quality management and potential challenges, K. Hall Ed. New York, NY, USA: Nova Publishers, 2017, pp. 2345.
IEEE Reference Guide, IEEE Periodicals, Piscataway, NJ, USA, 2018. [Online]. Available: https://ieeeauthorcenter.ieee.org/wpcontent/uploads/IEEEReferenceGuide.pdf, pp. 1014.
Transparency International, El ABC del CPI: cómo se calcula el índice de percepción de la corrupción (IPC), 2021. [Online]. Available: https://www.transparency.or/es/news/howcpiscoresarecalculated
Índice Nacional Anticorrupción, “El cambio es cero corrupción” Resultados generales. [Online]. Available: : https://tariatransparencia.gov.co/observatorioanticorrupcion/Paginas/mediciones.aspx
R. Vaitkus, A. Vasiliauskaite, “An Assessment of the Impact of Legal Regulation on Financial Security in OECD Countries,” Journal of Risk and Financial Management, vol. 15, no. 2, pp. 5592
A. Dwivedi, R. Pant, S. Pandey, M. Pande, A. Mittal, “Benefits of using big data sentiment analysis and soft computing techniques in Egovernance,” International Journal of Recent Technology and Engineering, vol. 8, no. 3. doi: https://doi.org/10.3390/jrfm15020086 , pp. 1732
Unidad de Inteligencia y Análisis Financiero, Compendio de Notas ALA/CFT, vol. III. Febrero 2017. [Online]. Available: https://www.uiaf.gov.co/salaprensa/publicaciones/notasalacft/compendionotasalacftvolumeniii, pp. 1783
A. Cardoni, E. Kiseleva, F. De Luca, “Continuous auditing and data mining for strategic risk control and anticorruption: Creating “fair” value in the digital age,” Business Strategy, and the Environment. doi: https://doi.org/10.1002/bse.2558 pp. 30723085 28(1)
R. Barone, D. Masciandaro, F. Schneider, “Corruption and money laundering: You scratch my back, I’ll scratch yours,” Metroeconomica, vol. 73, no. 1.
F. Decarolis, C. Giorgiantonio, “Corruption red flags in public procurement: new evidence from Italian calls for tenders,” EPJ Data Science, vol. 11, no.1, pp. 5592.
(n.d.). Data mining and its impact on business decision making in the context of crm.
E. Fletcher, C. Larkin, S. Corbet, “Countering money laundering and terrorist financing: A case for bitcoin regulation,” Research in International Business and Finance, pp. 5592
A. Purwanto, A. Emanuel, Data Analysis for Corruption Indications on Procurement of Goods and Services, In 3rd International Conference on Information and Communications Technology, ICOIACT 2020, pp. 5660
D. Kaufmann A. Kraay, M. Mastruzzi, Governance Matters IV: Indicadores de Gobernabilidad para 1996–2004. Draft, May 9, 2005. [Online]. Available: http://web.worldbank.org/archive/website00818/WEB/GOVMAT 2.HTM.
Secretaría de Transparencia y Anti corrupción, Indicadores de Transparencia y Anticorrupción. 2020. [Online]. Available: http://2020.anticorrupcion.gov.co/Paginas/IndicadoresdeTransparenciaold.aspx
K. Rabuzin, N. Modrusan, Prediction of public procurement corruption indices using machine learning methods. In Proceedings of the 11th International Joint Conference on Knowledge Discovery, Knowledge Engineering and Knowledge Management, 2019, pp. 333-340.
Z. Chang, V. Rusu, J. Kohler, “The Global Fund: why anticorruption, transparency and accountability matter,” Globalization and Health, vol. 17, no. 1.
B. Craenen, A. Eiben, Computational Intelligence. In Encyclopedia of Life Support Sciences, EOLSS Publishers Co.
P. Wilmott, Machine Learning: An Applied Mathematics Introduction. 2019. Panda Ohana Publishing.
A. Müller, S. Guido, Introduction to Machine Learning with Python: A Guide for Data Scientists. 2017. O´Reilly Media, pp. 5592
J. Kaplan, Artificial Intelligence. What everyone needs to know. Editorial Teell.
R. Kruse, C. Borgelt, C. Braune, S. Mostaghim, M. Steinbrecher, (n.d.), pp. 5592
B. Craenen, A. Eiben, Computational Intelligence. In Encyclopedia of Life Support Sciences, EOLSS Publishers Co, pp. 5592.
A. Engelbrecht, Computational Intelligence: An Introduction (2nd ed.), John Willey Sons, pp. 5592
S. Russell, P. Norwig, Artificial Intelligence: A Modern Approach, (2nd ed.), Prentice Hall, pp. 5592.
P. Wilmott, Machine Learning: An Applied Mathematics Introduction. Panda Ohana Publishing, pp. 5592
R. Gonzalez, R. Woods, Digital Image Processing. 4th Edition. Pearson Education Limited, pp. 5592.
S. Agarwal, “Trust as a missing link between quality of work life and subjective wellbeing,” Revista Ingeniería Solidaria, vol. 16, no. 1, Jan. 2020, pp. 5592
A. Müller, S. Guido, Introduction to Machine Learning with Python: A Guide for Data Scientists. 2017. O´Reilly Media, pp. 5592
S. Anand, V. Verma, A. Gupta Aggarwal, “Dimensional MultiRelease Software Reliability Modelling considering Fault Reduction Factor under imperfect debugging”, Revista Ingeniería Solidaria, vol. 14, no. 25, pp. 1–12, pp. 5592
Huber, J. Peter, “Robust estimation of a location parameter,” The Annals of Mathematical Statistics, vol. 35, no. 1, pp. 73–101, pp. 5592
Huber, J. Peter, Wiley series in probability and statistics. Robust statistics (huber/robust statistics), pp. 294–300, 1981, pp. 5592
Wang, Yue, B. Wang, C. Peng, X. Li, H. Yin, “Huber Regression Analysis with a SemiSupervised Method” Mathematics, vol. 10, no. 20, pp. 3734. Doi: https://doi.org/10.3390/math10203734, pp. 5592
S. Anand, V. Verma, A. Gupta Aggarwal, “Dimensional MultiRelease Software Reliability Modelling considering Fault Reduction Factor under imperfect debugging,” Revista Ingeniería Solidaria, vol. 14, no. 25, pp. 1–12, pp. 5592
H. Tong, “Functional linear regression with Huber loss,” Journal of Complexity, pp. 101696. doi: https://doi.org/10.1016/j.jco.2022.101696, pp. 5592
H. Tong, “Functional linear regression with Huber loss,” Journal of Complexity, pp. 101696. doi: https://doi.org/10.1016/j.jco.2022.101696, pp. 5592
YANHAAS, La gran encuesta, Elecciones 2022. Advanced market research. [Online]. Available: https://yanhaas.com/




