Economic Bulletin of the National Mining University

 

IssuesSectionsAuthorsKeywords

Article

Issue:2025 №4 (92)
Section:Economics of enterprise
UDK:004.932:330.4
DOI:https://doi.org/10.33271/ebdut/92.188
Article language:English
Pages:188-194
Title:Large-scale processing of big data with Python in modern business analytics
Author:Mshvidobadze T. I., Gori State University
Annotation:Methods. The paper is based on a comprehensive review of the impact of Big Data Analytics on modern business analytics and presents an innovative solution for efficient processing of this data in high-performance computing (HPC) environments. The study aims to demonstrate how the integration of Big Data technologies can qualitatively change strategic planning, risk management and operational optimization in a business environment. The analysis covers the dynamic landscape of modern business analytics, emphasizing its transformative power in obtaining critical insights from large and diverse data sets. Special attention is paid to the challenges associated with insufficient performance and general versatility of existing big data processing tools in HPC infrastructures. To address these problems, the study discusses PyCOMPS, a task-based programming model in Python, for the first time. The performance and efficiency of PyCOMPS are evaluated by applying it to implement a complex machine learning algorithm, namely Cascade SVM. Novelty. The novelty of the work lies in the integrated approach, which not only summarizes the multifaceted role of big data analytics (in particular, in customer-oriented initiatives and operational optimization), but also offers a specific, high-performance and flexible solution. PyCOMPS is positioned as an excellent answer to the problem of the lack of productive and universal tools for distributed big data processing in HPC. The results of the Cascade SVM implementation serve as empirical evidence of its advantages. Results. The study details the transformational path of big data analytics in modern business intelligence, confirming its crucial role in reducing risks and increasing operational efficiency. The main result is a demonstration of the high performance of PyCOMPS for the efficient development and execution of Big Data analytical tasks. The work concludes with practical recommendations for organizations seeking to maximize the potential of big data analytics in the data-driven era. Practical value. The work has high practical value for IT architects, data engineers and analysts. Discussion of the benefits PyCOMPS provides a direct toolkit for high-performance and efficient development of big data analytics in the environment of modern business intelligence systems. This solution provides better flexibility and performance compared to traditional HPC models, making complex data processing more accessible to developers using Python. 
Keywords:Big data, Business Intelligence, Data analytics, Modern Business, PyCOMPS
File of the article:EV20254_188-194.pdf
Literature:
  • 1. Abele, D., & D’Onofrio, S. (2020). Artificial intelligence - the big picture. Cognitive Computing: Theorie, Technik und Praxis, 31-65.
  • 2. Amela R., Ishii K., Morizawa R., 2020, Effi- cient development of high performance data analytics in Python, Future Generation Computer Systems, Volume 111. Pages 570-581. https://doi.org/10.1016/j.future.2019.09.051
  • 3. Alam, T. (2022). Blockchain cities: the futuristic cities driven by Blockchain, big data and internet of things. GeoJournal, 87(6), 5383-5412. https://doi.org/10.1007/s10708-021-10508-0
  • 4. Bharadiya, J.P. (2023). A comparative study of business intelligence and artificial intelligence with big data analytics. American Journal of Artificial Intelligence, 7(1), 24. https://doi.org/10.11648/j.ajai.20230701.14
  • 5. Chukwu, E., Adu-Baah, A., Niaz, M., Nwagwu, U., & Chukwu, M.U. (2023). Navigating ethical supply chains: the intersection of diplomatic management and theological ethics. International Journal of Multidiscipli- nary Sciences and Arts, 2(1), 127-139. https://doi.org/10.47709/ijmdsa.v2i1.2874
  • 6. Dekimpe, M.G. (2020). Retailing and retailing research in the age of big data analytics. International Journal of Research in Marketing, 37(1), 3-14. https://doi.org/10.1016/j.ijresmar.2019.09.001
  • 7. Figueira, P.T., Bravo, C.L., & López, J.L.R. (2020). Improving information security risk analysis by including threat-occurrence predictive models. Computers & Security, 88, 101609. https://doi.org/10.1016/j.cose.2019.101609
  • 8. Nguyen, D.K., Sermpinis, G., & Stasinakis, C. (2023). Big data, artificial intelligence and machine learning: A transformative symbiosis in favour of financial technology. European Financial Management, 29(2), 517-548. https://doi.org/10.1111/eufm.12365
  • 9. Pedregosa F., Varoquaux G., Gramfort A., Michel V., Thirion B., Grisel O., Blondel M., Prettenhofer P., Weiss R., Dubourg V., Vanderplas J., Passos A. 2011, Scikit-learn: machine learning in python. Learn. Res., pp. 2825-2830.
  • 10. Rosário, A.T., & Dias, J.C. (2023). How has data-driven marketing evolved: Challenges and opportunities with emerging technologies. International Journal of Information Management Data Insights, 3(2), 100203.
  • 11. Rawat, K.S., & Sood, S.K. (2021). Emerging trends and global scope of big data analytics: a scientometric analysis. Quality & Quantity, 55, 1371-1396.
  • 12. Shah, K., Patel, H., Sanghvi, D., & Shah, M. (2020). A comparative analysis of logistic regression, random forest and KNN models for the text classifica- tion. Augmented Human Research, 1-16. https://doi.org/10.1007/s41133-020-00032-0
  • 13. Turner V. The digital universe of opportunities: rich data and the increasing value of the internet of things, International Data Corporation (2014).
  • 14. Tejedor E., Becerra Y., Alomar G., Queralt A., Badia R.M., Torres J., Cortes T., Labarta J. P2017, COMPSs: parallel computational workflows in python. Int. High Perform. Appl., 31 (1), pp. 66-82. https://doi.org/10.1177/1094342015594678
  • 15. Xu, Y., Liu, H., & Long, Z. (2020). A distributed computing framework for wind speed big data forecasting on Apache Spark. Sustainable Energy Technologies and Assessments, 37, 100582.
  • 16. Zeebaree, S.R., Shukur, H.M., Haji, L.M., Zebari, R.R., Jacksi, K., & Abas, S.M. (2020). Characteristics and analysis of hadoop distributed systems. Technology Reports of Kansai University, 62(4), 1555-1564.