Software Cost Estimation Using Machine Learning Techniques
DOI:
https://doi.org/10.65405/.v10i37.686Keywords:
software cost estimation, machine learning, random forest, neural networks, regression, nasa93, feature importance.Abstract
Accurate software cost estimation (SCE) is paramount for effective project management, enabling stakeholders to allocate resources efficiently, manage timelines, and mitigate financial risks. Traditional parametric models, such as the Constructive Cost Model (COCOMO), often struggle to capture the intricate, nonlinear relationships and complex interactions inherent in modern software development projects. This research addresses this limitation by applying a suite of advanced machine learning (ML) techniques—including Linear Regression, Decision Trees, Random Forest, Support Vector Regression (SVR), and Artificial Neural Networks (ANN)—to the widely recognized NASA93 dataset.
The study meticulously details the entire workflow, encompassing data preprocessing, comprehensive feature analysis, robust model evaluation, and insightful visualization of results. Our findings unequivocally demonstrate that ensemble methods, particularly Random Forest, and neural network architectures significantly outperform conventional estimation approaches in terms of accuracy, reliability, and predictive power. This paper not Preprocessing, feature analysis, model evaluation, workflow modeling, and visualization are fully detailed. Results demonstrate that Random Forest and ANN significantly outperform traditional methods in accuracy and reliability. This paper not only highlights the superior performance of ML models but also provides an in-depth analysis of their underlying mechanisms, contributing to a clearer understanding of their applicability and impact in the domain of software cost estimation.
Downloads
References
Boehm, B. W. (1981). Software engineering economics. Prentice-Hall.
Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32.
Cortes, C., & Vapnik, V. (1995). Support-vector networks. Machine Learning, 20(3), 273–297.
Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep learning. MIT Press.
Li, H., Luo, Z., Li, Y., Wu, H., & Li, X. (2018). A hybrid model for software cost estimation using machine learning techniques. Information and Software Technology, 101, 1–12.
Menzies, T., Spinellis, D., Zimmermann, T., & Bird, C. (2007). The promise repository of empirical software engineering data. Empirical Software Engineering, 12(4), 365–371.
Shepperd, M., Menzies, T., Bird, C., & Zimmermann, T. (2012). Data mining for software engineering. IEEE Transactions on Software Engineering, 38(6), 1108–1119.
Gao, Q., Luo, Z., & Huang, J. (2019). Deep learning for software cost estimation. Information and Software Technology, 111, 1–12.
Shamim, M. M. I., et al. (2025). Advancement of artificial intelligence in cost estimation for project management success: A systematic review of machine learning, deep learning, regression, and hybrid models. MDPI. https://www.mdpi.com/2673-3951/6/2/35
Ranković, N. (2024). Recent advances in artificial intelligence in cost estimation in project management. Springer. https://link.springer.com/book/10.1007/978-3-031-76572-8
Singh, S. (2023). Software cost estimation: A literature review and current trends. IEEE Xplore. https://ieeexplore.ieee.org/document/10176495/
Uc-Cetina, V. (2023). Recent advances in software effort estimation using machine learning. arci. https://arxiv.org/pdf/2303.03482
Draz, M. M. (2024). Software cost estimation prediction using a convolutional neural network and particle swarm optimization. PMC. https://pmc.ncbi.nlm.nih.gov/articles/PMC11161658/
Hammann, D. (2024). Big data and machine learning in cost estimation. ScienceDirect. https://www.sciencedirect.com/science/article/abs/pii/S0925527323003699
Sadikoglu, E. (2025). Review of machine learning and artificial intelligence use for cost estimation in construction projects. ResearchGate. https://www.researchgate.net/publication/395122691_Review_of_Machine_Learning_and_Artificial_Intelligence_Use_for_Cost_Estimation_in_Construction_Projects
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.








