Print Email Facebook Twitter ”How Much Data is Enough?” Learning curves for machine learning Title ”How Much Data is Enough?” Learning curves for machine learning: Investigating alternatives to the Levenberg-Marquardt algorithm for learning curve extrapolation Author Negru, Lucian (TU Delft Electrical Engineering, Mathematics and Computer Science) Contributor Krijthe, J.H. (mentor) Viering, T.J. (mentor) Yue, Z. (graduation committee) Degree granting institution Delft University of Technology Programme Computer Science and Engineering Project CSE3000 Research Project Date 2023-06-28 Abstract The conducted research explores fitting algorithms for learning curves. Learning curves describe how the performance of a machine learning model changes with the size of the training input. Therefore, fitting these learning curves and extrapolating them can help determine the required data set size for any desired performance. The paper specifically explores the Learning Curve Database (LCDB) and investigates alternative fitting algorithms to the employed Levenberg-Marquardt (LM). These algorithms are Gradient Descent and BFGS, and the paper aims to determine whether they are more suitable for fitting learning curves than LM. The algorithms were implemented, both in their default and optimised states, and the results were compared to LM. The results measured mean-squared error (MSE), L1 Loss, individual parametric model performance, and computation time. The findings showed that Gradient Descent is not a suitable alternative to LM; however, BFGS proved to be competitive, as it is practically identical in performance while being significantly faster than LM. The results answered the proposed aim of the paper and generated new questions that need answering. Further exploration of the BFGS algorithm and its application on learning curve fitting is recommended. Comparisons between the MSE distribution of LM and BFGS can be further explored, as well as comparisons on new parametric models, learners, and datasets. Subject Learning CurvesLCDBLevenberg-MarquardtGradient descentBFGS To reference this document use: http://resolver.tudelft.nl/uuid:5786dcac-2dee-453e-93ba-bae7aafe260e Part of collection Student theses Document type bachelor thesis Rights © 2023 Lucian Negru Files PDF CSE3000_Lucian_Negru_Final_.pdf 382.9 KB Close viewer /islandora/object/uuid:5786dcac-2dee-453e-93ba-bae7aafe260e/datastream/OBJ/view