Print Email Facebook Twitter Synthetic data generation for the optimization of strains in metabolic engineering using latent space representations derived from a Conditional Variational Autoencoder Title Synthetic data generation for the optimization of strains in metabolic engineering using latent space representations derived from a Conditional Variational Autoencoder Author Alwani, Neil (TU Delft Electrical Engineering, Mathematics and Computer Science) Contributor Abeel, T.E.P.M.F. (mentor) van Lent, P.H. (mentor) Hanjalic, A. (graduation committee) Degree granting institution Delft University of Technology Programme Computer Science and Engineering Project CSE3000 Research Project Date 2024-02-02 Abstract This study investigates the application of generative models for synthetic data generation in pathway optimization experiments within the field of metabolic engineering. Conditional Variational Autoencoders (CVAEs) use neural networks and latent variable distributions to generate new, plausible data samples. We adapt this model by conditioning the training process on the target flux to acquire increased performance.Additionally, a baseline model, namely Probabilistic Principal Component Analysis (PPCA), was selected for a comparative analysis to generate the underlying latent space to test the hypothesis that a type of Variational Autoencoder (VAE) can be used to learn a reduced-dimensional latent space for configurations of a kinetic pathway model. A dataset comprising 5000 hypothetical configurations of a kinetic pathway model was utilized to extract relationships between elements of a kinetic pathway.The results indicate that PPCA can model the underlying distribution of the dataset when the latent space is large enough. However, the traditional CVAE might struggle to capture the underlying distribution, resulting in an entangled latent space. The study suggests that an implementation of $\beta$-CVAE could lead to a better balance between parts of the objective function during training, offering improved prospects for generating cost-efficient kinetic pathways for combinatorial pathway optimization experiments. Subject Metabolic Flux AnalysisVariational Autoencoder (VAE)Probabilistic analysisPrincipal Component Analysis (PCA)Combinatorial Optimization To reference this document use: http://resolver.tudelft.nl/uuid:0f0fbe65-257d-491d-9fe5-5c1b3864dfd4 Part of collection Student theses Document type bachelor thesis Rights © 2024 Neil Alwani Files PDF RP_paper_Final_3_.pdf 959.45 KB Close viewer /islandora/object/uuid:0f0fbe65-257d-491d-9fe5-5c1b3864dfd4/datastream/OBJ/view