Print Email Facebook Twitter Predicting DNA repair-deficient genotypes based on Cas9-induced DNA repair products Title Predicting DNA repair-deficient genotypes based on Cas9-induced DNA repair products Author Verkooijen, Paul (TU Delft Electrical Engineering, Mathematics and Computer Science) Contributor de Pinho Gonçalves, J.S. (mentor) Reinders, M.J.T. (graduation committee) Tijsterman, Marcel (graduation committee) Degree granting institution Delft University of Technology Date 2021-01-29 Abstract Double strand breaks are lesions to the DNA and can be fatal for cells. Therefore these breaks are repaired, primarily by one of the three major repair pathways. Two of these pathways are non-homologous end-joining (NHEJ) and theta-mediated end-joining (TMEJ). These pathways leave genetic alterations in their repair products, a form of DNA damage. DNA damage is linked to several diseases such as cancer. Understanding of these pathways is important and being able to recognize which pathways are active can be beneficial for research. In this work, repair products are used to predict repair-deficient genotypes using Cas9-induced repair products. Ku80 and PolQ deficient genotypes are used, impairing NHEJ and TMEJ respectively. The ability to recognize a repair-deficient genotype is tested using two predictive tasks. First statistical machine learning algorithms are used to predict the genotype where a repair product can be found. This is done by only using a single repair product as input. Secondly, a set of Cas9-induced repair products from a single cell culture is used to predict the genotype of that cell culture. Results show that when given a single repair products, models have difficulty predicting the correct genotype. However, results are modest and the best classifier achieved an AUC of 0.76. For predicting the genotype of a cell culture using multiple repair products of that culture showed really promising results. When predicting on cell cultures with breaks induced on a target site which the model has seen in the training data, results are near perfect. Predicting on unseen target sites shows that there is room for improvement but the best performing models showed an average AUC of 0.879 across target sites. A Results show that Cas9-induced repair products can be used to predict repair-deficient genotypes. Subject Machine Learningdouble strand breakrepair productspathway deficiency To reference this document use: http://resolver.tudelft.nl/uuid:46282f1b-68b0-4b94-8e65-c648684a2713 Part of collection Student theses Document type master thesis Rights © 2021 Paul Verkooijen Files PDF Thesis_Paul_Verkooijen.pdf 665.44 KB Close viewer /islandora/object/uuid:46282f1b-68b0-4b94-8e65-c648684a2713/datastream/OBJ/view