Print Email Facebook Twitter Extracting location context from transcripts Title Extracting location context from transcripts: a comparison of ELMo and TF-IDF Author Happel, David (TU Delft Electrical Engineering, Mathematics and Computer Science; TU Delft Pattern Recognition and Bioinformatics) Contributor Tax, D.M.J. (mentor) Loog, M. (mentor) Viering, T.J. (mentor) Makrodimitris, S. (mentor) Naseri Jahfari, A. (mentor) Degree granting institution Delft University of Technology Programme Computer Science and Engineering Project CSE3000 Research Project Date 2020-06-22 Abstract Using transcripts of the TV-series FRIENDS, this paper explores the problem of predicting the location in which a sentence was said. The research focuses on using feature extraction on the sentences, and training a logistic regression model on those features. Specifically looking at the differences in performance between using ELMo and TF-IDF for this feature extraction, achieving an accuracy rate of 58\% and 67\% respectively on a binary classification. The paper also explores the effect of several data cleaning techniques on the results. Git repository containing the source code used in the paper - https://github.com/David-Happel/scene-location-NLP Subject Natural Language ProcessingText ClassificationWord embeddingTF-IDFELMo To reference this document use: http://resolver.tudelft.nl/uuid:ad4e3624-4f39-4a64-a678-c232e3f8d7da Part of collection Student theses Document type bachelor thesis Rights © 2020 David Happel Files PDF Research_Paper.pdf 366.7 KB Close viewer /islandora/object/uuid:ad4e3624-4f39-4a64-a678-c232e3f8d7da/datastream/OBJ/view