Print Email Facebook Twitter Enhanced Question Classification with Optimal Combination of Features Title Enhanced Question Classification with Optimal Combination of Features Author Loni, B. Contributor Loog, M. (mentor) Tax, D.M.J. (mentor) Faculty Electrical Engineering, Mathematics and Computer Science Department Media and Knowledge Engineering Programme Computer Science Date 2011-08-11 Abstract An important component of question answering systems is question classification. The task of question classification is to predict the entity type of the answer of a natural language question. Question classification is typically done using machine learning techniques. Different lexical, syntactical and semantic features can be extracted from a question. In this work we introduce two new semantic features which improve the accuracy of classification. Furthermore, we developed a weighed approach to optimally combine different features. We also applied Latent Semantic Analysis (LSA) technique to reduce the large feature space of questions to a much smaller and efficient feature space. We adopted two different classifiers: Back-Propagation Neural Networks (BPNN) and Support Vector Machines (SVM). We found that applying LSA on question classification can not only make the question classification more time efficient, but it also improves the classification accuracy by removing the redundant features. Furthermore, we discovered that when the original feature space is compact and efficient, its reduced space performs better than a large feature space with a rich set of features. In addition, we found that in the reduced feature space, BPNN performs better than SVMs which are widely used in question classification. We tested our proposed approaches on the well-known UIUC dataset and succeeded to achieve a new record on the accuracy of classification on this dataset. Subject Question ClassificationQuestion Answering SystemsLexical FeaturesSyntactical FeaturesSemantic FeaturesCombining FeaturesSupport Vector MachinesBack-Propagation Neural Networks To reference this document use: http://resolver.tudelft.nl/uuid:486154d6-b535-4a2a-b394-c2f7f3f43ac4 Part of collection Student theses Document type master thesis Rights (c) 2011 Loni, B. Files PDF Complete_Report.pdf 837.71 KB Close viewer /islandora/object/uuid:486154d6-b535-4a2a-b394-c2f7f3f43ac4/datastream/OBJ/view