Dhaka University Repository

Ensemble Learning Algorithms for Classification Tasks in Natural Language Processing (NLP)

Show simple item record

dc.contributor.author Hossain, Afzal
dc.date.accessioned 2025-04-20T05:05:41Z
dc.date.available 2025-04-20T05:05:41Z
dc.date.issued 2025-04-20
dc.identifier.uri http://reposit.library.du.ac.bd:8080/xmlui/xmlui/handle/123456789/4110
dc.description This thesis is submitted for the degree of Master of Philosophy. en_US
dc.description.abstract Natural Language Processing (NLP) encompasses a multitude of practical applications, including Information Retrieval, Information Extraction, Machine Translation, Text Simplification, Sentiment Analysis, Text Summarization, Spam Filtering, Auto-prediction, Auto-correction, Speech Recognition, Question Answering, and Natural Language Generation. Many of these applications are essentially classification tasks, which can be performed by machine learning models. Ensemble techniques within machine learning involve combining multiple models to improve predictive performance compared to individual models. This thesis explores the application of ensemble learning techniques to improve classification performance in NLP tasks. Various ensemble learning techniques, including bagging, boosting, random forest, and voting, are explored and experimented with. For each ensemble method, common base models, such as Support Vector Machines (SVM), Naive Bayes, Decision Trees, and K-Nearest Neighbor (KNN), are employed. Various evaluation metrics commonly used in NLP classification tasks are used, including accuracy, precision, recall, F1-score, and time complexity of the algorithms. The findings of the thesis suggest that ensemble methods, especially boosting, generally perform better than traditional machine learning methods for NLP classification tasks. The thesis also describes the modification of two ensemble models – firstly, majority voting is modified for the situation when a tie occurs, and secondly, bagging is modified with a different type of sampling. Both of these methods result in improved performances in the datasets. Overall, the research work provides a comprehensive overview of ensemble learning algorithms and their applications in improving classification performance in NLP tasks, backed by theoretical discussions, case studies, and experimental results. en_US
dc.language.iso en en_US
dc.publisher © University of Dhaka en_US
dc.title Ensemble Learning Algorithms for Classification Tasks in Natural Language Processing (NLP) en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account