BUBT Library Repository

Detecting SMS Spam Using Natural Language Processing


dc.contributor.author Uday, Saiful Basher
dc.contributor.author Ifti, Ibna Yousuf
dc.contributor.author Chowdhury, Mukit
dc.contributor.author Akther, Tania
dc.date.accessioned 2023-08-13T07:29:41Z
dc.date.available 2023-08-13T07:29:41Z
dc.date.issued 2023-06
dc.identifier.uri http://103.15.140.189/handle/123456789/173
dc.description Internship Report en_US
dc.description.abstract Spam is any unsolicited communication sent in bulk. Spam messages often take the form of harmless promotional messages, but some are fraudulent or malicious scams. This project proposes a spam message detection system based on Natural Language Processing methods. During data preprocessing, data cleaning, tokenization, and stop-word removal are applied to the text dataset. Six machine learning algorithms (K-Nearest Neighbors, Random Forest, Logistic Regression, Naive Bayes, Support Vector Classifier, and Decision Tree) are trained on a dataset of 11,572 English sentences. Exploratory Data Analysis is also applied to the dataset to examine the totals of spam and ham texts. After all the algorithms were tested with Voting and Term Frequency-Inverse Document Frequency, the proposed system achieved its highest accuracy, 96%, with the Multinomial Naive Bayes model. A simple web-based application was developed using the Python pickle library and the Streamlit Python library to test the whole system, and it works properly. en_US
dc.language.iso en_US en_US
dc.publisher Department of CSE, BUBT en_US
dc.subject Detecting SMS Spam en_US
dc.subject CSE en_US
dc.subject Natural Language Processing en_US
dc.title Detecting SMS Spam Using Natural Language Processing en_US
dc.type Other en_US
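
The pipeline named in the abstract (cleaning, tokenization, stop-word removal, TF-IDF weighting, Multinomial Naive Bayes) can be illustrated with a minimal sketch. This is not the authors' code: the stop-word list, the six-message toy dataset, and every function name below are assumptions made for illustration, and the sketch uses only the Python standard library. The voting step, the pickle serialization, and the Streamlit front end are omitted.

```python
import math
import re
from collections import Counter, defaultdict

# Tiny illustrative stop-word list (an assumption; the report's list is not given).
STOP_WORDS = {"a", "an", "the", "to", "is", "are", "you", "your",
              "for", "at", "of", "in", "and", "on", "we", "i"}

def preprocess(text):
    """Clean (lowercase, letters only), tokenize, and drop stop words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [t for t in tokens if t not in STOP_WORDS]

def fit_idf(docs):
    """Smoothed inverse document frequency for every training term."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    return {t: math.log((1 + n) / (1 + c)) + 1 for t, c in df.items()}

def tfidf(doc, idf):
    """TF-IDF weights for one tokenized document; unseen terms are ignored."""
    counts = Counter(t for t in doc if t in idf)
    return {t: c * idf[t] for t, c in counts.items()}

def train_nb(docs, labels, idf, alpha=1.0):
    """Multinomial Naive Bayes over TF-IDF weights with additive smoothing."""
    class_counts = Counter(labels)
    class_totals = defaultdict(float)
    term_totals = defaultdict(lambda: defaultdict(float))
    for doc, label in zip(docs, labels):
        for term, weight in tfidf(doc, idf).items():
            term_totals[label][term] += weight
            class_totals[label] += weight
    model = {}
    for label, count in class_counts.items():
        log_prior = math.log(count / len(labels))
        denom = class_totals[label] + alpha * len(idf)
        log_like = {t: math.log((term_totals[label][t] + alpha) / denom)
                    for t in idf}
        model[label] = (log_prior, log_like)
    return model

def predict(text, idf, model):
    """Return the label with the highest log-posterior score."""
    weights = tfidf(preprocess(text), idf)
    scores = {label: log_prior + sum(w * log_like[t] for t, w in weights.items())
              for label, (log_prior, log_like) in model.items()}
    return max(scores, key=scores.get)

# Hypothetical toy data, not drawn from the report's 11,572-sentence dataset.
TRAIN = [
    ("Win a free prize now, claim your cash reward", "spam"),
    ("Free entry: text WIN to claim your prize", "spam"),
    ("Urgent! Your account won cash, call now", "spam"),
    ("Are we still meeting for lunch today", "ham"),
    ("I will call you after the meeting", "ham"),
    ("See you at home tonight", "ham"),
]

docs = [preprocess(text) for text, _ in TRAIN]
labels = [label for _, label in TRAIN]
idf = fit_idf(docs)
model = train_nb(docs, labels, idf)

print(predict("Claim your free cash prize", idf, model))
print(predict("Lunch meeting today", idf, model))
```

In practice a library such as scikit-learn would likely replace the hand-written TF-IDF and Naive Bayes functions; they are spelled out here only to make each step visible.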

