Detecting SMS Spam Using Natural Language Processing

Uday, Saiful Basher; Ifti, Ibna Yousuf; Chowdhury, Mukit; Akther, Tania

BUBTLR Home
→
Faculty of Engineering & Applied Sciences
→
Computer Science and Engineering
→
Internship Report
→
2023
→
View Item

dc.contributor.author	Uday, Saiful Basher
dc.contributor.author	Ifti, Ibna Yousuf
dc.contributor.author	Chowdhury, Mukit
dc.contributor.author	Akther, Tania
dc.date.accessioned	2023-08-13T07:29:41Z
dc.date.available	2023-08-13T07:29:41Z
dc.date.issued	2023-06
dc.identifier.uri	http://103.15.140.189/handle/123456789/173
dc.description	Internship Report	en_US
dc.description.abstract	Spam is any unsolicited communication sent in bulk. Spam messages often come in the form of harmless promotional messages. But sometimes spam is a fraudulent or malicious scam. This project proposes a spam message detection system based on Natural Language Processing method. As a part of data preprocessing: data cleaning, tokenization, stop-word removing are applied on the text dataset. Five machine learning algorithms (K-Nearest Neighbors, Random Forest, Logistic Regression, Naive Bayes, Support Vector Classifier, and Decision Tree) are trained with a dataset consisting of 11,572 English sentences. Exploratory Data Analysis is also applied on the dataset to analyze total spam and ham texts. After testing all the algorithms by Voting and Term Frequency-Inverse Document Frequency, the proposed system achieved the highest accuracy of 96% for Multinomial Naive Bayes model. A simple web based application is developed using Python pickle library and Stream lit Python library to test the whole system and it works properly.	en_US
dc.language.iso	en_US	en_US
dc.publisher	Department of CSE, BUBT	en_US
dc.subject	Detecting SMS Spam	en_US
dc.subject	CSE	en_US
dc.subject	Natural Language Processing	en_US
dc.title	Detecting SMS Spam Using Natural Language Processing	en_US
dc.type	Other	en_US