prediction of hate speech classification using machine learning with nlp

Bhavani.B,Antony Sibiya Varghese V,Sevanthi.R,Jeevitha.A

Published in International Journal of Advanced Research in Computer Science Engineering and Information Technology

ISSN: 2321-3337          Impact Factor:1.521         Volume:6         Issue:3         Year: 27 April,2023         Pages:1828-1837

International Journal of Advanced Research in Computer Science Engineering and Information Technology

Abstract

The everyday lives of many people have been deeply affected by digital media. Hate speech is a narrative intended to distract or mislead the audience. Because of a number of factors, including the rise of online social networks in recent years, hate speech has become more common in the online world. Users of online social networks may easily be affected by this hateful speech. Hate speech has become a social problem, sometimes spreading more rapidly than the truth. Individuals are unable to identify all instances of hate speech. Thus, it is necessary to employ a machine learning algorithm to automatically detect hate speech. The development of machine learning models involves the utilization of algorithms that distinguish between speech that is considered hate speech, hurtful speech, or neither. Among the various algorithms, the Gradient Boosting Algorithm yields the highest level of accuracy. As a result, it has been chosen for use in this project's launch. The Kaggle dataset employed to classify hate speech comprises characteristics such as tweet count, hate speech, offensiveness, neutrality, classification, and tweet content.

Kewords

Machine learning, Online, Hate speech, Algorithm

Reference

[1] Thomas Davidson,Dana Warmsle, Michael Macy,Ingmar Weber,” Automated Hate Speech Detection and the Problem of Offensive Language” in Proceedings of the international AAAI conference on web and social media 11 (1), 512-515, 2017 [2] Hajime Watanabe, Mondher Bouazizi, Tomoaki Ohtsuki ,” Hate Speech on Twitter: A Pragmatic Approach to Collect Hateful and Offensive Expressions and Perform Hate Speech Detection”in IEEE access 6, 13825-13835, 2018 [3] Georgios Rizos, Konstantin Hemker, Bjorn Schuller,” Augment to Prevent: Short- Text Data Augmentation in Deep Learning for Hate-Speech Classification” in the Proceedings of the 28th ACM internationalconference on information and knowledge management, 991-1000, 2019 [4] Amrutha, B R and Bindu, K R, “Detecting Hate Speech in Tweets Using Different Deep Neural Network Architectures in 2019 International Conference on Intelligent Computing and Control Systems (ICCS), 923-926, 2019 [9] Ching Seh Wu, Unnathi Bhandary,” Detection of Hate Speech in Videos Using Machine Learning” in 2020 International Conference on Computational Science and Computational Intelligence (CSCI), 585- 590, 2020 [5] Sindhu Abro1 , Sarang Shaikh2 , Zafar Ali ,” Automatic Hate Speech Detection using Machine Learning: A Comparative Study”in International Journal of Advanced Computer Science and Applications 11 (8), 2020 [6] Raghad Alshalan and Hend Al-Khalifa,” A Deep Learning Approach for Automatic Hate Speech Detection in the Saudi Twittersphere”in Applied Sciences 10 (23), 8614, 2020 [7]Zewdie Mossie, Jenq-Haur Wang,”Vulnerable community identification using hate speech detection on social media” in Information Processing & Management 57 (3), 102087, 2020 [8] Ching Seh Wu, Unnathi Bhandary,” Detection of Hate Speech in Videos Using Machine Learning” in 2020 International Conference on Computational Science and Computational Intelligence (CSCI), 585- 590, 2020 [10] Varsha Pathak , Manish Joshi , Prasad A. Joshi , Monica Mundada , Tanmay Joshi,” Using Machine Learning for Detection of Hate Speech and Offensive Code-Mixed Social Media text “ in arXiv preprint arXiv:2102.09866, 2021 [11] Binny Mathew, Punyajoy Saha, Seid Muhie Yimam, Chris Biemann, Pawan Goyal, Animesh Mukherjee,” HateXplain “A Benchmark Dataset for Explainable Hate Speech Detection” in Proceedings of the AAAI Conference on Artificial Intelligence 35 (17), 14867-14875, 2021 [12] Soumitra Ghosh , Asif Ekbal , Pushpak Bhattacharyya, Tista Saha, Alka Kumar, and Shikha Srivastava,”SEHC: A Benchmark Setup to Identify Online Hate Speech in English” in IEEE Transactions on Computational Social Systems, 2022