dynamic detection of data truthfulness with vulgarity removal and social media analysis using big datas

Sundhari M,Aravindhan.K,Mano S,Silambarasan.M

Published in International Journal of Advanced Research in Computer Science Engineering and Information Technology

ISSN: 2321-3337          Impact Factor:1.521         Volume:6         Issue:3         Year: 22 April,2024         Pages:1907-1911

International Journal of Advanced Research in Computer Science Engineering and Information Technology

Abstract

Social media's surge in popularity made it possible to gather a lot of organically occurring data on people's conversations with one another. Social media conversations can frequently yield a number of characteristics, including the text content, the sender and recipient, the date and time, and the locations of the sender and recipient. The wealth of data enables researchers to look at human behaviour from a variety of angles. However, when evaluating the components of human behaviour, the majority of research simply look at one dimension. As everyone knows, there are a lot of rumours on social media, and the information is also unreliable. Based on user input, we are eliminating rumours from posts on our prototype social media. We also exclude vulgar phrases from data posts on our prototype social media. We also examine the content that the public posts on social media that expresses their dissatisfaction, frustration, and protests. In order to effectively analyze data, we use big data to filter the content and compare it with the specified keywords. Together with the obscene keywords, we define the collection of protesting keywords. User inputs are sent to big data for efficient filtering and comparison. Our project will delete the post from our prototype social media if the terms match the trained keywords. Our social media prototype was created with a Java user interface and a MySQL backend database. For the purpose of filtering, the server stores each taught term.

Kewords

Vulgar phrases, big data, Filtering

Reference

[1] es Servi, Senior Member, IEEE and Sara Beth Elson,” A Mathematical Approach to Gauging Influence by Identifying Shifts in the Emotions of Social Media Users” December 2014. [2] Vikram Krishnamurthy, Fellow, IEEE and H. Vincent Poor, Fellow, IEEE “A Tutorial on Interactive Sensing in Social Networks” March 2014. [3] SitaramAsur, Bernardo A. Huberman, “Predicting the Future with Social Media”. [4] Miaomiao Wen, Diyi Yang, Carolyn Penstein Rosé, “Sentiment Analysis in MOOC Discussion Forums: What does it tell us?”. [5] Mohamed M. Mostafa, “More than words: Social networks’ text mining for consumer brand sentiments”. [6] O. Banerjee, L. E. Ghaoui, and A. dAspremont. Model selection through sparse maximum likelihood estimation for multivariate gaussian or binary data. Journal of Machine learning research, 9(Mar):485–516, 2008. [7] S. Bhuta and U. Doshi. A review of techniques for sentiment analysis of twitter data. In Proc. Int Issues and Challenges in Intelligent Computing Techniques (ICICT) Conf, pages 583–591, Feb. 2014. [8] J. Bian, Y. Yang, H. Zhang, and T.-S. Chua. Multimedia summarization for social events in microblog stream. IEEE Transactions o multimedia, 17(2):216–228, 2015. [9] P. Bui, D. Rajan, B. Abdul-Wahid, J. Izaguirre, and D. Thain. Work queue+ python: A framework for scalable scientific ensemble applications. In Workshop on python for high performance and scientific computing at sc11, 2011. [10] P.-T. Chen, F. Chen, and Z. Qian. Road traffic congestion monitoring in social media with hinge-loss markov random fields. In Data Mining (ICDM), 2014 IEEE International Conference on, pages 80–89. IEEE, 2014. [11] X. L. Dong, L. Berti-Equille, and D. Srivastava. Integrating conflicting data: the role of source dependence. In Proceedings of the VLDB Endowment, pages 550–561, 2009. [12] X. X. et al. Towards confidence in the truth: A bootstrapping based truth discovery approach. In Proceedings of the 22th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining - KDD ’16, 2016. [13] R. Farkas, V. Vincze, G. Mora, J. Csirik, and G. Szarvas. The coal- 2010 shared task: Learning to detect hedges and their scope in natural language text. In In Proceedings of the Fourteenth Conference on Computational Natural Language Learning., 2010. [14] R. Feldman and M. Taqqu. A practical guide to heavy tails: statistical techniques and applications. Springer Science & Business Media, 1998. [15] A. Galland, S. Abiteboul, A. Marian, and P. Senellart. Corroborating information from disagreeing views. In In Proc. of the ACM International Conference on Web Search and Data Mining (WSDM’10), pages 131–140, 2010.