Enhancing Hate Speech Detection in the Digital Age: A Novel Model Fusion Approach Leveraging a Comprehensive Dataset

Waqas Sharif, Saima Abdullah, Saman Iftikhar, Daniah Al-Madani, Shahzad Mumtaz

Research output: Contribution to journalArticlepeer-review

1 Downloads (Pure)

Abstract

In the era of digital communication, social media platforms have experienced exponential growth, becoming primary channels for information exchange. However, this surge has also amplified the rapid spread of hate speech, prompting extensive research efforts for effective mitigation. These efforts have prominently featured advanced natural language processing techniques, particularly emphasizing deep learning methods that have shown promising outcomes. This article presents a novel approach to address this pressing issue, combining a comprehensive dataset of 18 sources. It includes 0.45 million comments sourced from various digital platforms spanning different time frames. There were two models utilized to address the diversity in the data and leverage distinct strengths found within deep learning frameworks: CNN and BiLSTM with an attention mechanism. These models were tailored to handle specific subsets of the data, allowing for a more targeted approach. The unique outputs from both models were then fused into a unified model. This methodology outperformed recent models, showcasing enhanced generalization capabilities even when tested on the largest and most diverse dataset. Our model achieved an impressive accuracy of 89%, while maintaining a high precision of 0.88 and recall of 0.91.
Original languageEnglish
Pages (from-to)27225-27236
Number of pages12
JournalIEEE Access
Volume12
Early online date19 Feb 2024
DOIs
Publication statusPublished - 23 Feb 2024

Bibliographical note

The authors extend their appreciation to the Arab Open Uni-versity for funding this work through AOU research fund No.(AOUKSA-524008)

Keywords

  • BiLSTM
  • CNN
  • deep learning
  • Hate speech detection
  • model fusion
  • natural language processing

Fingerprint

Dive into the research topics of 'Enhancing Hate Speech Detection in the Digital Age: A Novel Model Fusion Approach Leveraging a Comprehensive Dataset'. Together they form a unique fingerprint.

Cite this