Cross-platform hate speech detection using an attention-enhanced BiLSTM model
Hate speech is rapidly spreading across digital platforms, appearing in diverse forms driven by regional, cultural, and linguistic differences. This growing trend presents serious challenges to social harmony and online safety. Existing hate speech detection models often fall short because they rely...
Saved in:
| Main Authors: | , , , , |
|---|---|
| Format: | Article |
| Language: | en |
| Published: |
PUBETA SINGLE MEMBER P.C.
2025
|
| Online Access: | http://eprints.utem.edu.my/id/eprint/29536/2/028383112202523212866.pdf http://eprints.utem.edu.my/id/eprint/29536/ https://etasr.com/index.php/ETASR/article/view/13249 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1858062973354901504 |
|---|---|
| author | Hussain, Muzammil Sharif, Waqas Faheem, Muhammad Rehan Alsarhan, Yazeed Elsalamony, Hany A. |
| author_facet | Hussain, Muzammil Sharif, Waqas Faheem, Muhammad Rehan Alsarhan, Yazeed Elsalamony, Hany A. |
| author_sort | Hussain, Muzammil |
| building | UTEM Library |
| collection | Institutional Repository |
| content_provider | Universiti Teknikal Malaysia Melaka |
| content_source | UTEM Institutional Repository |
| continent | Asia |
| country | Malaysia |
| description | Hate speech is rapidly spreading across digital platforms, appearing in diverse forms driven by regional, cultural, and linguistic differences. This growing trend presents serious challenges to social harmony and online safety. Existing hate speech detection models often fall short because they rely on limited and homogeneous datasets, making them less effective in real-world, culturally diverse settings. Handling large-scale, diverse datasets adds notable complexity to capturing contextual nuances, as different populations and cultures demonstrate unique language patterns and expressions. This study addresses the necessity for a more universal solution by proposing a deep learning model trained on an extensive and diverse dataset comprising 0.842 million samples collected from various digital platforms. The approach combines a Bidirectional Long Short-Term Memory (BiLSTM) model with a self-attention mechanism to capture contextual depth. Various data embedding techniques were used to assess their impact, along with data resampling and standard Natural Language Processing (NLP) pre-processing steps. The proposed model achieved 0.93 accuracy with an F1-score of 0.92, outperforming several baseline and state-of-the-art models. This work provides a comprehensive and scalable framework for the detection of hate speech across various online platforms. |
| format | Article |
| id | my.utem.eprints-29536 |
| institution | Universiti Teknikal Malaysia Melaka |
| language | en |
| publishDate | 2025 |
| publisher | PUBETA SINGLE MEMBER P.C. |
| record_format | eprints |
| spelling | my.utem.eprints-295362026-02-23T01:41:02Z http://eprints.utem.edu.my/id/eprint/29536/ Cross-platform hate speech detection using an attention-enhanced BiLSTM model Hussain, Muzammil Sharif, Waqas Faheem, Muhammad Rehan Alsarhan, Yazeed Elsalamony, Hany A. Hate speech is rapidly spreading across digital platforms, appearing in diverse forms driven by regional, cultural, and linguistic differences. This growing trend presents serious challenges to social harmony and online safety. Existing hate speech detection models often fall short because they rely on limited and homogeneous datasets, making them less effective in real-world, culturally diverse settings. Handling large-scale, diverse datasets adds notable complexity to capturing contextual nuances, as different populations and cultures demonstrate unique language patterns and expressions. This study addresses the necessity for a more universal solution by proposing a deep learning model trained on an extensive and diverse dataset comprising 0.842 million samples collected from various digital platforms. The approach combines a Bidirectional Long Short-Term Memory (BiLSTM) model with a self-attention mechanism to capture contextual depth. Various data embedding techniques were used to assess their impact, along with data resampling and standard Natural Language Processing (NLP) pre-processing steps. The proposed model achieved 0.93 accuracy with an F1-score of 0.92, outperforming several baseline and state-of-the-art models. This work provides a comprehensive and scalable framework for the detection of hate speech across various online platforms. PUBETA SINGLE MEMBER P.C. 2025 Article PeerReviewed text en http://eprints.utem.edu.my/id/eprint/29536/2/028383112202523212866.pdf Hussain, Muzammil and Sharif, Waqas and Faheem, Muhammad Rehan and Alsarhan, Yazeed and Elsalamony, Hany A. (2025) Cross-platform hate speech detection using an attention-enhanced BiLSTM model. Engineering, Technology and Applied Science Research, 15 (6). pp. 29779-29786. ISSN 1792-8036 https://etasr.com/index.php/ETASR/article/view/13249 10.48084/etasr.13249 |
| spellingShingle | Hussain, Muzammil Sharif, Waqas Faheem, Muhammad Rehan Alsarhan, Yazeed Elsalamony, Hany A. Cross-platform hate speech detection using an attention-enhanced BiLSTM model |
| title | Cross-platform hate speech detection using an attention-enhanced BiLSTM model |
| title_full | Cross-platform hate speech detection using an attention-enhanced BiLSTM model |
| title_fullStr | Cross-platform hate speech detection using an attention-enhanced BiLSTM model |
| title_full_unstemmed | Cross-platform hate speech detection using an attention-enhanced BiLSTM model |
| title_short | Cross-platform hate speech detection using an attention-enhanced BiLSTM model |
| title_sort | cross-platform hate speech detection using an attention-enhanced bilstm model |
| url | http://eprints.utem.edu.my/id/eprint/29536/2/028383112202523212866.pdf http://eprints.utem.edu.my/id/eprint/29536/ https://etasr.com/index.php/ETASR/article/view/13249 |
| url_provider | http://eprints.utem.edu.my/ |
