Title:
Abusive comment detection in Tamil using deep learning

Loading...
Thumbnail Image

Date

Journal Title

Journal ISSN

Volume Title

Publisher

Elsevier

Abstract

During the recent years, online social media have expanded in volume and coverage and have become a significant source of information for different groups of people. The comments posted on social media can be emotion-laden and hence can create an impact on mental health of an individual or a group of individuals. One such category of posts includes comments that are abusive or hateful in nature. The comments that spread hate and are abusive in nature usually target certain individuals or some specific communities. It is, therefore, very important to know about them and perhaps be able to detect such content in time. While there exist methods for automated detection of hate speech from posts in English language, there is relatively less research done on other low-resource languages, such as Tamil. This chapter presents an overview of research on detecting hate speech in low-resource languages and explores application of various deep learning models for the task. The abusive comments are classified in different categories: Homophobia, Xenophobia, Transphobic, Misandry, Misogyny, Counter-speech, and Hope speech, from Tamil and Tamil–English code-mixed language. Those comments that are not in the Tamil language are categorized as “Not-Tamil.” The following deep learning models: recurrent neural network, long-short term memory (LSTM), and bidirectional LSTM, are applied to the task. Experimental results are presented along with an analysis of the quality of results. © 2024 Elsevier Inc. All rights reserved.

Description

Citation

Collections

Endorsement

Review

Supplemented By

Referenced By