Unsupervised Detection of Violent Content in Arabic Social Media

Abstract

A monitoring system is proposed to detect violent content in Arabic social media. This is a new and challenging task due to the presence of various Arabic dialects in the social media and the non-violent context where violent words might be used. We proposed to use a probabilistic nonlinear dimensionality reduction technique called sparse Gaussian process latent variable model (SGPLVM) followed by k-means to separate violent from non-violent content. This framework does not require any labelled corpora for training. We show that violent and non-violent Arabic tweets are not separable using k-means in the original high dimensional space, however better results are achieved by clustering in low dimensional latent space of SGPLVM.

BibTeX key: dhinaharannagamalai2017unsupervised
entry type: article
year: 2017
month: 1-7
journal: Computer Science & Information Technology (CS & IT)
number: 4
volume: 7
Document: http://airccse.org/V7N66.html

BibSonomy

Unsupervised Detection of Violent Content in Arabic Social Media

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on