UniMC - Pubblicazioni Aperte Digitali

A dataset for automatic violence detection in videos

Bianculli, M; Falcionelli, N; Sernani, P; Tomassini, S; Contardo, P; Lombardi, M; Dragoni, AF

2020-01-01

Abstract

The automatic detection of violence and crimes in videos is gaining attention, specifically as a tool to relieve security officers and authorities of the need to watch hours of footage to identify events lasting a few seconds. So far, most of the available datasets have been composed of few clips, in low resolution, and often built on overly specific cases (e.g. hockey fights). While high-resolution datasets are emerging, there is still a need for datasets that test the robustness of violence detection techniques to false positives caused by behaviours that might resemble violent actions. To this end, we propose a dataset composed of 350 clips (MP4 video files, 1920 x 1080 pixels, 30 fps), labelled as non-violent (120 clips) when representing non-violent behaviours, and violent (230 clips) when representing violent behaviours. In particular, the non-violent clips include behaviours (hugs, claps, exulting, etc.) that can cause false positives in the violence detection task, due to their fast movements and similarity to violent behaviours. The clips were performed by non-professional actors, with 2 to 4 actors per clip. (C) 2020 The Authors. Published by Elsevier Inc.

Publication year: 2020
Publisher: Elsevier
DOI: https://dx.doi.org/10.1016/j.dib.2020.106587
Journal relevance: International
Scopus ID: 2-s2.0-85101896616
Web of Science ID: WOS:000600652300247
Document type: 01.01 Journal article

Files in this record:

File	Size	Format
Bianculli_datasetviolencedetection_2020.pdf (open access; type: publisher's version, i.e. the published version with the publisher's layout; license: Creative Commons)	715.85 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11393/302290
