WideNoise Dumps for Research Purposes

About

For research purposes, we offer interested parties a dataset of the WideNoise database in the form of SQL dumps. Before you get access to the dataset, you have to sign our license agreement and send it via fax to our office. Additionally, we ask you to subscribe to the WideNoise-Research mailing list. Upon receipt of your faxed license agreement, we will approve the subscription request, and the welcome mail will contain instructions on how to access the dataset.

On this page you can download the dumps as compressed tar archives. A README describing the format of the files is contained in each archive. Please note that the easiest way to work with the dumps is to load them into a MySQL database. Detailed information on the table structure can be found further down on this page.

We are very interested in results obtained with the help of this dataset, so please inform us about your publications. To cite this data in publications, please use the following reference:

Martin Becker, Saverio Caminiti, Donato Fiorella, Louise Francis, Pietro Gravino, Mordechai (Muki) Haklay, Andreas Hotho, Vittorio Loreto, Juergen Mueller, Ferdinando Ricchiuti, Vito Servedio, Alina Sîrbu, and Francesca Tria: Awareness and Learning in Participatory Noise Sensing. PLOS ONE, 8(12):e81638, 2013. DOI: 10.1371/journal.pone.0081638. Download: PDF.

If you want to refer to the system, please use the following publication:

Martin Becker, Juergen Mueller, Andreas Hotho, and Gerd Stumme: A Generic Platform for Ubiquitous and Subjective Data. 1st International Workshop on Pervasive Urban Crowdsensing Architecture and Applications, PUCAA 2013, Zurich, Switzerland - September 9, 2013. Proceedings, pp. 1175-1182, ACM, New York, NY, USA, 2013. DOI: 10.1145/2494091.2499776. Download: PDF.

Downloads

File                                Size      Description
2015-01-29.data_widenoise.sql.gz    2.40 MB   WideNoise dataset from January 29th, 2015.
WideNoise_License_Agreement.pdf     526 KB    WideNoise license agreement (version 2015-01-30).

Dataset Description

The datasets were created with the mysqldump command of a MySQL database. Each dump file contains the CREATE statements for its tables together with the INSERT statements that load the data into the database.
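As a rough illustration of this layout, the following Python sketch streams a gzip-compressed dump and counts its CREATE TABLE and INSERT statements before anything is loaded into MySQL. The function name `summarize_dump` is hypothetical, and the sketch assumes that each statement starts at the beginning of a line, which is how mysqldump usually writes them.

```python
import gzip
from collections import Counter


def summarize_dump(path):
    """Count CREATE TABLE and INSERT INTO lines in a gzip-compressed SQL dump.

    Assumption: each statement starts at the beginning of a line.
    """
    counts = Counter()
    # Text mode with errors="replace" keeps the scan going on unexpected bytes.
    with gzip.open(path, "rt", encoding="utf-8", errors="replace") as handle:
        for line in handle:
            if line.startswith("CREATE TABLE"):
                counts["create_table"] += 1
            elif line.startswith("INSERT INTO"):
                counts["insert"] += 1
    return counts
```

Calling `summarize_dump("2015-01-29.data_widenoise.sql.gz")` on the downloaded file would return the two counts. Because mysqldump may pack many rows into one INSERT statement, the INSERT count is not the number of data points.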

The dataset dump is contained in the file listed above, which you can download after signing the license agreement.

The WideNoise table contains a set of common columns that store information shared across our system, as described below.

Name                      Type                  Description
id                        bigint(20) unsigned   The data point's identifier.
meta_user_id              varchar(64)           The hashed unique user identifier. Cells contain NULL if the user has no user account at our web application.
meta_device_id            varchar(64)           The hashed unique device identifier.
meta_timestamp_recorded   datetime              Date and time when the data point was recorded on the smartphone.
meta_timestamp_received   datetime              Date and time when the data point was received at our server.
geo_lon                   double                The longitude coordinate of the data point's location.
geo_lat                   double                The latitude coordinate of the data point's location.
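To illustrate how these common columns might be used, the Python sketch below computes the delay between recording and reception of a data point and checks whether it belongs to a registered user. It assumes that each row is available as a dictionary keyed by column name and that the datetime values are strings in MySQL's `YYYY-MM-DD HH:MM:SS` format (some database drivers return datetime objects instead). The function names are hypothetical.

```python
from datetime import datetime

# MySQL's textual DATETIME format, as it appears in a dump file.
MYSQL_DATETIME = "%Y-%m-%d %H:%M:%S"


def upload_delay_seconds(row):
    """Return seconds between recording on the phone and reception at the server."""
    recorded = datetime.strptime(row["meta_timestamp_recorded"], MYSQL_DATETIME)
    received = datetime.strptime(row["meta_timestamp_received"], MYSQL_DATETIME)
    return (received - recorded).total_seconds()


def has_account(row):
    """Return True if the data point is linked to a web application account."""
    # meta_user_id is NULL (None in Python) for users without an account.
    return row["meta_user_id"] is not None
```

The two timestamps come from different clocks (smartphone and server), so small or even negative delays may occur and should be treated with care.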

The WideNoise dump has the following additional data columns that contain the actual measurement:

Name                            Type                  Description
id                              bigint(20) unsigned   The data point's identifier.
data_average                    double                The average dB(A) value shown to the user for the recording.
data_user_estimate              double                The dB(A) value guessed by the user during recording. Cells contain NULL if the user did not use the guess game feature.
data_duration                   double                The duration of the recording in seconds. The default duration of a recording is 5 seconds and can be extended to 10 or 15 seconds.
data_perception_feeling         double                How lovely the sampled noise was. Ranges from -1 (love) to +1 (hate).
data_perception_disturbance     double                How disturbing the sampled noise was. Ranges from -1 (calm) to +1 (hectic).
data_perception_isolation       double                How crowded the environment was during the recording. Ranges from -1 (alone) to +1 (social).
data_perception_artificiality   double                How artificial the recorded noise was. Ranges from -1 (nature) to +1 (man-made).
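As a minimal sketch of how the measurement columns could be interpreted, the Python code below compares a user's guess with the measured average and maps a perception value to the nearer end of its scale. It assumes rows are available as dictionaries keyed by column name and that perception values are not NULL. The function names and the "neutral" label for a value of exactly 0 are illustrative assumptions, not part of the dataset.

```python
# Labels for the -1 and +1 ends of each perception scale, taken from the table.
PERCEPTION_POLES = {
    "data_perception_feeling": ("love", "hate"),
    "data_perception_disturbance": ("calm", "hectic"),
    "data_perception_isolation": ("alone", "social"),
    "data_perception_artificiality": ("nature", "man-made"),
}


def estimate_error(row):
    """Return the user's guess minus the measured average in dB(A).

    Returns None if the user did not use the guess game feature.
    """
    if row["data_user_estimate"] is None:
        return None
    return row["data_user_estimate"] - row["data_average"]


def nearest_pole(column, value):
    """Map a perception value in [-1, +1] to the label of the nearer scale end."""
    low_label, high_label = PERCEPTION_POLES[column]
    if value < 0:
        return low_label
    if value > 0:
        return high_label
    return "neutral"  # assumption: 0 is read as the midpoint of the scale
```

A positive result from `estimate_error` means the user guessed louder than the measured average, and a negative result means the guess was quieter.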