AirProbe Dumps for Research Purposes


For research purposes we offer a dataset of the AirProbe database in form of SQL dumps to interested people. Before you get access to the datasets, you have to sign up our license agreement and send it via fax to our office. Additionally, we would like to ask you to subscribe to the AirProbe-Research mailing list. Upon receipt of your faxed license agreement, we will approve the subscription request and in the welcome mail you will get instructions on how to access the dataset.

On this page you can download the dumps as compressed tar archive archive. A README describing the format of the files is contained in each archive. Please note that the easiest way to work with the dumps is by using a MySQL database. Detailed information on the table structure can be found below on this page.

We are quite interested in results you got with the help of this dataset. Therefore, please inform us about your publications. Concerning citing this data in publications, refer to the following reference:

Alina Sîrbu, Martin Becker, Saverio Caminiti, Bernard De Baets, Bart Elen, Louise Francis, Pietro Gravino, Andreas Hotho, Stefano Ingarra, Vittorio Loreto, Andrea Molino, Juergen Mueller, Jan Peters, Ferdinando Ricchiuti, Fabio Saracino, Vito D. P. Servedio, Gerd Stumme, Jan Theunis, Francesca Tria, and Joris Van den Bossche: Participatory Patterns in an International Air Quality Monitoring. PLOS ONE, 10(8):e0136763, 2015. DOI: 10.1371/journal.pone.0136763. Download: PDF.

If you want to refer to the system, please use the following publication:

Martin Becker, Juergen Mueller, Andreas Hotho, and Gerd Stumme: A Generic Platform for Ubiquitous and Subjective Data. 1st International Workshop on Pervasive Urban Crowdsensing Architecture and Applications, PUCAA 2013, Zurich, Switzerland - September 9, 2013. Proceedings, pp. 1175-1182, ACM, New York, NY, USA, 2013. DOI: 10.1145/2494091.2499776. Download: PDF.


File Size Description
2015-01-29.data_airprobe.sql.gz 525 MB AirProbe dataset from January 29th, 2015.
AirProbe_License_Agreement.pdf 530 KB AirProbe license agreement (version 2015-12-02)

Dataset Description

The datasets have been created using the mysqldump command of a MySQL database. The CREATE statements for the corresponding tables can be found in the corresponding dump files together with the INSERT statements which insert the data into the database.

We offer two dataset dumps contained in the following files that you can download above after signing the corresponding license agreement.

The AirProbe table contains a shared set of common columns that store shared information across our system as described bellow.

Name Type Description
id bigint(20) unsigned The data points identifier
meta_user_id varchar(64) The hashed unique user identifier. Cells contain NULL in case the user has no user account at our web application .
meta_device_id varchar(64) The hashed unique device identifier.
meta_timestamp_recorded datetime Date and time when the data point was recorded on the smartphone.
meta_timestamp_received datetime Date and time when the data point was received at our server.
geo_lon double The longitude coordinate of the data point's location.
geo_lat double The latitude coordinate of the data point's location.

The AirProbe dump has the following additional data columns that contain the actual measurement:

Name Type Description
data_co_1 double The measured value of the first CO sensor.
data_co_2 double The measured value of the second CO sensor.
data_co_3 double The measured value of the third CO sensor.
data_co_4 double The measured value of the fourth CO sensor.
data_no2_1 double The measured value of the first NO2 sensor.
data_no2_2 double The measured value of the second NO2 sensor.
data_voc_1 double The measured value of the VOC sensor.
data_o3_1 double The measured value of the O3 sensor.
data_bc_1 double The computed black carbon value based on the four CO, the two NO2, the VOC, and the O3 sensor.
data_temp_1 double The measured value of the temperature sensor.
data_hum_1 double The measured value of the humidity sensor.
data_sensorbox_mac char(17) The MAC address of the associated sensor box.