<p dir="ltr"><b>DISCLAIMER: The license for this dataset is 'Restrictive License', but please refer to the original sources of the data for licensing information. We are only redistributing it within their limitation.</b></p><p dir="ltr"><b>Information</b></p><p>__________________________________________________________________________</p><p dir="ltr">This is the Air Quality Sensor Data Repository as published in the following work</p><p dir="ltr">https://www.arxiv.org/abs/2508.02724</p><p dir="ltr">The dataset is a zip file sized roughly 25GB. The unzipped data is roughly 70GB of only CSV and JSON data.</p><p dir="ltr">To abide by the original owners' licensing, we publish only the raw data and provide all code for preprocessing through the following repository:</p><p dir="ltr">https://github.com/YahiDar/AQ-SDR</p><p dir="ltr">Please check the documentation in the repository to further understand the dataset characteristics.</p><p dir="ltr">We also provide the modeling and machine learning aspect of the work through:</p><p dir="ltr"><a href="https://github.com/YahiDar/Veli" target="_blank"><u>https://github.com/YahiDar/Veli</u></a></p><p><br></p><p dir="ltr"><b>Licenses</b></p><p>__________________________________________________________________________</p><p dir="ltr">Each data source has a different license. Please make sure you are using the data appropriately as requested by the original provided.</p><p dir="ltr">KNMI Data (folder name: /EU_data/KNMI):</p><p dir="ltr">The original license is CC BY 4.0</p><p dir="ltr">as documented on their webpage: <a href="https://www.knmidata.nl/open-data" target="_blank"><u>https://www.knmidata.nl/open-data</u></a></p><p><br></p><p dir="ltr">LuchtMeetNet data (folder names: /EU_data/lucht_root and /EU_data/luchtmeetnet_csvs):</p><p dir="ltr">The original license is CC BY-ND 4.0</p><p dir="ltr">as documented on their webpage: <a href="https://www.luchtmeetnet.nl/informatie/download-data/open-data" target="_blank"><u>https://www.luchtmeetnet.nl/informatie/download-data/open-data</u></a></p><p><br></p><p dir="ltr">RIVM SamenMeten data (folder name: /EU_data/crowd_stations_root):</p><p dir="ltr">The original license is not specified, but it is open to use and redistribute.</p><p dir="ltr">as documented on their webpage: https://www.samenmeten.nl/international/OpenData</p><p><br></p><p><br></p><p dir="ltr">Sensor.Community data (folder name: /EU_data/sencom_hourly):</p><p dir="ltr">The original license is DbCL v1.0</p><p dir="ltr">as documented on their webpage: <a href="https://sensor.community/nl/" target="_blank"><u>https://sensor.community/nl/</u></a></p><p><br></p><p dir="ltr">Taiwan Ministry of Environment data (folder name: /out_of_distribution_downloaded/downloaded_ref):</p><p dir="ltr">The original license is The Open Government Data License, version 1.0</p><p dir="ltr">as documented on their webpage: <a href="https://data.gov.tw/license" target="_blank"><u>https://data.gov.tw/license</u></a></p><p><br></p><p dir="ltr">PM2.5 Open Data Portal - LASS (folder name: /out_of_distribution_downloaded/downloaded_lcs):</p><p dir="ltr">The original license is CC BY-NC-SA 4.0</p><p dir="ltr">as documented on their webpage: <a href="https://pm25.lass-net.org/" target="_blank"><u>https://pm25.lass-net.org/</u></a></p><p><br></p><p dir="ltr"><b>Acknowledgement</b></p><p>__________________________________________________________________________</p><p dir="ltr">We sincerely thank the Dutch government for supporting this research with the starter grant (startersbeurzen). We also thank the organizations and researchers who provide the open data to enable this research, including the Dutch National Institute for Public Health and the Environment (RIVM), the Dutch Royal Netherlands Meteorological Institute (KNMI), Dr. Ling-Jyh Chen in Taiwan Academia Sinica for the AirBox project, the Taiwan Ministry of Environment, the Sensor.Community platform, and the European Environmental Agency (EEA). We also thank the GGD Amsterdam and RIVM for providing information about how air quality sensor stations work in the Netherlands. We also thank the CREATE Lab at the Robotics Institute at Carnegie Mellon University for the technical support in building the air quality dashboard.</p><p><br></p>