David Lu
Software Developer
Yung-Sheng Lu
Oct 14, 2016
@NCKU CSIE
Yu Zheng, Furui Liu, Hsun-Ping Hsieh @ Microsoft Research Asia, Beijing China
U-Air: When Urban Air Quality Inference Meets Big Data
U-Air: When Urban Air Quality Inference Meets Big Data
real-time and fine-grained air quality information
semi-supervised learning approach based on a co-training framework
spatial classifier
temporal classifier
Exp: Beijing and Shanghai
U-Air: When Urban Air Quality Inference Meets Big Data
expensive cost of building and maintaining
urban air quality varies by locations non-linearly
hard to portable and need a relatively long sensing period
identify discriminative features from a variety of data sources
incorporate heterogeneous features into a data analytics model effectively
the labeled data is insufficient
U-Air: When Urban Air Quality Inference Meets Big Data
Air quality index (AQI)
United States Environmental Protection Agency
Trajectory (τ)
Point of Interests (POI)
Road Network (RN)
Grid and Affecting Region (G)
Pre-processing data flow
receive spatial trajectories
stored in a trajectory database
geo-indexed
Learning data flow
spatio-temporal properties (ST)
semi-supervised learning approach based on co-training
temporal classifier (TC)
spatial classifier (SC)
individual model for each kind of pollutant
semi-supervised learning approach based on
co-training
temporal classifier (TC)
linear-chain conditional random field (CRF)
temporally-related features
spatial classifier (SC)
artificial neural network (ANN)
spatially-related features
Inference data flow
offline - spatially-related features
online - temporally-related features
conduct the inference every hour
U-Air: When Urban Air Quality Inference Meets Big Data
Parse data
http://nrl.iis.sinica.edu.tw/LASS/AirBox/
Classify data
Spatial-related and temporal-related
Other data resources
Framework
Follow this paper
By David Lu
U-Air: When Urban Air Quality Inference Meets Big Data - Intro