"Prediction of High-ozone Events Using GAM, SMOTE, and Tail Dependence Approaches in Texas (2005-2019)"

ABSTRACT We test two methods for ozone prediction in the El Paso (ELP) and Houston-Galveston-Brazoria (HGB) regions of Texas from 2005–2019: (1) a Generalized Additive Model (GAMs) with the Synthetic Minority Over-sampling TEchnique (SMOTE) and (2) a tail dependence regression approach. We also compare the feature selection capabilities of the tail dependence approach to other feature selection methods. We find that, generally, the GAM+SMOTE model outperformed the GAM-only model when predicting ozone values, particularly with regard to the above-threshold ozone values. We also find that the tail dependence approach is capable of predicting extreme ozone events, but algorithmic stability and configuration complexity can make this approach difficult to operationalize on a broad scale and that the selection of the threshold needs to be carefully considered. In addition, we find that the improvement of above-threshold MDA8 O3 prediction tends to come at the cost of below-threshold prediction, which is particularly important if MDA8 O3 trends are of interest. Finally, the feature selection via the tail dependence method performs comparably to other forms of machine learning-based feature selection and we find that there are multiple parameter sets that can predict MDA8 O3 with equal success.

Keywords: dependence; selection; tail dependence; prediction; ozone

Journal Title: Aerosol and Air Quality Research
Year Published: 2021

Link to full text (if available)

Share on Social Media: Sign Up to like & get
recommendations!
1

LAUSR

You are not signed in:

Sign Up!

Related content

More Information News Social Media Video Recommended