Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

Machine learning classification of significant tornadoes and hail in the U.S. using ERA5 proximity soundings

Machine learning classification of significant tornadoes and hail in the U.S. using ERA5... <jats:title>Abstract</jats:title><jats:p>Previous studies have identified environmental characteristics that skillfully discriminate between severe and significant-severe weather events, but they have largely been limited by sample size and/or population of predictor variables. Given the heightened societal impacts of significant-severe weather, this topic was revisited using over 150 000 ERA5 reanalysis-derived vertical profiles extracted at the grid-point nearest—and just prior to—tornado and hail reports during the period 1996–2019. Profiles were quality-controlled and used to calculate 84 variables. Several machine learning classification algorithms were trained, tested, and cross-validated on these data to assess skill in predicting severe or significant-severe reports for tornadoes and hail. Random forest classification outperformed all tested methods as measured by cross-validated critical success index scores and area under the receiver operating characteristic curve values. In addition, random forest classification was found to be more reliable than other methods and exhibited negligible frequency bias. The top three most important random forest classification variables for tornadoes were wind speed at 500 hPa, wind speed at 850 hPa, and 0–500-m storm-relative helicity. For hail, storm-relative helicity in the 3–6 km and -10 to -30 °C layers, along with 0–6-km bulk wind shear, were found to be most important. A game theoretic approach was used to help explain the output of the random forest classifiers and establish critical feature thresholds for operational nowcasting and forecasting. A use case of spatial applicability of the random forest model is also presented, demonstrating the potential utility for operational forecasting. Overall, this research supports a growing number of weather and climate studies finding admirable skill in random forest classification applications.</jats:p> http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png Weather and Forecasting CrossRef

Machine learning classification of significant tornadoes and hail in the U.S. using ERA5 proximity soundings

Weather and ForecastingOct 7, 2021

Machine learning classification of significant tornadoes and hail in the U.S. using ERA5 proximity soundings


Abstract

<jats:title>Abstract</jats:title><jats:p>Previous studies have identified environmental characteristics that skillfully discriminate between severe and significant-severe weather events, but they have largely been limited by sample size and/or population of predictor variables. Given the heightened societal impacts of significant-severe weather, this topic was revisited using over 150 000 ERA5 reanalysis-derived vertical profiles extracted at the grid-point nearest—and just prior to—tornado and hail reports during the period 1996–2019. Profiles were quality-controlled and used to calculate 84 variables. Several machine learning classification algorithms were trained, tested, and cross-validated on these data to assess skill in predicting severe or significant-severe reports for tornadoes and hail. Random forest classification outperformed all tested methods as measured by cross-validated critical success index scores and area under the receiver operating characteristic curve values. In addition, random forest classification was found to be more reliable than other methods and exhibited negligible frequency bias. The top three most important random forest classification variables for tornadoes were wind speed at 500 hPa, wind speed at 850 hPa, and 0–500-m storm-relative helicity. For hail, storm-relative helicity in the 3–6 km and -10 to -30 °C layers, along with 0–6-km bulk wind shear, were found to be most important. A game theoretic approach was used to help explain the output of the random forest classifiers and establish critical feature thresholds for operational nowcasting and forecasting. A use case of spatial applicability of the random forest model is also presented, demonstrating the potential utility for operational forecasting. Overall, this research supports a growing number of weather and climate studies finding admirable skill in random forest classification applications.</jats:p>

Loading next page...
 
/lp/crossref/machine-learning-classification-of-significant-tornadoes-and-hail-in-4A0YKJW1mX
Publisher
CrossRef
ISSN
0882-8156
DOI
10.1175/waf-d-21-0056.1
Publisher site
See Article on Publisher Site

Abstract

<jats:title>Abstract</jats:title><jats:p>Previous studies have identified environmental characteristics that skillfully discriminate between severe and significant-severe weather events, but they have largely been limited by sample size and/or population of predictor variables. Given the heightened societal impacts of significant-severe weather, this topic was revisited using over 150 000 ERA5 reanalysis-derived vertical profiles extracted at the grid-point nearest—and just prior to—tornado and hail reports during the period 1996–2019. Profiles were quality-controlled and used to calculate 84 variables. Several machine learning classification algorithms were trained, tested, and cross-validated on these data to assess skill in predicting severe or significant-severe reports for tornadoes and hail. Random forest classification outperformed all tested methods as measured by cross-validated critical success index scores and area under the receiver operating characteristic curve values. In addition, random forest classification was found to be more reliable than other methods and exhibited negligible frequency bias. The top three most important random forest classification variables for tornadoes were wind speed at 500 hPa, wind speed at 850 hPa, and 0–500-m storm-relative helicity. For hail, storm-relative helicity in the 3–6 km and -10 to -30 °C layers, along with 0–6-km bulk wind shear, were found to be most important. A game theoretic approach was used to help explain the output of the random forest classifiers and establish critical feature thresholds for operational nowcasting and forecasting. A use case of spatial applicability of the random forest model is also presented, demonstrating the potential utility for operational forecasting. Overall, this research supports a growing number of weather and climate studies finding admirable skill in random forest classification applications.</jats:p>

Journal

Weather and ForecastingCrossRef

Published: Oct 7, 2021

References