Performance analysis and optimization of the Weather Research and Forecasting Model (WRF) advection schemes [presentation]
Sobhani, N., and D. Del Vento, 2017: Performance analysis and optimization of the Weather Research and Forecasting Model (WRF) advection schemes [presentation]. 97th AMS Annual Meeting, American Meteorological Society, Seattle, WA, US.
Times Viewed on OpenSky: 41 Times Downloaded on OpenSky: 2
The Weather Research and Forecasting (WRF) model is an open source mesoscale numerical weather prediction (NWP) system extensively used in atmospheric research, operational forecasting and educational settings. For profiling and identifying the bottlenecks of WRF code, we used Allinea MAP, TAU (Tuning and Analysis Utilities) and Intel Vtune Amplifier XE. The results show that advection is the most time consuming routine in the ARW (Advanced Research WRF) dynamic solver. For further investigating the hotspots, the advection code is extracted from the WRF Fortran code as a separate kernel. WRF model uses the Runge-Kutta (RK) 3rd order for integrating the time, and 2nd through 6th order advection schemes in both horizontal and vertical directions. This basic RK scheme will produce negative values and oscillations near sharp gradients due to the numerical dispersion errors. Positive definite and monotonic flux limiter schemes are used for reducing the negative values and spurious oscillations from the dispersion errors. We designed a testing framework for optimizing, compiling, profiling, and evaluating the kernel with different Fortran compilers including Intel, GNU, and PGI compilers. Several optimization techniques including exposing more vectorization and improving the cache hit rates are utilized on the advection modules and the outputs are verified. Reasonable speed-up (up to 4x) can be achieved on different compilers using the modified advection kernel. This talk won 1st place for Student Oral Presentation during the Third Symposium on High Performance Computing for Weather, Water, and Climate.