Huawei Mla L12 Model

0 views
Skip to first unread message

Ray Kowalewski

unread,
Aug 5, 2024, 2:57:05 PM8/5/24
to fecfietelmi
Thepublication marks the first time that employees of a Chinese technology company are the sole authors of a Nature paper, according to Nature Index. The paper, describing how to develop a precise and accurate global AI weather forecast system based on deep learning using 43 years of data, appeared in the prestigious journal on July 5, 2023.

Pangu-Weather is the first AI prediction model to demonstrate higher precision than traditional numerical weather forecast methods. The model allows a 10,000x improvement in prediction speed, reducing global weather prediction time to just seconds. The paper, titled "Accurate medium-range global weather forecasting with 3D neural networks" provides independent verifications of these capabilities.


Pangu-Weather challenges the previously held assumptions that the accuracy of AI weather forecast is inferior to traditional numerical forecasts. The model, developed by the HUAWEI CLOUD team, is the first AI prediction model with higher precision than traditional numerical prediction methods.


With the rapid development of computing power over the past 30 years, the accuracy of numerical weather forecast has improved dramatically, providing extreme disaster warning and climate change predictions. But the method remains relatively time-consuming. To improve prediction speeds, researchers have been exploring how to using deep learning methods. Still, the precision of AI-based forecasting for medium and long-term forecasts has remained inferior to numerical forecasts. AI has been mostly unable to predict extreme and unusual weather such as typhoons.


Every year, there are around 80 typhoons worldwide. In 2022, in China alone, the direct economic loss caused by typhoons was 5.42 billion yuan, according to the figures from China Ministry of Emergency Management. The earlier that warnings can be sent out, the easier and better it is to make adequate preparations.


Because of their speed, AI weather forecast models have been attractive but have lacked precision for two reasons. First, the existing AI meteorological forecast models are based on 2D neural networks, which cannot process uneven 3D meteorological data well. Second, medium-range weather forecast can suffer from cumulative forecast errors when the model is called too many times.


During scientific trials, Pangu-Weather model has demonstrated its higher precision compared to traditional numerical prediction methods for forecasts of 1 hour to 7 days, with a prediction speed gain of 10,000 times. The model can accurately predict in seconds fine-grained meteorological features including humidity, wind speed, temperature, and sea level pressure.


The model uses a 3D Earth-Specific Transformer (3DEST) architecture to process complex non-uniform 3D meteorological data. Using a hierarchical, temporal, aggregation strategy, the model was trained for different forecast intervals using 1 hour, 3-hour, 6- hour and 24-hour intervals. This resulted in a minimization of the quantity of iterations for predicting a meteorological condition at a specific time and a reduction in erroneous forecasts.


To train the model for specific time intervals, the researchers trained 100 epochs (cycles) using hourly samples of weather data from 1979-2021. Each of the sub-models that resulted required 16 days of training on 192 V100 graphics cards. Pangu-Weather Model can now complete 24-hour global weather forecasts in just 1.4 seconds on a V100 graphics card, a 10,000-time improvement compared with the traditional numerical prediction.


Explaining why the HUAWEI CLOUD AI team chose to focus on weather predictions, Dr. Tian Qi, Chief Scientist of HUAWEI CLOUD AI Field, an IEEE Fellow, and Academician of the International Eurasian Academy of Sciences, explained "Weather forecasting is one of the most important scenarios in the field of scientific computing because meteorological prediction is a very complex system, yet it is difficult to cover all aspects of mathematical and physical knowledge. We are therefore delighted that our research has been recognized by the Nature magazine. AI models can mine statistical laws of atmospheric evolution from massive data. At present, Pangu-Weather mainly completes the work of the forecast system, and its main ability is to predict the evolution of atmospheric states. Our ultimate goal is to build next-generation weather forecasting framework using AI technologies to strengthen the existing forecasting systems. "


Commenting on the significance and quality of the research by HUAWEI CLOUD, academic reviewers from Nature explained that not only is Pangu-Weather very easy to download and run, but that it executed quickly on even a desktop computer. "This means that anyone in the meteorological community can now run and test these models to their hearts' desire. What a great opportunity for the community to explore how well the model predicts specific phenomena. That's going to help with progress in the field." Another reviewer noted that "the results themselves are a significant step beyond previous results. This work will, in my opinion, make people reevaluate what forecasting models might look like in the future".


In May 2023, Typhoon Mawar caught the world's attention as the strongest tropical cyclone of the year thus far. According to the China Meteorological Administration, Pangu-Weather accurately predicted the trajectory of Typhoon Mawar five days before it changed course in the eastern waters of the islands of Taiwan.


[Shenzhen, China, August 3, 2023] Huawei Cloud's Pangu-Weather Model, an AI model for global weather forecasting, is now available on the website of the European Centre for Medium-Range Weather Forecasts (ECMWF) . On this website, global weather forecasters, meteorologists, weather enthusiasts, as well as the general public can now view Pangu-Weather's 10-day global weather forecasts for free. ECMWF also released a technical report titled The rise of data-driven weather forecasting which tested Pangu-Weather. The report includes a comparison between the forecasts made by Pangu-Weather and ones from ECMWF IFS (a leading global NWP system), from April to July this year. "The results are very promising, with comparable skill for both global metrics and extreme events, when verified against both the operational analysis and synoptic observations."


According to the report, the uptake of machine learning methods like Pangu-Weather could be a game-changer compared to the incremental and rather slow progress of traditional numerical weather prediction (NWP) methods. Before Pangu, forecast execution had been increasing by about one day per decade (according to the World Meteorological Organization, or WMO), which can be attributed to the high computational cost of running a forecast with standard NWP systems. ML models are poised to revolutionize weather forecasting with predictions that require much lower computational costs and that are equally or more accurate.


ECMWF is now running a series of data-driven forecasts as part of its operational suite. The ECMWF website shows the forecasts made by Pangu-Weather in six different types of charts, including: mean sea level pressure and 850 hPa wind speed, 500 hPa geopotential height and 850 hPa temperature, mean sea level pressure and 200 hPa wind, temperature and geopotential at various pressure levels, 2 m temperature and 10 m wind speed, wind speed and geopotential heights at various pressure levels. All of this information is critical to predicting the development of weather systems, storm trajectories, air quality, and weather patterns. ECMWF recently used Pangu-Weather to successfully predict the path of Typhoon DOKSURI which landed in southern China in July.


ECMWF has called for more efforts from the global weather forecasting community to use AI models as additional components of their forecasting systems and to further explore the strengths and weaknesses of such models.


The Atlas 800 training server (model: 9000) is powered by the Kunpeng and Ascend processors. It features ultra-high computing density, ultra-high energy efficiency, and high network bandwidth. The server is widely used in deep learning model development and training scenarios, and is an ideal option for computing-intensive industries, such as smart city, intelligent healthcare, astronomical exploration, and oil exploration.


Powered by the Ascend processor, the Atlas 800 inference server (model: 3000) supports up to 8 Atlas 300I inference cards to provide powerful real-time inference. It is widely used for AI inference in data centers.


YANG is a data modeling language. The YANG model defines a hierarchical data structure, which can be used for operations based on network configuration management protocols (such as NETCONF/RESTCONF). The operations include configuration, status data, remote procedure calls (RPCs), and notifications. Compared with the SNMP model MIB, YANG is more hierarchical, can distinguish between configurations and status, and provides high extensibility.


It is used to model the configuration data, status data, RPCs, and notifications used by network configuration management protocols (such as NETCONF and RESTCONF). YANG generates YANG models (also called YANG files) by describing data structures, data integrity constraints, and data operations.


Through ongoing standardization, YANG is gradually becoming a mainstream data description specification in the industry. Standards organizations, vendors, carriers, and OTTs all define their own YANG models. As shown in the following figure, the YANG model is integrated on the devices, which function as the servers. Network administrators can use NETCONF or RESTCONF to centrally manage, configure, and monitor various YANG-capable network devices, simplifying network O&M and reducing O&M costs.


The following uses part of the if-mib file as an example. The MIB is a tiled table, in which all elements of an IfEntry are arranged side by side. This makes it impossible to distinguish between configuration data and status data.

3a8082e126
Reply all
Reply to author
Forward
0 new messages