HSE Researchers Demonstrate Effectiveness of Machine Learning in Forecasting Inflation
Inflation is a key indicator of economic stability, and being able to accurately forecast its levels across regions is crucial for governments, businesses, and households. Tatiana Bukina and Dmitry Kashin at HSE Campus in Perm have found that machine learning techniques outperform traditional econometric models in long-term inflation forecasting. The results of the study focused on several regions in the Privolzhskiy Federal District have been published in HSE Economic Journal.
Inflation forecasting is crucial for the economy, particularly since Russia's transition to an inflation targeting regime in 2014. This means that the Bank of Russia sets specific inflation targets and employs a range of tools to meet them.
Various data is used to predict inflation, including the consumer price index, the unemployment rate, exchange rates, and the Central Bank rate. To systematise this information for forecasting, economists at the HSE Campus in Perm used data from the Unified Interdepartmental Statistical Information System.
The researchers' main objective was to determine which model predicts regional inflation more accurately: traditional econometric time series models or more recent machine learning methods. The study analysed data from 14 regions in the Privolzhskiy Federal District from January 2010 to December 2022. R Studio and Python were used for the analysis: time series forecasting was performed in R Studio, while machine learning models, including support vector machines, gradient boosting, and random forests, were implemented in Python. The forecasts were conducted on test samples, which helped prevent model overfitting and provided more accurate estimates.
The authors employed a cross-validation method using test samples of equal size. This approach allows models to be trained on data from one period and tested on data from another, ensuring stability and accuracy of the forecasts.
'To ensure accurate performance of machine learning methods, it is essential to select the optimal hyperparameters for the models. Hyperparameters differ from other model parameters in that they are set before training begins and define the model's specifications. Cross-validation is employed to select the optimal hyperparameters. When cross-validating time series, the training data precedes the test data without overlap, unlike in standard data validation,' according to Tatiana Bukina, Associate Professor, Faculty of Computer Science, Economics, and Social Sciences, HSE Campus in Perm.
The study found the gradient boosting model to be the most accurate of all machine learning models considered for predicting regional inflation. It delivers more accurate forecasts than autoregressive models over more time horizons. Thus, at forecasting horizons of 3, 6, 21, and 24 months, the gradient boosting model outperforms the basic AR(1) model by 20.3%, 16.2%, 72.5%, and 77.7%, respectively. The AR(1) model, a statistical tool for analysing and predicting time series, assumes that the current value of a series depends on its previous value plus a random error.
The random forest model and the support vector machine also demonstrated accurate forecasts over the long horizons of 21 and 24 months, outperforming the AR(1) model by 72.5% and 77.7%, respectively. A random forest combines multiple decision trees to enhance the accuracy and stability of forecasts, and then uses regression to average the predictions or select the most frequent value. The support vector machine identifies the optimal line that separates the data while minimising classification errors.

According to the authors, their results confirm that machine learning methods can be effective for forecasting inflation across various time horizons.
Tatiana Bukina notes, 'Our research has demonstrated that machine learning provides more reliable tools for long-term forecasts. However, traditional econometric models continue to play a crucial role in short-term forecasts and should not be entirely excluded from analysts' toolkits. Combining econometric modelling with machine learning methods can significantly improve the accuracy of regional inflation forecasts. This is particularly important in an environment characterised by high uncertainty and rapid changes in economic conditions.'
The study also highlighted the specific characteristics of inflation forecasting for different regions. For example, in machine learning models, inflation seasonality was observed only in the Perm, Nizhny Novgorod, Penza, and Saratov regions. In the Republic of Tatarstan, the specific month for which the forecast was calculated proved to be a significant factor.
In the random forest model, the average inflation value for the previous three months emerged as a significant factor for the Republic of Mordovia, Nizhny Novgorod and Ulyanovsk regions, and the Chuvash Republic.
Each region has unique characteristics related to its economic structure, natural resources, and geographical location. These factors account for the variations in inflation dynamics and key macroeconomic indicators.
See also:
HSE Researchers Teach Neural Network to Distinguish Origins from Genetically Similar Populations
Researchers from the AI and Digital Science Institute, HSE Faculty of Computer Science, have proposed a new approach based on advanced machine learning techniques to determine a person’s genetic origin with high accuracy. This method uses graph neural networks, which make it possible to distinguish even very closely related populations.
HSE Economists Reveal the Secret to Strong Families
Researchers from the HSE Faculty of Economic Sciences have examined the key factors behind lasting marriages. The findings show that having children is the primary factor contributing to marital stability, while for couples without children, a greater income gap between spouses is associated with a stronger union. This is the conclusion reported in Applied Econometrics.
Fifteen Minutes on Foot: How Post-Soviet Cities Manage Access to Essential Services
Researchers from HSE University and the Institute of Geography of the Russian Academy of Sciences analysed three major Russian cities to assess their alignment with the '15-minute city' concept—an urban design that ensures residents can easily access essential services and facilities within walking distance. Naberezhnye Chelny, where most residents live in Soviet-era microdistricts, demonstrated the highest levels of accessibility. In Krasnodar, fewer than half of residents can easily reach essential facilities on foot, and in Saratov, just over a third can. The article has been published in Regional Research of Russia.
HSE Researchers Find Counter-Strike Skins Outperform Bitcoin and Gold as Alternative Investments
Virtual knives, custom-painted machine guns, and gloves are common collectible items in videogames. A new study by scientists from HSE University suggests that digital skins from the popular video game Counter-Strike: Global Offensive (CS:GO) rank among the most profitable types of alternative investments, with average annual returns exceeding 40%. The study has been published in the Social Science Research Network (SSRN), a free-access online repository.
HSE Neurolinguists Reveal What Makes Apps Effective for Aphasia Rehabilitation
Scientists at the HSE Centre for Language and Brain have identified key factors that increase the effectiveness of mobile and computer-based applications for aphasia rehabilitation. These key factors include automated feedback, a variety of tasks within the application, extended treatment duration, and ongoing interaction between the user and the clinician. The article has been published in NeuroRehabilitation.
'Our Goal Is Not to Determine Which Version Is Correct but to Explore the Variability'
The International Linguistic Convergence Laboratory at the HSE Faculty of Humanities studies the processes of convergence among languages spoken in regions with mixed, multiethnic populations. Research conducted by linguists at HSE University contributes to understanding the history of language development and explores how languages are perceived and used in multilingual environments. George Moroz, head of the laboratory, shares more details in an interview with the HSE News Service.
Slim vs Fat: Overweight Russians Earn Less
Overweight Russians tend to earn significantly less than their slimmer counterparts, with a 10% increase in body mass index (BMI) associated with a 9% decrease in wages. These are the findings made by Anastasiia Deeva, lecturer at the HSE Faculty of Economic Sciences and intern researcher in Laboratory of Economic Research in Public Sector. The article has been published in Voprosy Statistiki.
Scientists Reveal Cognitive Mechanisms Involved in Bipolar Disorder
An international team of researchers including scientists from HSE University has experimentally demonstrated that individuals with bipolar disorder tend to perceive the world as more volatile than it actually is, which often leads them to make irrational decisions. The scientists suggest that their findings could lead to the development of more accurate methods for diagnosing and treating bipolar disorder in the future. The article has been published in Translational Psychiatry.
Scientists Develop AI Tool for Designing Novel Materials
An international team of scientists, including researchers from HSE University, has developed a new generative model called the Wyckoff Transformer (WyFormer) for creating symmetrical crystal structures. The neural network will make it possible to design materials with specified properties for use in semiconductors, solar panels, medical devices, and other high-tech applications. The scientists will present their work at ICML, a leading international conference on machine learning, on July 15 in Vancouver. A preprint of the paper is available on arxiv.org, with the code and data released under an open-source license.
HSE Linguists Study How Bilinguals Use Phrases with Numerals in Russian
Researchers at HSE University analysed over 4,000 examples of Russian spoken by bilinguals for whom Russian is a second language, collected from seven regions of Russia. They found that most non-standard numeral constructions are influenced not only by the speakers’ native languages but also by how frequently these expressions occur in everyday speech. For example, common phrases like 'two hours' or 'five kilometres’ almost always match the standard literary form, while less familiar expressions—especially those involving the numerals two to four or collective forms like dvoe and troe (used for referring to people)—often differ from the norm. The study has been published in Journal of Bilingualism.