Data Science in Financial Economics

The research group “Data Science in Financial Economics” focuses on developing and applying novel data science and AI methods in the field of financial economics. These methods are used to generate economic indicators from unstructured data, such as textual data, satellite imagery, or web scraping. These indicators are then utilized in econometric analysis to address pertinent questions in financial economics. Research projects of the group deploy such indicators to study, for instance:

Information frictions in financial markets, such as the quality of firm disclosures and governance.

Bank lending behaviour and risk shifting in real estate markets as a response to macroprudential regulation.

Markets’ adoption to climate change and the limits thereof.

The research group is also involved in developing IWH’s European Real Estate Index (EREI), which systematically tracks European real estate markets. EREI includes data on quoted prices, supply-side depth (number of listings), and liquidity (dwell time). Harmonized at the NUTS-3 regional level, the index supports consistent cross-country comparisons and spatial analyses for researchers, policymakers, and the public.

Research Cluster
Financial Resilience and Regulation

Your contact

Professor Dr Fabian Woebbeking
Professor Dr Fabian Woebbeking
- Department Financial Markets
Send Message +49 345 7753-851 LinkedIn profile

EXTERNAL FUNDING

11.2022 ‐ 06.2025

Linguistic Sentiment Analysis for Environmental, Social, Governance and Climate Risks

Frankfurt Institute for Risk Management and Regulation (FIRM)

Professor Dr Fabian Woebbeking

Refereed Publications

cover_management-science.jpg

"Let Me Get Back to You" — A Machine Learning Approach to Measuring NonAnswers

Andreas Barth Sasan Mansouri Fabian Woebbeking

in: Management Science, Vol. 69 (10), 2023

Abstract

Using a supervised machine learning framework on a large training set of questions and answers, we identify 1,364 trigrams that signal nonanswers in earnings call questions and answers (Q&A). We show that this glossary has economic relevance by applying it to contemporaneous stock market reactions after earnings calls. Our findings suggest that obstructing the flow of information leads to significantly lower cumulative abnormal stock returns and higher implied volatility. As both our method and glossary are free of financial context, we believe that the measure is applicable to other fields with a Q&A setup outside the contextual domain of financial earnings conference calls.

read publication

Exploring Accounting Research Topic Evolution: An Unsupervised Machine Learning Approach

June Cao Zhanzhong Gu Iftekhar Hasan

in: Journal of International Accounting Research, Vol. 22 (3), 2023

Abstract

This study explores the evolution of accounting research by utilizing an unsupervised machine learning approach. We aim to identify the latent topics of accounting from the 1980s up to 2018, the dynamics and emerging topics of accounting research, and the economic reasons behind those changes. First, based on 23,220 articles from 46 accounting journals, we identify 55 topics using the latent Dirichlet allocation model. To illustrate the connection between topics, we use HistCite to generate a citation map along a timeline. The citation clusters demonstrate the “tribalism” phenomenon in accounting research. We then implement the dynamic topic model to reveal the dynamics of topics to show changes in accounting research. The emerging research trends are identified from the topic analytics. We further explore the economic reasons and in-depth insights into the topic evolution, indicating the economic development embeddedness nature of accounting research.

read publication

cover_journal-of-empirical-finance.jpg

Herding Behavior and Systemic Risk in Global Stock Markets

Iftekhar Hasan Radu Tunaru Davide Vioto

in: Journal of Empirical Finance, Vol. 73 (September), 2023

Abstract

This paper provides new evidence of herding due to non- and fundamental information in global equity markets. Using quantile regressions applied to daily data for 33 countries, we investigate herding during the Eurozone crisis, China’s market crash in 2015–2016, in the aftermath of the Brexit vote and during the Covid-19 Pandemic. We find significant evidence of herding driven by non-fundamental information in case of negative tail market conditions for most countries. This study also investigates the relationship between herding and systemic risk, suggesting that herding due to fundamentals increases when systemic risk increases more than when driven by non-fundamentals. Granger causality tests and Johansen’s vector error-correction model provide solid empirical evidence of a strong interrelationship between herding and systemic risk, entailing that herding behavior may be an ex-ante aspect of systemic risk, with a more relevant role played by herding based on fundamental information in increasing systemic risk.

read publication

cover_the-journal-of-financial-research.jpg

Market-implied Ratings and Their Divergence from Credit Ratings

Iftekhar Hasan Winnie P. H. Poon Jianfu Shen Gaiyan Zhang

in: Journal of Financial Research, Vol. 46 (2), 2023

Abstract

In this article, we investigate the divergence between credit ratings (CRs) and Moody's market-implied ratings (MIRs). Our evidence shows that rating gaps provide incremental information to the market regarding issuers' default risk over CRs alone in the short horizon and outperform CRs over extended horizons. The predictive ability of rating gaps is greater for more opaque and volatile issuers. Such predictability was more pronounced during the 2008 financial crisis but weakened in the post-Dodd-Frank Act period. This finding is consistent with credit rating agencies' efforts to improve their performance when facing regulatory pressure. Moreover, our analysis identifies rating-gap signals that do (do not) lead to subsequent Moody's actions to place issuers on negative outlook and watchlists. We find that negative signals from MIR gaps have a real economic impact on issuers' fundamentals such as profitability, leverage, investment, and default risk, thus supporting the recovery-efforts hypothesis.

read publication

cover_journal-of-economic-behavior-and-organization.jpg

Correlation Scenarios and Correlation Stress Testing

Natalie Packham Fabian Woebbeking

in: Journal of Economic Behavior and Organization, Vol. 205 (January), 2023

Abstract

We develop a general approach for stress testing correlations of financial asset portfolios. The correlation matrix of asset returns is specified in a parametric form, where correlations are represented as a function of risk factors, such as country and industry factors. A sparse factor structure linking assets and risk factors is built using Bayesian variable selection methods. Regular calibration yields a joint distribution of economically meaningful stress scenarios of the factors. As such, the method also lends itself as a reverse stress testing framework: using the Mahalanobis distance or Highest Density Regions (HDR) on the joint risk factor distribution allows to infer worst-case correlation scenarios. We give examples of stress tests on a large portfolio of European and North American stocks.

read publication

Working Papers

cover_DP_2025-21.jpg

The Price of Beauty: Biodiversity Effects on Residential Housing Markets

Michael Koetter Birte Winter Fabian Woebbeking

in: IWH Discussion Papers, No. 21, 2025

Abstract

We study how and why local biodiversity affects residential property values. Leveraging remotely sensed greenness indicators and a novel dataset of granular property listings, we examine how changes in vegetation load on real estate prices. Hikes in greenness are associated with higher listing prices, fewer properties listed, and reduced liquidity in housing markets. These results suggest that price hikes in housing markets are driven by supply-side constraints instead of a “greenium” that buyers might be willing to pay due to innate preferences. Exogenous zoning shocks to foster biodiversity corroborate the presence of supply side constraints as price drivers in residential housing markets. Our findings emphasize the need to calibrate biodiversity and (social) housing policy objectives more explicitly.

read publication

cover_DP_2025-13.jpg

Information Flow and Market Efficiency - The Economic Impact of Precise Language

Andreas Barth Sasan Mansouri Fabian Woebbeking

in: IWH Discussion Papers, No. 13, 2025

Abstract

This paper examines the impact of complex yet precise language, particularly financial jargon, on information dissemination and ultimately market efficiency. As a natural laboratory, we analyze the information exchanged during earnings conference calls, where we instrument jargon with the Plain Writing Act of 2010. Our findings suggest that the Act‘s promotion of plain language usage results in a reduction in complex financial jargon for US firms. However, in contrast to the presumed benefits of accessible language, this reduction in jargon is associated with a decrease in market efficiency, implying that the Act may inadvertently hinder information flow. This finding is particularly important at the juncture where human-generated information is received by machines, which are known to be vunerable to ambiguous inputs.

read publication

cover_DP_2025-04.jpg

Contractionary Macroprudential Policy, Collateral Valuation, and Risk-shifting in EU Banking

Michael Koetter Felix Noth Fabian Woebbeking

in: IWH Discussion Papers, No. 4, 2025

Abstract

We study real estate lending responses to tighter macroprudential policy (MPP) in the form of lower required loan-to-value (LTV) ratios. Contract details of 2.4 million mortgage loans originated between 2008 and 2020 reveal significantly fewer new loan issuances in response to contractionary MPP, commensurate with an average reduction in aggregate lending of 21 percent. Loan-level analyses reveal, however, that banks comply with lower LTVs by systematically more benevolent valuations of residential real estate pledged as collateral instead of reducing loan size. Exploiting earthquakes as plausible exogenous shocks to property values corroborates these risk-shifting patterns by banks in the form of inflated property valuations after LTV shocks.

read publication

cover_SSRN.png

How to Talk Down Your Stock Performance

Andreas Barth Sasan Mansouri Fabian Woebbeking Severin Zörgiebel

in: SSRN Discussion Papers, 2020

Abstract

We process the natural language of verbal firm disclosures in order to study the use of context specific language or jargon and its impact on financial performance. We observe that, within the Q&A of earnings conference calls, managers use less jargon in responses to tougher questions, and after a quarter of bad economic success. Moreover, markets interpret the lack of precise information as a bad signal: we find lower cumulative abnormal returns and a higher implied volatility following earnings calls where managers use less jargon. These results support the argument that context specific language or jargon helps to efficiently and precisely transfer information.

read publication
Mitglied der Leibniz-Gemeinschaft LogoTotal-Equality-LogoSupported by the BMWK