Journal of Risk Model Validation
ISSN:
1753-9579 (print)
1753-9587 (online)
Editor-in-chief: Steve Satchell

Incorporating financial reports and deep learning for financial distress prediction: empirical evidence from Chinese listed companies
Jiaming Liu, Ming Jia, Yanan Hao and Lu Wang
Need to know
- This paper explores the use of textual data from financial reports for predicting financial distress in Chinese listed companies. It proposes models incorporating Word2Vec and BERT for text embedding to capture sentiment and tone from the texts.
- The study compares different text embedding methods, including word2vec-averaging, word2vec-weighting, BERT-word, and BERT-sentence, to examine their effectiveness in improving the predictive performance of financial distress models.
- The paper also investigates the combination of textual features with traditional financial indicators for enhanced prediction accuracy. Results show that the inclusion of textual features can improve the discriminability of predictions, especially over longer prediction horizons.
Abstract
This study conducts a comparative study on text information processing methods for financial distress prediction. Word2vec and bidirectional encoder representations from transformers (BERT) are employed to convert financial reports into vectors. Weighted word2vec and BERT-sentence are also used for enhanced text processing and report quantification. Experimental results based on a data set of 62 312 Chinese listed companies from 2000 to 2021 show that weighted word2vec achieves an average prediction accuracy of 85.27% in cross-validation and 84.67% in sliding-time- window validation. The findings indicate that incorporating semantic information from management discussion and analysis (MD&A) significantly improves the performance of distress prediction models for listed companies, regardless of the text-processing technique used. Text-based features become comparable with financial indicators and even surpass them as the prediction horizon extends. Combination features offer greater enhancement than financial indicators, especially for longer prediction horizons. We therefore offer a comprehensive validation of the MD&A for the purpose of predicting financial distress, and we firmly believe that it serves as a valuable tool in mitigating risk within financial risk management.
Copyright Infopro Digital Limited. All rights reserved.
As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (point 2.4), printing is limited to a single copy.
If you would like to purchase additional rights please email info@risk.net
Copyright Infopro Digital Limited. All rights reserved.
You may share this content using our article tools. As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (clause 2.4), an Authorised User may only make one copy of the materials for their own personal use. You must also comply with the restrictions in clause 2.5.
If you would like to purchase additional rights please email info@risk.net