Enhancing ESG Data Analysis in Green Banking Using Large Language Models
Keywords:
ESG Data, Green Banking, Large Language Models, Sustainable Finance, Natural Language Processing, Greenwashing DetectionAbstract
Environmental, Social and Governance (ESG) data is an integral part of the sustainable finance landscape, enabling financial institutions to understand long-term risks, align portfolios with climate goals, and meet growing disclosure requirements from regulators. However, such data is often not available in structured formats. Instead, it is embedded in longer textual disclosures such as sustainability reports, corporate filings and other narrative disclosures. Traditional rule-based or keyword driven approaches are insufficient to deal with the complexity of the language, context, and idiosyncrasy of real-world ESG disclosures. This paper presents a three-layer LLM-based framework to automate the extraction of ESG signals from financial documents and their multi-class classification into environmental, social and governance categories, as well as the identification of greenwashing practices. The framework is developed based on fine-tuned transformer architectures, and a separate decision support layer integrates the extracted ESG signals in credit risk and green loan classification processes. Experiments on publicly available ESG datasets show important improvements in classification accuracy, precision, and F1-scores over rule-based baselines and domain-adapted BERT models, and the proposed system can be used to reliably identify discrepancies between narrative sustainability claims and quantitative performance indicators. It discusses model bias, explainability, and compliance with the EU Sustainable Finance Disclosure Regulation. It concludes that LLMs could become a transformative green banking analytics tool, but appropriate governance structures will require development to fully realize their potential.
Downloads
References
Jacob Devlin et al., "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding," Proceedings of NAACL-HLT, 2019. Available: https://arxiv.org/abs/1810.04805
European Commission, "EU Taxonomy for Sustainable Activities," European Commission Sustainable Finance, 2020. Available: https://finance.ec.europa.eu/sustainable-finance/tools-and-standards/eu-taxonomy-sustainable-activities_en
European Commission, "Sustainable Finance Disclosure Regulation (SFDR)," Official Journal of the European Union, 2021. Available: https://finance.ec.europa.eu/sustainable-finance/disclosures/sustainability-related-disclosure-financial-services-sector_en
Singhania, Monica, and Neha Saini. "Institutional framework of ESG disclosures: comparative analysis of developed and developing countries." Journal of Sustainable Finance & Investment 13, no. 1 (2023): 516-559. Available: https://doi.org/10.1080/20430795.2021.1964810
Huang, Allen H., Hui Wang, and Yi Yang. "FinBERT: A large language model for extracting information from financial text." Contemporary Accounting Research 40, no. 2 (2023): 806-841. Available: https://onlinelibrary.wiley.com/doi/10.1111/1911-3846.12832
Friede, Gunnar, Timo Busch, and Alexander Bassen. "ESG and financial performance: aggregated evidence from more than 2000 empirical studies." Journal of Sustainable Finance & Investment 5, no. 4 (2015): 210-233. Available: https://www.tandfonline.com/doi/pdf/10.1080/20430795.2015.1118917
Sakis Kotsantonis and George Serafeim, "Four Things No One Will Tell You About ESG Data," Journal of Applied Corporate Finance, 2019. Available: https://doi.org/10.1111/jacf.12346
Emily M. Bender et al., "On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?," Proceedings of FAccT, 2021. Available: https://dl.acm.org/doi/10.1145/3442188.3445922
Zou, Yi, Mengying Shi, Zhongjie Chen, Zhu Deng, ZongXiong Lei, Zihan Zeng, Shiming Yang, Hongxiang Tong, Lei Xiao, and Wenwen Zhou. "ESGReveal: An LLM-based approach for extracting structured data from ESG reports." Journal of Cleaner Production 489 (2025): 144572. Available: https://arxiv.org/abs/2312.17264
Schimanski, Tobias, Andrin Reding, Nico Reding, Julia Bingler, Mathias Kraus, and Markus Leippold. "Bridging the gap in ESG measurement: Using NLP to quantify environmental, social, and governance communication." Finance Research Letters 61 (2024): 104979. Available: https://www.sciencedirect.com/science/article/pii/S1544612324000096
Vo, Ace, Rosemary Kim, Miloslava Plachkinova, and Jake Tam Lestyk. "ESG Risk Classification in 10-K Filings: Benchmarking Finbert and Large Language Models." Available at SSRN 5387933. Available: https://dx.doi.org/10.2139/ssrn.5387933
Lee, Haein, Seon Hong Lee, Heungju Park, Jang Hyun Kim, and Hae Sun Jung. "ESG2PreEM: Automated ESG grade assessment framework using pre-trained ensemble models." Heliyon 10, no. 4 (2024). Available: https://www.sciencedirect.com/science/article/pii/S2405844024024356
Xu, Congluo, Jiuyue Liu, Ziyang Li, and Chengmengjia Lin. "DeepGreen: Effective LLM-Driven Greenwashing Monitoring System Designed for Empirical Testing—Evidence from China." Computational Economics (2026): 1-35. Available: https://link.springer.com/article/10.1007/s10614-026-11328-5
Davidescu, Adriana AnaMaria, Eduard Mihai Manta, Ioana Bîrlan, Alexandra-Mădălina Miler, and Sorin-Cristian Niță. "Detecting greenwashing in ESG disclosure: an NLP-based analysis of central and eastern european firms." Sustainability 18, no. 3 (2026): 1486. Available: https://www.mdpi.com/2071-1050/18/3/1486
Giudici, Paolo, and Lunshuai Wu. "Sustainable artificial intelligence in finance: Impact of ESG factors." Frontiers in Artificial Intelligence 8 (2025): 1566197. Available: https://www.frontiersin.org/journals/artificial-intelligence/articles/10.3389/frai.2025.1566197/full
Musleh Al-Sartawi, Abdalmuttaleb MA, Khaled Hussainey, and Anjum Razzaque. "The role of artificial intelligence in sustainable finance." Journal of Sustainable Finance & Investment (2022): 1-6. Available: https://www.tandfonline.com/doi/full/10.1080/20430795.2022.2057405
Dipierro, Anna Rita, Fernando Jimenéz Barrionuevo, and Pierluigi Toma. "Predicting ESG controversies in banks using machine learning techniques." Corporate Social Responsibility and Environmental Management 32, no. 3 (2025): 3525-3544. Available: https://onlinelibrary.wiley.com/doi/full/10.1002/csr.3146
Lim, Tristan. "Environmental, social, and governance (ESG) and artificial intelligence in finance: State-of-the-art and research takeaways." Artificial Intelligence Review 57, no. 4 (2024): 76. Available: https://link.springer.com/article/10.1007/s10462-024-10708-3
Dewasiri, Narayanage Jayantha, Dunusinghe G. Dharmarathna, and Mrinalini Choudhary. "Leveraging artificial intelligence for enhanced risk management in banking: A systematic literature review." Artificial intelligence enabled management: An emerging economy perspective (2024): 197-213. Available: https://books.google.co.in/books?hl=en&lr=&id=wgkFEQAAQBAJ&oi=fnd&pg=PA197
Pluskota, Przemysław, Kamila Słupińska, Agata Wawrzyniak, and Barbara Wąsikowska. "The Application of Artificial Intelligence (AI) in the Implementation of ESG-Oriented Sustainable Development Strategies in the Banking Sector: A Case Study." Sustainability 18, no. 2 (2026): 732. Available: https://www.mdpi.com/2071-1050/18/2/732
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.


