The Application of RAG in Langchain Framework in Classical Chinese

Authors

  • Liu Zhi Hao, Nantong Institute of Technology, Nantong, Jiangsu, China
  • Leong Wai Yie, INTI International University, Nilai, Negeri Sembilan, Malaysia

Keywords:

RAG, Data Source Creation Method, Classical Chinese, Energy Efficiency

Abstract

Currently, the world's mainstream Large Language Models (LLMs) offer significantly less support for Chinese than for English, which makes it difficult to use generative LLMs to produce high-quality works of traditional Chinese literature. This paper proposes a data source creation method that interprets words according to their extended meanings, that is, the one or more related meanings that arise from an existing meaning of a word as the language develops. A word segmentation tool is then used to separate the different meanings of a word, which re-quantifies the nouns, verbs, stories, and histories in classical Chinese. Quantifying the data in this way effectively addresses the polysemy of words and strengthens the logical correlation between contexts. The results show that the correlation between the generated classical Chinese and the real results is greatly improved. Because we use the Retrieval Augmented Generation (RAG) method, these results are obtained at minimal cost, without retraining a new LLM.
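
The idea of a data source with one entry per meaning, queried at generation time, can be illustrated with a minimal sketch. This is not the authors' implementation and it does not use LangChain or a real word segmentation tool: the `SenseEntry` type, the toy entries for 道, the character-overlap scoring, and the prompt wording are all hypothetical stand-ins chosen only to show how retrieving a single sense can disambiguate a polysemous word before the text is passed to an LLM.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SenseEntry:
    """One meaning of a word, stored as its own retrievable record."""
    word: str     # headword
    sense: str    # gloss of a single meaning (original or extended)
    example: str  # hypothetical example phrase for this meaning


# Hypothetical toy data source: the polysemous word is split into
# one entry per meaning instead of one entry per word.
ENTRIES = [
    SenseEntry("道", "road, path", "道路"),
    SenseEntry("道", "principle, doctrine", "道理"),
    SenseEntry("道", "to say, to speak", "道谢"),
]


def score(query: str, entry: SenseEntry) -> int:
    """Assumed similarity: number of distinct characters shared
    between the query and the entry's example phrase."""
    return len(set(query) & set(entry.example))


def retrieve(query: str, entries: list, k: int = 1) -> list:
    """Return the k entries with the highest overlap score.
    sorted() is stable, so ties keep their original order."""
    ranked = sorted(entries, key=lambda e: score(query, e), reverse=True)
    return ranked[:k]


def build_prompt(query: str, retrieved: list) -> str:
    """Prepend the retrieved senses to the request sent to the LLM."""
    context = "\n".join(
        f"{e.word}: {e.sense} (e.g. {e.example})" for e in retrieved
    )
    return f"Context:\n{context}\n\nTask: write classical Chinese about: {query}"


if __name__ == "__main__":
    hits = retrieve("讲道理", ENTRIES, k=1)
    print(build_prompt("讲道理", hits))
```

In a real pipeline the overlap score would be replaced by an embedding-based retriever, and the prompt would be sent to an existing LLM; no model weights are updated at any point, which is the property the abstract relies on.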

Published

2025-08-18

How to Cite

Zhi Hao, L., & Wai Yie, L. (2025). The Application of RAG in Langchain Framework in Classical Chinese. INTI Journal, 2025(2). Retrieved from https://iuojs.intimal.edu.my/index.php/intijournal/article/view/710

Section

Articles