[an error occurred while processing this directive]

China Terminology ›› 2024, Vol. 26 ›› Issue (2): 29-36.doi: 10.12339/j.issn.1673-8578.2024.02.004

• Technical Methods • Previous Articles     Next Articles

Research on Term Interpretation Generation Methods Based on Deep Learning Text Summarization

DU Zhenlei1(), CHEN Ruoyu2(), JIANG Yushan2()   

  • Received:2023-03-27 Revised:2023-06-29 Online:2024-03-29 Published:2024-03-29

Abstract:

Scientific and technological terms are the foundation and prerequisite for the formation, accumulation, communication, and dissemination of scientific knowledge. Generating encyclopedic definitions for these terms is of great practical significance for the general public and Chinese learners to grasp the connotations of scientific terms and use them correctly. In this study, we propose a deep learning-based method for generating encyclopedic definitions of scientific and technological terms. We collected encyclopedic texts and expert-written term definitions from the internet to construct a dataset for scientific and technological term definitions. Based on the T5 PEGASUS pre-trained model, we fine-tuned the model to build a generative text summarization model and developed a system for generating definitions of scientific and technological terms. Experimental results demonstrate that the proposed model exhibits high performance in terms of generation quality, semantic coherence, and versatility.

Key words: deep learning, text summarization, scientific and technological terms, definition of terms, definition generation, dataset