[an error occurred while processing this directive]

China Terminology ›› 2022, Vol. 24 ›› Issue (1): 36-44.doi: 10.12339/j.issn.1673-8578.2022.01.004

Previous Articles     Next Articles

Automatic Recognition and Terminology Database Construction of English Network Informal Language Expressions

XIA Rongjing(), ZHANG Keliang()   

  • Received:2021-09-27 Revised:2021-11-30 Online:2022-01-05 Published:2021-12-27

Abstract:

Network Informal Language Expression (NILE) has the characteristics of novelty, unconventionality and colloquialism,which poses a challenge to many natural language processing tasks. In the process of using online language for communication, some NILEs are gradually standardized and normalized, forming a crucial part of the NILE terminology. By collecting, processing and analyzing more than 460 000 tweets, we divide English NILEs into 13 categories from the perspectives of sound, form and sense, and further analyzed their characteristics. Taking the advantage of statistic-based approach and rule-based approach, we design an automatic English NILE recognition system based on the integration of statistical techniques and linguistic rules, and thereupon build a terminology database of 7000 NILE items.

Key words: Network Informal Language Expression (NILE), automatic recognition, terminology database

CLC Number: