中国科技术语 ›› 2022, Vol. 24 ›› Issue (2): 65-69.doi: 10.12339/j.issn.1673-8578.2022.02.009

• • 上一篇    下一篇

面向计算机辅助翻译的民航规章术语库词性规则研究

王坤()   

  1. 中国民航大学外国语学院,天津 300300
  • 收稿日期:2021-10-12 修回日期:2022-03-08 出版日期:2022-04-05 发布日期:2022-03-31
  • 作者简介:王坤(1990—),男,中国民航大学外国语学院讲师。研究方向为民航英语、典籍翻译。有多年翻译学研究与民航翻译经验,主持完成中国民航局“直升机空中救护运行相关资料的翻译”项目,参编“十三五”规划教材,曾在《山东外语教学》《外国语言文学》《中国民航飞行学院学报》等刊物发表论文。通信方式: ggggfjhh@yeah.net
  • 基金资助:
    中国民航大学中央高校基金项目“英汉翻译中的透明话语策略研究”(3122018R010)

Analysis on POS Configuration for Civil Aviation Regulations Termbase based on CAT System

WANG Kun()   

  • Received:2021-10-12 Revised:2022-03-08 Online:2022-04-05 Published:2022-03-31

摘要:

当前主流计算机辅助翻译系统(CAT)借助翻译记忆(TM)和术语库(TB)提高翻译效率。翻译记忆以自然句为主要匹配单位,需要整句相似或重复,匹配难度大。与之相比,术语库以词块为匹配单位,较为灵活,可弥补翻译记忆的缺陷。术语库的构建涉及术语自动提取,需要参考特定文本类型中高频语块的词性规则。文章使用n-gram提取英语民航规章文本的复现语块,探究不同词项长度和复现频数下高频语块的词性组合特征;并将其与文学文本进行对比。研究发现,在英语民航规章文本中,适用于计算机辅助翻译系统术语库的复现语块以名词短语为主,与文学文本存在显著差异。

关键词: 计算机辅助翻译, 术语库, n-gram, 民航规章

Abstract:

Most of the current CAT systems leverage Translation Memory (TM) and Termbase(TB) to enhance efficiency of translation. With respect to TM, due to its limitations in practice, whole sentence repetition often should be complemented by translation termbase, which is more flexible in use. Building a termbase requires the automatic extraction of terms, which demands knowledge of its POS (part of speech) configuration in the specific text typology. With corpus tools, we extracted n-grams of certain length and frequency from Civil Aviation Regulations in the US and examined the POS configuration of those recurrent chunks, followed by a contrast with that of literary texts. The study shows a dominance of NP and PP in recurrent chunks suitable for CAT termbase in those Civil Aviation Regulations, different from the result in literary texts.

Key words: Computer Aided Translation(CAT), termbase, n-gram, civil aviation regulations

中图分类号: