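Keyword Language Identification in a Uyghur-Kazakh-Kyrgyz-Chinese Multilingual Dictionary
Cite this article: Mardan Hoshur, Winira Musajan. Keyword Language Identification in a Uyghur-Kazakh-Kyrgyz-Chinese Multilingual Dictionary [J]. 新疆大学学报(理工版) (Journal of Xinjiang University, Science and Engineering Edition), 2014, 0(1): 7-11
Authors: Mardan Hoshur  Winira Musajan
Affiliation: College of Information Science and Engineering, Xinjiang University, Urumqi, Xinjiang 830046
Funding: Supported by National Natural Science Foundation of China (6126206).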
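Abstract: Set against the background of a multilingual, multidirectional Uyghur-Kazakh-Kyrgyz-Chinese dictionary, this paper identifies several language-specific technical difficulties, including how to identify the writing direction and how to distinguish among Uyghur, Kazakh, and Kyrgyz letters. It then presents corresponding solutions, for example using XML attributes and Unicode range analysis to determine the writing direction, and computing the frequency of special letters and selecting a user-defined font. Finally, experiments verify the feasibility of the proposed solutions.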

Keywords: dictionary  multilingual  automatic detection

Keyword Language Identification in Uighur, Kazakh, Kyrgyz and Chinese Multi-lingual Dictionary System
Authors and affiliation: Mardan Hoshur, Winira Musajan (College of Information Science and Engineering, Xinjiang University, Urumqi, Xinjiang 830046, China)
Abstract: Taking the design of a Chinese, Uyghur, Kazakh, and Kyrgyz multilingual, multidirectional dictionary system as background, this paper points out language-specific technical difficulties, including how to determine the writing direction and how to distinguish the letters of Uyghur, Kazakh, and Kyrgyz from one another. It then proposes corresponding solutions: using XML attributes and Unicode range analysis to determine the writing direction, and calculating the usage rates of letters in specific words to select user-defined fonts. Application results indicate the feasibility and validity of these solutions.
Keywords: XML  Dictionary  Multilanguage  Auto Detection
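
The two ideas named in the abstract, Unicode range analysis for writing direction and counting special letters to tell the Arabic-script languages apart, can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the code point ranges, and the marker-letter sets are assumptions chosen for illustration, and the XML-attribute and font-selection steps are omitted.

```python
from collections import Counter

# Assumed code point ranges for Arabic-script text (Arabic, Arabic Supplement,
# Arabic Presentation Forms-A and -B). Not taken from the paper.
ARABIC_RANGES = [(0x0600, 0x06FF), (0x0750, 0x077F),
                 (0xFB50, 0xFDFF), (0xFE70, 0xFEFF)]

# Hypothetical marker letters for this sketch; a real system would need a
# fuller, validated inventory for each orthography.
MARKER_LETTERS = {
    "uyghur": {"\u06D0", "\u06C8"},   # ARABIC LETTER E, ARABIC LETTER YU
    "kazakh": {"\u0674"},             # ARABIC LETTER HIGH HAMZA
    "kyrgyz": {"\u06C5", "\u06C9"},   # ARABIC LETTER KIRGHIZ OE / KIRGHIZ YU
}


def is_arabic_script(ch):
    """True if the character's code point falls in one of the assumed ranges."""
    cp = ord(ch)
    return any(lo <= cp <= hi for lo, hi in ARABIC_RANGES)


def writing_direction(keyword):
    """Return 'rtl' when Arabic-script characters outnumber other letters."""
    arabic = sum(1 for ch in keyword if is_arabic_script(ch))
    other = sum(1 for ch in keyword
                if ch.isalpha() and not is_arabic_script(ch))
    return "rtl" if arabic > other else "ltr"


def guess_language(keyword):
    """Guess the keyword's language from its direction and marker letters."""
    if writing_direction(keyword) == "ltr":
        # CJK Unified Ideographs block only; extensions are ignored here.
        has_cjk = any(0x4E00 <= ord(ch) <= 0x9FFF for ch in keyword)
        return "chinese" if has_cjk else "unknown"
    counts = Counter()
    for ch in keyword:
        for lang, letters in MARKER_LETTERS.items():
            if ch in letters:
                counts[lang] += 1
    if not counts:
        return "ambiguous"  # no marker letter present in this keyword
    return counts.most_common(1)[0][0]
```

A keyword containing none of the marker letters is reported as `ambiguous` rather than forced into one language, since short keywords often consist only of letters shared by all three orthographies.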
This article is indexed in CNKI, 维普 (VIP), and other databases.