漢語自動切分 的英文怎麼說
中文拼音 [hànyǔzìdòngqiēfēn]
漢語自動切分
英文
automatic chinese word segmentation- 漢 : 1 (朝代) the han dynasty (206 b c a d 220)2 (漢族) the han nationality3 (漢語) chinese (...
- 語 : 語動詞[書面語] (告訴) tell; inform
- 自 : Ⅰ代詞(自己) self; oneself; one s own Ⅱ副詞(自然;當然) certainly; of course; naturally; willin...
- 切 : 切Ⅰ動詞1 (合; 符合) correspond to; be close to 2 (用在反切后頭 表示前兩個字是注音用的反切)見 ...
- 分 : 分Ⅰ名詞1. (成分) component 2. (職責和權利的限度) what is within one's duty or rights Ⅱ同 「份」Ⅲ動詞[書面語] (料想) judge
- 漢語 : chinese (language)漢語方言 chinese dialects; 漢語拼音方案 the scheme for the chinese phonetic a...
- 自動 : 1 (自己主動) voluntarily; of one s own accord 2 (不憑借人為的力量) automatic; spontaneous 3 ...
-
The strategy of overcoming ambiguity and unlanded words in chinese words automatically seperating system
漢語自動分詞系統中切分歧義與未登錄詞的處理策略Through analyzing segmentation methods in existence, this paper points out three developing directions of future chinese automatic segmentation research, that is, effective segmentation to traditional text, rapid development of computer technology, and changing the writing rules of chinese text
本文通過對現有分詞方法的分析,指出了今後漢語自動分詞研究的三個發展方向,即對傳統文本的有效切分,計算機技術的快速發展,改造書面漢語書寫規則。A semantic based disambiguation algorithm was designed and implemented. with the algorithm, word sense disambiguation and structure disambiguation can be done by semantic pattern rules matching during syntax parsing. the experiment result indicates that : ( a ) the presentation of semantic pattern rules can formalize the construction of chinese phrase quite well ; ( b ) the corpus - based algorithm for acquiring and filtering binary semantic pattern rules is effective, and it can reduce the human labor, avoid subjectivity and unilateralism caused by writing rules manually ; ( c ) the semantic based disambiguation algorithm can achieve satisfactory effects
實驗表明: 1 )本文設計的語義模式規則能夠較準確地刻畫漢語短語構造的語義規律; 2 )本文提出的基於語料庫的二元語義模式規則自動挖掘和優選演算法是切實可行的,它大大減少完全由人工從大規模語料庫中總結規則的工作量,避免了純人工編制規則的主觀性和片面性; 3 )本文提出的語義分析排歧演算法能夠有效消解短語分析中的詞義歧義和結構歧義。Cross ambiguity is a major type of ambiguity in chinese word segmentation
交集型切分歧義(交集型歧義切分欄位)是漢語自動分詞中遇到的主要歧義類型。分享友人