How to say 詞的自動切分 in English
Pinyin [cí de zìdòng qiēfēn]
詞的自動切分
English
automatic segmentation of words
- 詞 : noun 1 (說話或詩歌、文章、戲劇中的語句) speech; statement; lines of a play 2 (一種韻文形式 起於唐...
- 的 : 4次方是 The fourth power of 2 is direction
- 自 : Ⅰ pronoun (自己) self; oneself; one's own Ⅱ adverb (自然;當然) certainly; of course; naturally; willin...
- 切 : Ⅰ verb 1 (合; 符合) correspond to; be close to 2 (用在反切后頭 表示前兩個字是注音用的反切) see ...
- 分 : Ⅰ noun 1. (成分) component 2. (職責和權利的限度) what is within one's duty or rights Ⅱ same as 「份」 Ⅲ verb [formal] (料想) judge
- 自動 : 1 (自己主動) voluntarily; of one's own accord 2 (不憑借人為的力量) automatic; spontaneous 3 ...
In other words, anchors, the most direct interpersonal communicators, appear as ordinary people and convey understanding, concern, sadness and anger through their genuine feelings; the news content and commentary are plain and easy to understand, reflecting respect for the audience's habits and capacity for reception; and the audience's aesthetic needs are fully considered, with programs that are true rather than artificial and concise yet not lacking in detail, so that broadcasting returns to its original function of speaking through microphone and camera
在傳播方式上,尋找與受眾的貼近和平等,這包括最直接的人際傳播者──主持人以平民化的形象出現,以親切、真誠與節目相契合的內心情感的自然流露向觀眾表達理解、關切、傷感、憤怒;內容和解說詞樸實自然、通俗易懂,從中體現出對受眾接受習慣和能力的尊重;充分考慮受眾的審美需求,真實而不造作,簡潔而不乏細節,真正啟動廣播電視功能的本源即用話筒和鏡頭說話。
This paper focuses on word segmentation for Chinese text classification. It proposes a segmentation method based on 2-gram phrase indexing, which combines the method of setting segmentation marks with the method based on word-frequency statistics and can recognize words that dictionary-based methods cannot handle
對于基於信息過濾的自動分類問題,使用字典分詞並不是一個必須的過程,因而本文提出了基於2元語法短語標引的分詞方法,它將設立切分標志法與基於詞頻統計的方法相結合,可以識別基於詞典方法處理不了的詞匯,如:人名、地名、專業術語等。
Strategies for handling segmentation ambiguity and unregistered words in automatic Chinese word segmentation systems
漢語自動分詞系統中切分歧義與未登錄詞的處理策略
Based on 6.4 million characters of ancient Chinese poetry, the computer-aided research system for ancient Chinese poems provides a word-based analysis platform. More than 50,000 Chinese words, including 40,814 multi-character words, were extracted from the corpus by statistical methods. Besides full-text retrieval, the system also provides word-based statistical analysis, sentence-based similarity retrieval, automatic pinyin tagging and other functions to support in-depth analysis of ancient Chinese poems. The project was funded by the National Social Science Foundation of China in 1998-1999
在對詩文進行詞語切分的基礎上,建立了詞匯的共現關系對仗關系以及詞匯的作者分佈特徵信息。系統除了提供面向詩文內容的全文檢索功能外,還進一步開發了基於詞匯的統計分析和詩句相似性檢索等功能,實現了對全唐詩的自動注音。
In this novel approach, proper-noun information extracted automatically from the corpus, together with rules obtained by transformation-based error-driven learning, is used to tag the attributes of segmented text and thereby recognize proper nouns
該方法利用從語料庫中自動提取到的專有名詞信息和採用基於轉換的錯誤驅動學習方法獲得的規則,對切分文本進行屬性標注,最終實現專有名詞的識別。
By analyzing existing segmentation methods, this paper identifies three directions for future research on automatic Chinese word segmentation: effective segmentation of traditional text, the rapid development of computer technology, and reform of the writing rules of Chinese text
本文通過對現有分詞方法的分析,指出了今後漢語自動分詞研究的三個發展方向,即對傳統文本的有效切分,計算機技術的快速發展,改造書面漢語書寫規則。
A semantics-based disambiguation algorithm was designed and implemented. With this algorithm, word-sense disambiguation and structural disambiguation are performed by matching semantic pattern rules during syntactic parsing. The experimental results indicate that: (a) semantic pattern rules formalize the construction of Chinese phrases quite well; (b) the corpus-based algorithm for acquiring and filtering binary semantic pattern rules is effective, reducing human labor and avoiding the subjectivity and one-sidedness of manually written rules; (c) the semantics-based disambiguation algorithm achieves satisfactory results
實驗表明: 1 )本文設計的語義模式規則能夠較準確地刻畫漢語短語構造的語義規律; 2 )本文提出的基於語料庫的二元語義模式規則自動挖掘和優選演算法是切實可行的,它大大減少完全由人工從大規模語料庫中總結規則的工作量,避免了純人工編制規則的主觀性和片面性; 3 )本文提出的語義分析排歧演算法能夠有效消解短語分析中的詞義歧義和結構歧義。
Cross ambiguity is a major type of ambiguity in Chinese word segmentation
交集型切分歧義(交集型歧義切分欄位)是漢語自動分詞中遇到的主要歧義類型。