詞語切分 的英文怎麼說
中文拼音 [cíyǔqiēfēn]
詞語切分
英文
word segmentation- 詞 : 名詞1 (說話或詩歌、文章、戲劇中的語句) speech; statement; lines of play 2 (一種韻文形式 起於唐...
- 語 : 語動詞[書面語] (告訴) tell; inform
- 切 : 切Ⅰ動詞1 (合; 符合) correspond to; be close to 2 (用在反切后頭 表示前兩個字是注音用的反切)見 ...
- 分 : 分Ⅰ名詞1. (成分) component 2. (職責和權利的限度) what is within one's duty or rights Ⅱ同 「份」Ⅲ動詞[書面語] (料想) judge
- 詞語 : words and expressions; terms
-
While the present century was in its teens, and on one sunshiny morning in june, there drove up to the great iron gate of miss pinkerton ' s academy for young ladies, on chiswick mall, a large family coach, with two fat horses in blazing harness, driven by a fat coachman in a three - cornered hat and wig, at the rate of four miles an hour
有時英語長句中主句與從句或主句與修飾語間的關系並不十分密切,翻譯時可按漢語多用短句的習慣,把長句中的從句或短語化為句子,分開來敘述;為使語意連貫,有時還可適當增加詞語。Focusing on the function words, it recognizes all of the phrases which contain some function word, and analyzes the relationship of the phrases, to form the syntactic tree
它以語法功能詞為切入點,首先識別出所有包含語法功能詞的短語,然後分析短語之間的組合關系,最終得到句法樹。The effects of word splitting and character - space on comprehension of chinese text presented in leading format
詞切分與字間距對引導式漢語文本閱讀工效的影響The introduction reveals if s value to the research of the development of zen, and of the development of thought of zen in china. the second chapter tries to solve the problems of the decision of the names of some manual scripts in dunhuang zen documents and some other problems. it clears the misunderstanding in a few important questions. the next chapter, stepping in from a researching and data - sorting - out view, discusses in details the writing symbols and their explanations in the zen documents and the mistakes often seen in proof - reading, etymological and semantic explanations of words and phrases
第一章概要論述了敦煌禪宗文獻的內容、時代及研究的歷史與現狀,揭示了敦煌禪宗文獻在中國禪宗發展史、禪思想史研究方面的重要價值;第二章對敦煌禪宗文獻部分寫卷的定名及諸多問題進行了考論,澄清了已往對一些重大問題的錯誤認識;第三章從文獻整理與研究的角度切入,對敦煌禪宗文獻的書寫符號與釋讀、寫本的常見錯誤、校勘以及詞語考釋等作了詳細論述,揭示出釋讀校理敦煌文獻與研治敦煌學的普遍規律;第四章就敦煌禪宗文獻的語料價值,包括對大型語文辭書的局限性、失收詞語、釋義、溯源等問題的探討,為辭書編纂修訂和構建科學完備的漢語史提供了第一手資料。This book comprehensively systematically inspected chinese and english color word cuts minute on in the semantic field the similarities and differences, has enumerated chinese and english color word follows in the significance the characteristic extremely with the social history culture relation
這本書全面系統地考察了漢語和英語顏色詞在語義場切分上的異同,列舉了漢語和英語顏色詞的伴隨意義上的特點極其與社會歷史文化的聯系。In this paper, the word segmentation technology of chinese text classification is debated emphatically. and the method of word segmentation based on the phrase labeling of 2 - gram syntax is put forward combining the method of setting separate - signs and the method based on the statistic of word - frequency, which can recognize the vocabularies which the method based on the dictionary can not manage
對于基於信息過濾的自動分類問題,使用字典分詞並不是一個必須的過程,因而本文提出了基於2元語法短語標引的分詞方法,它將設立切分標志法與基於詞頻統計的方法相結合,可以識別基於詞典方法處理不了的詞匯,如:人名、地名、專業術語等。In addition to word segmentation and part - of - speech tagging, the processing involves the tagging of proper nouns ( person names, place names, organization names arid so on ), morpheme subcategories and the special usages of verbs and adjectives
加工項目除詞語切分和詞性標注外,還包括專有名詞(人名、地名、團體機構名稱等)標注、語素子類標注以及動詞、形容詞的特殊用法標注。In addition to word segmentation and part - of - speech tagging, the processing involves the tagging of proper nouns ( person names, place names, organization names and so on ), morpheme subcategories and the special usages of verbs and adjectives
加工項目除詞語切分和詞性標注外,還包括專有名詞(人名、地名、團體機構名稱等)標注、語素子類標注以及動詞、形容詞的特殊用法標注。In this paper, a processing approach to the complex long sentence based on multi - strategy is proposed, in which the rule - based analysis and case - based consistent matching are combined by using various linguistic features, including sentence length, punctuation, functional words and contextual condition and so on, which is used to segment the complex sentence into several simple sentences and to compose the translation text of these simple sentences together
本文提出一種多語種通用的基於多策略分析的復雜長句翻譯處理演算法,該演算法通過基於實例模式匹配和規則分析相結合的方法,綜合利用源語言句子中多種相關的語言特徵,包括語法語義特徵、句子長度、標點符號、功能詞以及上下文語境條件等對復雜長句進行切分簡化處理和譯文的復合生成。The core algorithms in the candidate set creating function module are code pretreatment algorithm and candidate set creating algorithm. candidate list creating function module is the most important module, its core algorithms include candidate list adjusting algorithm, candidate matrix creating algorithm, original word lattice creating algorithm, language element node creating algorithm, optimal candidate words searching algorithm and candidate list creating without code algorithm which
候選列表生成模塊是整個系統最主要的模塊,主要核心演算法包括候選字詞調整演算法、候選矩陣生成演算法、初步詞網格生成演算法、語言元素結點的生成演算法、尋找最佳語句候選演算法以及無編碼候選列生成演算法,無編碼候選列生成演算法利用對用戶已輸入的漢字進行切分標注,通過系統中的知識庫信息在沒有輸入編碼的情況下預測后續的輸入。The strategy of overcoming ambiguity and unlanded words in chinese words automatically seperating system
漢語自動分詞系統中切分歧義與未登錄詞的處理策略3 、 chinese words divided syncopation technology is the difficulty of the query technique based on phrase. some divided syncopation such as mechanical matching method, feature phrase library method, restriction matrix method, syntax analysis method and comprehended syncopation method are emphasized
對其中的幾種方法,如機械匹配法(即mm法) 、特徵詞庫法、約束矩陣法、語法分析法和理解切分法等做了詳細的比較和分析,並歸納出各自特點。Based on 6. 4 million chars of chinese ancient poetry, the computer aided research system of chinese ancient poems provides a word - based analysis platform of chinese ancient poems. more than 50000 chinese words, including 40814 multi - char words, were extracted from the corpora via statistic method. besides the full text retrieving function, the system also provide word - based statistic analysis, sentence based similarity retrieving, automatic pinyin tagging and some other useful functions to benefit the profound analysis of the chinese ancient poems. the national social science foundation of china 1998 - 1999 funded the project
在對詩文進行詞語切分的基礎上,建立了詞匯的共現關系對仗關系以及詞匯的作者分佈特徵信息。系統除了提供面向詩文內容的全文檢索功能外,還進一步開發了基於詞匯的統計分析和詩句相似性檢索等功能,實現了對全唐詩的自動注音。In this novelty approach, proper nouns " term information together with rules attained by transformed - based error - driven learning is used to label properties of segmented text, so as to recognize proper nouns
該方法利用從語料庫中自動提取到的專有名詞信息和採用基於轉換的錯誤驅動學習方法獲得的規則,對切分文本進行屬性標注,最終實現專有名詞的識別。And also in this article, the chinese word automatic segmentation is designed which uses the melio - rative markoff statistics process. the chinese word bank is built up by matching the unimportant - word base, the single - word base and the first - word base. the feature vector is drawn out through assessing the weight of every word, enters into the regular band as regulations and becomes a criterion for text filtering
本文設計了中文分詞處理演算法,此演算法通過對停用詞庫、單字詞庫與前置詞庫進行匹配,並通過對分詞進行切分,利用改造型馬爾可夫n元語言模型處理分詞中出現的歧義詞方法,建立了中文分詞詞庫,然後對分詞進行權重評價從而提取特徵向量,並作為規則歸入規則庫里,作為文本過濾的標準,最後提出防止敏感信息通過的一些過濾策略。Some of the terms or items in box 8 are accounting terms and have no precise or uniform definition
第8欄的部分詞語或項目是會計用詞,沒有確切或統一的定義。Some of the terms or items in box 11 ( box 10 ) are accounting terms and have no precise or uniform definition
第11欄(第10欄)的部分詞語或項目是會計用詞,沒有確切或統一的定義。The original knowledge which is described in natural language in electronic text is disposed by filtrating, semantic block segmenting, word segmenting, syntax and semantic analyzing, pattern matching. in the end it is expressed in the semantic network and storaged in the multimedia knowledge database
在本系統中,原始的以電子文本形式存在的自然語言描述的知識經過過濾,切分語義塊,分詞及語法、語義分析,模式匹配,以語義網路的形式存入多媒體數據庫。Through analyzing segmentation methods in existence, this paper points out three developing directions of future chinese automatic segmentation research, that is, effective segmentation to traditional text, rapid development of computer technology, and changing the writing rules of chinese text
本文通過對現有分詞方法的分析,指出了今後漢語自動分詞研究的三個發展方向,即對傳統文本的有效切分,計算機技術的快速發展,改造書面漢語書寫規則。Disambiguity and recognition of unknown words are most important points for design of word segmentation systems. in this paper, firstly, we introduce an applied strategy to disambiguity. then we put forward an integrated and fast recognition strategy of proper noun, including chinese person names, chinese place names, translated foreign names and corporation & organization names, in modern chinese word segmentation system, which successfully resolves the conflict among these proper nouns and ordinary words
本文首先闡述了現代漢語通用分詞系統( gpws )中歧義切分技術和專名識別技術,在歧義切分技術中,提出了一種切分規則庫與基於歧義知識庫動態校正相結合的實用歧義處理策略;在專名識別技術中,本文提出了一種專名(包括譯名在內的人名、地名、企業字號、企業名和機構名等)一體化、快速識別方法。分享友人