自動分詞 的英文怎麼說
中文拼音 [zìdòngfēncí]
自動分詞
英文
automatic word segmentation- 自 : Ⅰ代詞(自己) self; oneself; one s own Ⅱ副詞(自然;當然) certainly; of course; naturally; willin...
- 分 : 分Ⅰ名詞1. (成分) component 2. (職責和權利的限度) what is within one's duty or rights Ⅱ同 「份」Ⅲ動詞[書面語] (料想) judge
- 詞 : 名詞1 (說話或詩歌、文章、戲劇中的語句) speech; statement; lines of play 2 (一種韻文形式 起於唐...
- 自動 : 1 (自己主動) voluntarily; of one s own accord 2 (不憑借人為的力量) automatic; spontaneous 3 ...
-
The method based on statistics has the problem of training data ' s rarefaction, and what restricts the more progress of corpus is the too large workload of manual tagging
基於統計的漢語自動分詞方法存在訓練數據稀疏的問題,而人工標注工作量過大又制約著語料庫規模的進一步擴大。Thi s paper proposes a new automatic natura1 1anguage text categorization modu1e based on concept. thi s modu1e takes howwenet as the main source of knowledge, the concepts of words as the bas ; is of text categorization. the concepts of words are reduced to sememes and the tc is performed in the classfiab1e sememe vector space ( csvs )
本文提出了一個基於概念的自然語言文本自動分類模型,該模型以《知網》為主要的概念知識源,以詞所表示的概念為分類基礎,把概念繼續分解至義原,並在可分義原組成的向量空間進行文本分類。And then, the paper use the measure of literal similarity to classify the chinese web pages, which based on lots of experiential classifying data. then, the author uses borland delphi and visual foxpro to develop an automatic indexing system, which is used to process chinese web pages. the experiential system is composed of web pages text analysis, automatic words extracting, automatic subject indexing, classifying, indexing result confirmation and knowledge database maintenance
隨后,本文利用borlanddelphi 、 visualfoxpro等工具設計並開發了一個包括中文網頁文本信息提取、自動抽詞、自動主題與分類標引、標引結果處理、知識庫維護等功能,用以處理中文網頁信息的自動標引實驗系統;並簡要介紹了系統的設計、工作流程、使用方法及運行條件。Word segmentation is the basis of chinese information processing ( nlp )
自動分詞技術是中文信息處理的基礎工程。One part, natural chinese language understanding part, is composed of lexical analysis module, producing mql ( middle query language ) module and producing sql query sentence module
在自然漢語理解部分中,有自動分詞、中間語言生成和sql語句生成三個模塊。The dissertation gives a minute description of the chinese automatic segmentation model and the way to apply it in the chinese intelligence search engine
本文詳細描述了中文智能搜索引擎中漢語自動分詞模塊的設計方案與實現方法。At first system accomplishes chinese language automatic word segmentation and part - of - speech tagging through chinese input approach with word segmentation, then forms corresponding surface semantic network according to the semantic structure grammar, and finally gets corresponding data flow diagram and data dictionary according to the automatic generation algorithms of data flow diagram and data dictionary, the whole completion of the work, can not only provide a description environment of natural language for case, but also develop into the system which takes the question described on the basis of the natural language as the system ' s input
工作的中心是自然語言篇章理解。系統首先通過分詞輸入法實現漢語自動分詞與詞性標注,然後根據語義結構文法產生相應的表層語義網路,最後根據數據流圖、數據字典自動生成演算法轉換為相應的數據流圖和數據字典。這項工作的徹底完成,不僅可以給case提供一個自然語言的描述環境,而且可進一步發展為基於自然語言描述問題作為輸入的系統。21 fu jianlian, chen qunxiu. topic analysis in the automatic summarization system
中文句法分析的基礎是自動分詞詞性標注和命名實體識別。The strategy of overcoming ambiguity and unlanded words in chinese words automatically seperating system
漢語自動分詞系統中切分歧義與未登錄詞的處理策略Through discussing such core technologies in the automatic processing of chinese information as automatic word segmentation, feature selecting and automatic representation of texts, the thesis makes some improvements and perfection on the current methods of automatic word segmentation and text space reduction of chinese texts, therefore improved their efficiencies and effects. with regard to the methods of text classification, the paper introduced two supervisory automatic classification methods of chinese texts based on multi - classification, i. e. fuzzy clustering and boosting, which settled the problem of low percentage of recall. through comparing the results of experiments with the two methods, an automatic classification system of multi - classification texts is constructed based on the boosting method, which received good effects in application and provides a good resolution to the problem of real - time classification of information
通過對漢語信息自動處理中自動分詞、特徵提取、文本自動表示等核心技術討論,對目前漢語文本自動分詞和文本降維方法中的不足和缺陷作了改進,提高了分詞和文本分類的效率和效果;在文本自動分類方法上,介紹了兩種有監督的基於多類的漢語文本自動分類處理方法? ?模糊聚類方法和boosting方法,解決了實踐中文本分類查全率不高的問題;通過對兩種方法的實驗比較結果,構建了基於boosting方法的多類文本自動分類系統,在實際應用中收到了良好的效果,較好的解決了信息的實時分類問題。A critical appraisal of the research on chinese word segmentation
漢語自動分詞研究評述After words - cutting and word - tagging with maximum matching method, lexical analysis module check syncopate ambiguity, and disambiguate ambiguity of intersection with right - first rule
先對查詢語句自動分詞和詞性標注,本文採用最大正向匹配演算法,對每個分詞結果進行尾部歧義檢查,對最主要的交集型歧義採用歸右原則處理。The chinese automatic word segmentation is an important part in the chinese information processing
漢語自動分詞是中文信息處理中的重要環節。Through analyzing segmentation methods in existence, this paper points out three developing directions of future chinese automatic segmentation research, that is, effective segmentation to traditional text, rapid development of computer technology, and changing the writing rules of chinese text
本文通過對現有分詞方法的分析,指出了今後漢語自動分詞研究的三個發展方向,即對傳統文本的有效切分,計算機技術的快速發展,改造書面漢語書寫規則。It discusses and analyzes about automatic word segmentation methods for the chinese language emphatically. moreover it gives the application expectation among this system a method of word segmentation based on the reverse directional maximum matching method. it also proposes that the key of the word segmentation still needs an intact and rational word segmentation dictionary
本文首先研究和討論了基於自然語言的語義分析方法,對漢語的自動分詞方法進行了著重討論和分析,並給出了一種基於反向最大匹配法的分詞方法在本系統中的應用展望,提出分詞的關鍵還需要一個完整合理的分詞詞典。The results showed that the new dictionary mechanism based on the two - word - bitmap could improve speed and achieve more efficiency in chinese word segmentation
實驗結果表明,基於二字詞檢測位圖表的分詞詞典機制有效地提高了漢語自動分詞的速度和效率。Cross ambiguity is a major type of ambiguity in chinese word segmentation
交集型切分歧義(交集型歧義切分欄位)是漢語自動分詞中遇到的主要歧義類型。Chinese automatic segmentation is one of the most difficult problems in computer chinese information disposal
摘要漢語自動分詞是計算機中文信息處理中的難題。Researching the automatic judge of chinese word - split and sentence - split becomes an urgent task
漢語自動分詞分句的自動評判系統的研究已經成為一項亟待解決的緊迫課題。In a word, researching the automatic judge of chinese word - split and sentence - split is just on the drawing boards
漢語自動分詞分句的自動評判的研究總的來說還處在剛剛起步的階段。分享友人