How to say 詞干檢索 in English
Pinyin [cígānjiǎnsuǒ]
詞干檢索
English
auto stemming
- 詞 : noun 1 (說話或詩歌、文章、戲劇中的語句) speech; statement; lines of a play 2 (一種韻文形式 起於唐...
- 干 : Ⅰ noun 1 (事物的主體或重要部分) trunk; main part 2 (幹部的簡稱) short for cadre Ⅱ verb 1 (做...
- 檢 : Ⅰ verb 1 (查) check up; inspect; examine 2 (約束; 檢點) restrain oneself; be careful in one's c...
- 索 : Ⅰ noun 1 (大繩子; 大鏈子) a large rope 2 (姓氏) a surname Ⅱ verb 1 (搜尋; 尋找) search 2 (要; ...
- 檢索 : retrieval; retrieve; search; searching
At present, corpora are used more and more in fields such as machine translation (MT), information retrieval (IR), and web text mining, and the quality demanded of them keeps rising. Automatic stemming and part-of-speech (POS) tagging are fundamental to the construction of tagged corpora.
目前,在機器翻譯、信息檢索、 web文本挖掘等許多領域對語料庫的使用越來越多,要求也越來越高。而自動詞干提取和詞性標注是建立標注語料庫的基礎性工作。

The paper first analyzes the main theoretical foundations and key technologies of web information retrieval, then uses spider-based crawling to collect the source information. Based on the characteristics of Uighur words, it represents Uighur web page text by word features, using a Uighur stemming algorithm based on stem-table lookup and a pattern-based Uighur word combination algorithm. It then uses the vector space model to express the text information as structured data and applies cluster analysis to the structured text to obtain the text classification results.
本文首先分析了web信息檢索的主要理論基礎和關鍵技術,然後利用spider信息採集技術,實現了信息檢索的源信息採集;根據維吾爾語詞的特點,利用詞干表查找的維文詞干提取演算法和結合模式的維文詞語組合演算法,對維文網頁文本進行詞特徵表示;採用向量空間模型實現文本信息的結構化表達;使用聚類分析法,對結構化文本信息進行聚類,得到文本分類結果。

Chapter 4 builds on the previous chapters to construct an intelligent retrieval system for traditional Chinese medicine information, and describes the implementation of several functions, including thesaurus-based query expansion, association retrieval, and intelligent analysis.
第四章在前幾章的基礎上,建立中醫藥信息智能檢索系統,並重點闡述若干功能的實現,包括基於詞表的擴展檢索、關聯檢索等檢索方法以及智能分析的實現。
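
The second example sentence mentions stemming by stem-table lookup. As a rough illustration of how such stemming can feed stem-based retrieval (the sense of 詞干檢索), here is a minimal Python sketch. It is not the algorithm of the quoted paper: the stem table, the suffix list, the sample documents, and the function names `stem`, `build_index`, and `search` are all hypothetical, and toy English words stand in for Uighur data, which the source does not give.

```python
from collections import defaultdict

# Hypothetical stem table and suffix list (toy English data, not from the source).
STEM_TABLE = {"search", "index", "retriev", "stem"}
SUFFIXES = ["ing", "ed", "es", "al", "s", "e"]


def stem(word):
    """Strip one suffix if the remainder is in the stem table; else return the lowercased word."""
    word = word.lower()
    if word in STEM_TABLE:
        return word
    # Try longer suffixes first.
    for suffix in sorted(SUFFIXES, key=len, reverse=True):
        if word.endswith(suffix):
            candidate = word[: -len(suffix)]
            if candidate in STEM_TABLE:
                return candidate
    return word


def build_index(docs):
    """Map each stem to the set of ids of the documents that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in text.split():
            index[stem(token)].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain the stem of every query word."""
    result = None
    for token in query.split():
        ids = index.get(stem(token), set())
        result = set(ids) if result is None else result & ids
    return result or set()


# Hypothetical sample documents.
DOCS = {1: "searching indexes", 2: "retrieval of indexed pages", 3: "stems"}
INDEX = build_index(DOCS)
```

With this toy data, a query such as `search(INDEX, "searched index")` matches document 1 even though neither query word appears there in the same surface form. A word whose stem is missing from the table (for example "stemming", which leaves "stemm" after stripping "ing") is left unchanged, which is the main limitation of a pure table lookup.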