How to say 檢索詞聚類 in English

Pinyin [jiǎnsuǒ cí jùlèi]
檢索詞聚類 in English
term clustering
  • 檢 : Ⅰ verb 1 (查) check up; inspect; examine 2 (約束; 檢點) restrain oneself; be careful in one's c...
  • 索 : Ⅰ noun 1 (大繩子; 大鏈子) a large rope 2 (姓氏) a surname Ⅱ verb 1 (搜尋; 尋找) search 2 (要; ...
  • 詞 : noun 1 (說話或詩歌、文章、戲劇中的語句) speech; statement; lines of play 2 (一種韻文形式 起於唐...
  • 聚 : verb (聚集; 聚積) assemble; gather; get together
  • 類 : Ⅰ noun 1 (許多相似或相同的事物的綜合; 種類) class; category; kind; type 2 (姓氏) a surname Ⅱ verb...
  • 檢索 : retrieval; retrieve; search; searching
  1. After the documents have been clustered, at retrieval time the search terms supplied by the user are first compared with the center of each document cluster to find the cluster most similar to those terms. Only the documents in that cluster are then scored against the search terms, which narrows the retrieval range, improves retrieval efficiency, and to some extent reduces deviation in the retrieval results.

    對文檔進行了聚類,在檢索的期間,對用戶提出的檢索詞先進行和每一類心比較,得到與之最近的類別,僅將屬于該類別中的文檔與用戶提出的檢索詞進行運算,從而縮小了檢索的范圍,提高了檢索的效率,也在一定程度上克服了檢索結果的偏差。
  2. A method that combines category-based and keyword-based concepts for a better information retrieval system is introduced. To improve document clustering, a document similarity measure based on cosine vectors and keyword frequency in documents is proposed, together with an input ontology. The ontology is domain-specific and includes a list of keywords organized by degree of importance to the categories of the ontology; by means of semantic knowledge, the ontology can improve the document similarity measure and the feedback of information retrieval systems. Two approaches to evaluating the performance of this similarity measure, and a comparison with the standard cosine vector similarity measure, are also described.

    介紹了一種綜合各層級分類類目和對應關鍵詞來構造概念體系並用於改進信息檢索系統效果的方法。為了改進文本聚類的效果,提出了將領域知識本體和文本關鍵詞詞頻相結合的基於餘弦向量的文本相似性測度方法。該本體面向特定領域,將關鍵詞以不同權值對應于各分類類目,通過其語義知識來改進文本相似性測度以及信息檢索系統的效果。進一步給出了對基於本體的相似性測度方法進行效果評價的2種策略以及該方法與經典餘弦向量測度方法的比較結果。
  3. Meanwhile, it accelerates the search process by caching. (3) The Chinese word segmentation module supports the system's text segmentation; it uses an HMM-based disambiguation algorithm to improve the accuracy of word segmentation. (4) The search module responds to users' search requests; it applies an efficient clustering/classification algorithm to optimize the quality of the search service.

    它使用基於HMM模型的歧義消除演算法來提高分詞預處理的切分精度。(4)檢索模塊用來響應用戶的查詢請求。它利用簡單靈活的聚類/分類演算法來優化系統的搜索服務。
  4. According to this idea, we propose a new retrieval method that combines XPath with the vector space model, named the vector retrieval model based on XPath. Secondly, we make full use of the hierarchical architecture of XML data and analyze the structure of every document to construct a structure thesaurus, which is designed to navigate user queries and to eliminate structural conflicts.

    根據這一思想,作者提出了將XPath語言與傳統的向量空間模型相結合,實現基於簡單XPath路徑的向量檢索演算法來實現對XML文檔的檢索。充分利用XML文檔分層次體系結構的特點,對于每篇XML文檔分析其文檔結構,並採用學習演算法形成文檔結構詞典,從而實現XML文檔查詢的導航機制和消除文檔結構的異構性。
  5. Firstly, the paper introduces the main theories and technologies of web information retrieval. It then applies a spider to gather information. According to the characteristics of the Uighur language, Uighur text segmentation is realized using table-lookup-based Uighur stemming and a combined-mode algorithm; using the vector space model, the paper converts Uighur text information into structured data; and by applying clustering analysis, the structured text is clustered.

    本文首先分析了web信息檢索的主要理論基礎和關鍵技術,然後利用spider信息採集技術,實現了信息檢索的源信息採集;根據維吾爾語的特點,利用詞干表查找的維文詞干提取演算法和結合模式的維文詞語組合演算法,對維文網頁文本進行特徵表示;採用向量空間模型實現文本信息的結構化表達;使用聚類分析法,對結構化文本信息進行聚類,得到文本分類結果。
  6. Abstract: This paper introduces the status of the digital reference book system in the university library and systematically explains the problem of scattered teaching resources. To address this problem, the paper introduces a system that can cluster and reconstruct search results through keyword indexing technology.

    摘要:簡要介紹數字圖書館中教學參考書系統的應用現狀,並就其中數字教學資源分佈零散的問題,提出一種通過關鍵詞索引技術,能夠對文本教學資源進行檢索,然後將相關聯的結果重組之後集中呈現的系統。
  7. In this part, the thesis first profiles the semantic features of each document by employing Chinese information processing technology, in order to convert documents into a form that can be operated on with mathematical methods. Second, the thesis profiles each user's information needs in three ways: 1) accepting the information provided by the user; 2) watching the user's retrieval actions; and 3) analyzing the web server log. In this module, users are also classified into different categories according to their information needs.

    在用戶建模中,系統從三方面獲取用戶信息需求特徵,第一,用戶主動地向系統提供需求信息;第二,系統檢測用戶檢索行為,從用戶檢索詞分析其需求;第三,系統通過分析web訪問日誌,得到用戶的興趣所在及興趣的變化狀況,並進一步利用對用戶訪問文檔內容的分析來追蹤其興趣變化,將用戶興趣同樣表示為興趣特徵向量,聚類相似用戶。
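
Example sentence 1 describes cluster-based retrieval: the search terms are first compared with each cluster center, and only the documents in the closest cluster are then scored. The Python sketch below is one possible illustration of that idea, not the method of the quoted paper. It assumes the documents are already grouped into clusters, uses plain whitespace tokenization with raw term counts, and uses cosine similarity as the comparison; the names `vectorize`, `centroid`, and `cluster_retrieve` are hypothetical.

```python
import math
from collections import Counter


def vectorize(text):
    """Bag-of-words term counts (assumption: whitespace tokenization)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-weight mappings."""
    dot = sum(weight * b[term] for term, weight in a.items() if term in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def centroid(vectors):
    """Cluster center: the mean term weight over the cluster's documents."""
    total = Counter()
    for vec in vectors:
        total.update(vec)
    return {term: count / len(vectors) for term, count in total.items()}


def cluster_retrieve(query, clusters):
    """Pick the cluster whose center is closest to the query, then rank
    only that cluster's documents against the query.

    clusters: dict mapping a cluster name to a list of document strings
    (hypothetical input; the clustering step itself is not shown).
    """
    q = vectorize(query)
    centers = {
        name: centroid([vectorize(doc) for doc in docs])
        for name, docs in clusters.items()
    }
    best = max(centers, key=lambda name: cosine(q, centers[name]))
    ranked = sorted(
        clusters[best],
        key=lambda doc: cosine(q, vectorize(doc)),
        reverse=True,
    )
    return best, ranked
```

Calling `cluster_retrieve("search query", clusters)` returns the name of the closest cluster and that cluster's documents ranked by similarity, so documents in every other cluster are never scored. That is the narrowing of the retrieval range the sentence refers to.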