information clustering 中文意思是什麼

information clustering 解釋
信息聚合法
  • information : n. 1. 通知,通報,報告。2. 報導,消息,情報。3. 資料,知識,學識。4. 【自動化】信息,數據。5. 【法律】起訴,告發。adj. -al
  • clustering : 叢聚
  1. This paper adopts an adaptive learning algorithm based on hierarchy clustering to update user profile, which continuously abstract the cancroids of one class of optimum information from the feedback flow of system, which effectively shield the learning process from plenty of feedback noises produced by distorted threshold and sparseness of initial information, which also can imitate artificial feedback approximately to perfect the intelligence of adaptive learning mechanism

    摘要本文採用一種基於層次聚類的自適應學習策略,從系統反饋的信息流中,動態提取一類最優信息的質心更新用戶模型,有效屏蔽了閾值失真和初始信息稀疏造成的大量反饋噪聲,並且能夠近似模仿人工反饋,完善自適應學習機制的智能性。
  2. ( 3 ) for the product failures of the refrigeratory, research the algorithm of product policy decision tree and clustering analysis, resulting in a satisfactory failure information report for manufacture departments

    ( 3 )針對冰箱產品故障信息的缺陷,通過聚類分析,並按決策樹演算法對產品故障進行研究,得出令生產部門滿意故障信息報表。
  3. First, we have expatiated the working principle, performance parameters and major technologies. farther, we have analyzed the shortcomings of the existing catalog search engine and introduced the clustering analysis and the ant algorithm ; on the basis of this, we discussed the possibility and necessity of the connection between them, which avoids the local optimization of the clustering analysis to a degree. in the end, we appraise the idea that we deal with the information data by the data structure of the binary tree, m - branch tree and tree established by the ant algorithm, which can improve the efficiency of the search engine

    首先闡述了搜索引擎的工作原理,性能指標,主要技術;分析了現有目錄式搜索引擎的缺點,接著介紹了聚類分析演算法與螞蟻演算法的理論,並論述了二者結合的可能性和必要性,這種結合方法也在一定程度上克服了聚類分析演算法容易陷入局部最優的缺點,最終提出了通過使用螞蟻演算法建立二叉樹、 m叉樹和樹作為信息數據處理的思想,大大提高了搜索引擎搜索的效率。
  4. Based on this kind of relations between the topological structures and the content distributions we study the web modelling, community identification and some related application problems in detail : first, after some existed characteristics of the web topology are verified, some new characteristics are discovered : the high clustering property in micro - topology ( high average gathering coefficient ), the obvious mapping relation between the topological struture and the content in micro - level 、 linear irrelevant between the degree distribution of network nodes and the relative degree distribution of contents etc. then after analysis the topology of the complex network and the network modeling, the muti - scale determinism is proposed, especially for the information network a web evolvement model ( prcp model ) that fused the node authority and the node correlation is proposed. the model deduction, evolving learning verification and large scale experiment proof indicate that the model can explain the micro - topology centralizing phenomena, can imitate the mapping relation between the network connecting distribution and network content relative distribution and also can predict the mapping relation between the topology clustering and content clustering

    本文在詳細觀察了web網路的拓撲結構特徵以及拓撲結構與內容分佈相互關系的基礎上,以信息網路的物理連接拓撲結構與節點內容相關度分佈之間的相互關系為主線,從網路特徵、網路建模、社區分析及相關應用方面問題進行了深入細致地探討:首先在驗證了前人提出的web網路拓撲結構特徵基礎上,進一步發現了信息網路所具有的一些新特徵: 1 )網路微觀顆粒度的拓撲結構聚團與內容聚團存在明顯的映射關系,具體包括節點之間的物理連邊概率與節點之間的內容相關度成指數比例關系、節點形成三角形拓撲結構的概率與節點內容相關緊密程度之間同樣具有一種指數比例關系; 2 )網路節點連接度整體分佈與節點內容相關度整體分佈是線性無關的; 3 )網路微觀拓撲結構中的存在很強的集聚性(平均聚團系數很高) 。
  5. The strategy is made based on the competition analysis in view of the economy, politics, technology, industry environment, industry clustering, demand and growth of the industry. the competence and the information of main competitors are also key factors leading to the making of the strategy

    通過競爭分析,即對行業社會、經濟、政治和技術環境的一般性分析,對行業集中度、行業需求、行業增長、競爭力和主要競爭對手資料的綜合整理和分析,結合企業資源和競爭能力,制定三一重工競爭戰略。
  6. In this text, we first do some research on the genetic algorithm about clustering, discuss about the way of coding and the construction of fitness function, analyze the influence that different genetic manipulation do to the effect of cluster algorithm. then analyze and research on the way that select the initial value in the k - means algorithm, we propose a mix clustering algorithm to improve the k - means algorithm by using genetic algorithm. first we use k - learning genetic algorithm to identify the number of the clusters, then use the clustering result of the genetic clustering algorithm as the initial cluster center of k - means clustering. these two steps are finished based on small database which equably sampling from the whole database, now we have known the number of the clusters and initial cluster center, finally we use k - means algorithm to finish the clustering on the whole database. because genetic algorithm search for the best solution by simulating the process of evolution, the most distinct trait of the algorithm is connotative parallelism and the ability to take advantage of the global information, so the algorithm take on strong steadiness, avoid getting into the local

    本文首先對聚類分析的遺傳演算法進行了研究,討論了聚類問題的編碼方式和適應度函數的構造方案與計算方法,分析了不同遺傳操作對聚類演算法的性能和聚類效果的影響意義。然後對k - means演算法中初值的選取方法進行了分析和研究,提出了一種基於遺傳演算法的k - means聚類改進(混合聚類演算法) ,在基於均勻采樣的小樣本集上用k值學習遺傳演算法確定聚類數k ,用遺傳聚類演算法的聚類結果作為k - means聚類的初始聚類中心,最後在已知初始聚類數和初始聚類中心的情況下用k - means演算法對完整數據集進行聚類。由於遺傳演算法是一種通過模擬自然進化過程搜索最優解的方法,其顯著特點是隱含并行性和對全局信息的有效利用的能力,所以新的改進演算法具有較強的穩健性,可避免陷入局部最優,大大提高聚類效果。
  7. New comprehensive evaluation algorithm based on fuzzy clustering and information entropy

    基於模糊聚類和信息熵的綜合評價演算法
  8. This article discussed the chinese word slice, character extraction, character expression and character matching methods, and established the chinese text classification and clustering algorithms based on neural network. in the design of chinese text mining based on web, the paper analyzed and researched the expression of web page information, structure feature, web page control symbol and html control symbol, and built the extraction flow of web page information, then gave two concrete application of chinese text mining based on web through combining with practical problems

    討論了文本分類中的中文詞切分、特徵提取、特徵表示、特徵匹配方法,建立了基於神經網路的中文文本分類、聚類演算法,在web中文文本信息挖掘的設計中,對網頁信息的表示、結構特點、網頁控制符、 html控制符號處理進行了詳細分析與研究,構建了網頁信息提取流程,並結合實際問題,給出了web環境下中文文本信息挖掘的兩個具體應用。
  9. Different from traditional classification machine, our research is preceded under the situation of lacking class label and class information, replacing manual classification with clustering in order to gain classification information and the rustle is good

    與傳統分類器不同,我們在缺乏類信息的情況下,採用聚類替代領域專家的人工分類獲得類信息,為構造分類器提供合適的類信息,取得了較好效果。
  10. A method that combines category - based and keyword - based concepts for a better information retrieval system is introduced. to improve document clustering, a document similarity measure based on cosine vector and keywords frequency in documents is proposed, but also with an input ontology. the ontology is domain specific and includes a list of keywords organized by degree of importance to the categories of the ontology, and by means of semantic knowledge, the ontology can improve the effects of document similarity measure and feedback of information retrieval systems. two approaches to evaluating the performance of this similarity measure and the comparison with standard cosine vector similarity measure are also described

    介紹了一種綜合各層級分類類目和對應關鍵詞來構造概念體系並用於改進信息檢索系統效果的方法.為了改進文本聚類的效果,提出了將領域知識本體和文本關鍵詞詞頻相結合的基於餘弦向量的文本相似性測度方法.該本體面向特定領域,將關鍵詞以不同權值對應于各分類類目,通過其語義知識來改進文本相似性測度以及信息檢索系統的效果.進一步給出了對基於本體的相似性測度方法進行效果評價的2種策略以及該方法與經典餘弦向量測度方法的比較結果
  11. Firstly, it presents the storing arithmetic based on the mapping policy between xml data modal and object - oriented modal. reference to the arithmetic of extracting object - oriented database schemas from xml dtds using inheritance and other commercial tools for xml storing, it improves mapping policy from xml to object, which optimizes the new semantic classes, what ' s more, it present object clustering policy to resolve the uncertainty of xml schema and the complexity of information intergration, which simultaneously focuses on the semanteme and structure of new object classes. on the other hand, it presents method to realize exchanging from object to xml

    本文研究構造基於xml信息集成系統結構的面向對象數據庫包裝器,提出面向對象數據庫包裝器的系統結構;根據該結構提出xml的數據模式與面向對象數據庫對象數據模式的映射策略及相互存儲轉化演算法,一方面我們借鑒基於dtd模式的繼承對象映射提取演算法及各種商業工具,提出dtd簡化演算法和基於dtd簡化結構的對象圖映射演算法,優化了生成的對象類、提高了對象類的語義表達能力,也改進了對象映射提取策略;同時採用模糊聚類策略,提出對象聚類處理演算法,改善了xml語義定義的隨意性給對象類提取及信息集成帶來的復雜性;另一方面本文提出對象到xml的轉化演算法,採用系統自動定義對象到xml的轉化方法實現對象到xml的轉化處理。
  12. It adopts the hierachical clustering in vocabulary vsm model because of its special function, on the other hand enriches the subcategory tagging information by rules, it can decrease me data sparse problem, and introduces the confidence intervals into the model for the selection of priority between statistics and rules

    另外還對標注模型從兩方面作了優化,由於詞匯特徵向量的特殊作用,本文對特徵詞匯採用層次聚類來提高其分類精度;另一方面,引入規則來進一步豐富細分類標注信息,減少數據稀疏等問題,並且引入置信度來選擇統計與規則的優先關系。
  13. With respect to a kind of clustering problem of which both the value of characteristic index and weight of index are of linguistic assessment information, a new approach is presented for cluster analysis

    摘要針對一類特徵指標值和指標權重均為語言評價信息的聚類問題,提出了一種新的聚類分析方法。
  14. Users of web search engines are often forced to shift through the long ordered list of document " snippets " returned by the engines. this paper applied web content mining to the field of search engine. search engine results clustering relies on the information returned by the search engine

    本文將web內容挖掘技術應用於搜索引擎領域,它依賴于搜索引擎結果所提供的信息來歸納出聚類,使得在搜索引擎返回的非常大的文檔列表中的過濾操作變得十分方便。
  15. The application of clustering into information filtering, to a certain degree, promotes the filtering efficiency of the system, and plays an active role in the examination of the precision and recall of the text. the indeterminacy and vagueness of natural language cause difficulty to nlp

    本質上,聚類屬於一種無監督的學習,將聚類技術應用於信息過濾中可以在一定程度上提高系統的過濾效率,同時也對信息過濾的查準率與查全率有積極的作用。
  16. ( 1 ) puts forward a new text representation model, which originates from the theory of equivalence division of the rough set, defines the similitude of this model, and proposes the approach to calculate the text similitude of this model. ( 2 ) puts the text clustering techniques into the practice of information filtering

    提出了一種新的文本表示模型,該模型基於粗糙集的對知識的等價劃分的思想,試圖保持文本的概念信息:定義了該模型下的粗糙相似度;並提出了基於該模型的計算文本相似度的方法。
  17. Hong kong s cyberport, a world - class information infrastructure for the clustering of quality information technology companies, is now inviting applications for office tenancy

    具備世界一流資訊基建設施以匯聚優質資訊科技公司的數碼港現正接受租用申請。請立即查看計劃詳情,下載及向我們遞交申請表吧!
  18. However, their current status is still far from user ' s satisfaction. lt includes : ( 1 ) the content that search engine returns is a enormous flat bill ( information overloading question ) ; ( 2 ) the items return with search engine are not the content that user requisite in deed ( low precision question ) this paper presents a fuzzy ( soft ) clustering algorithm htsc ( hyperlink - text based soft clustering ) using a mixed similarity metric of document content and inter - document hyperlinks, for clustering web search results from a search engine in order to help users find relevant web information more easily

    這主要表現為: ( 1 )搜索引擎返回的結果是一個龐大的平坦結構的資源清單(即信息負載問題) ; ( 2 )搜索結果中的信息項並非都是用戶真正需要的信息資源(即低精度問題) ;論文提出了一種基於文檔文本內容和文檔間超鏈信息的混合相似度的模糊(軟)聚類演算法htsc 。該演算法可對搜索引擎返回的結果進行模糊聚類,以方便用戶從中找到真正需要的信息。
  19. Through discussing such core technologies in the automatic processing of chinese information as automatic word segmentation, feature selecting and automatic representation of texts, the thesis makes some improvements and perfection on the current methods of automatic word segmentation and text space reduction of chinese texts, therefore improved their efficiencies and effects. with regard to the methods of text classification, the paper introduced two supervisory automatic classification methods of chinese texts based on multi - classification, i. e. fuzzy clustering and boosting, which settled the problem of low percentage of recall. through comparing the results of experiments with the two methods, an automatic classification system of multi - classification texts is constructed based on the boosting method, which received good effects in application and provides a good resolution to the problem of real - time classification of information

    通過對漢語信息自動處理中自動分詞、特徵提取、文本自動表示等核心技術討論,對目前漢語文本自動分詞和文本降維方法中的不足和缺陷作了改進,提高了分詞和文本分類的效率和效果;在文本自動分類方法上,介紹了兩種有監督的基於多類的漢語文本自動分類處理方法? ?模糊聚類方法和boosting方法,解決了實踐中文本分類查全率不高的問題;通過對兩種方法的實驗比較結果,構建了基於boosting方法的多類文本自動分類系統,在實際應用中收到了良好的效果,較好的解決了信息的實時分類問題。
  20. Because of the shortage of traditional unsupervised algorithm, three algorithms are proposed in the paper based on the spatial information of the difference image and the clustering characteristic of 2 - d histogram formed by pixel gray levels and the local average gray levels. the proposed algorithms segment the pixels of

    本文針對典型演算法存在的不足,充分利用差異圖象灰度空間分佈信息和差異圖象灰度-鄰域平均灰度二維直方圖的聚類特性提出了三種非監督變化檢測方法,將差異圖象所有像素分成變化和非變化兩個類別。
分享友人