What does "data set extension" mean in Chinese?

Definition of "data set extension"
數據集擴充 (data set expansion)
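  • data : n. 1. Information, material (this word is the plural of datum, but datum is rarely used; data generally serves as the collective noun, and in spoken usage it often takes the singular...
  • set : SET = Secure Electronic Transaction (a commercial transaction in which payment is made by credit card over the Internet). n. 【埃...
  • extension : n. 1. Lengthening, stretching, prolongation, extension, expansion, enlargement; extent, scope. 2. Postponement; (US) an added part (of a building), (of a railway, etc...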
  1. Optimized association rules are permitted to contain uninstantiated attributes. The optimization procedure determines the instantiations such that some measure of the rules is maximized. This paper maximizes interest in order to find more interesting rules. In addition, the approach permits the optimized association rule to contain uninstantiated numeric attributes in both the antecedent and the consequent. A naive algorithm for finding such optimized rules can be obtained by a straightforward extension of the algorithm for a single numeric attribute; unfortunately, it performs poorly. A heuristic algorithm that finds approximately optimal rules is proposed to improve performance. Experiments with synthetic data sets show the advantage of interest over confidence for finding interesting rules with two attributes. Experiments with a real data set show that the algorithm scales approximately linearly and achieves good accuracy.

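    Optimized association rules may contain uninstantiated attributes. The optimization process determines how to instantiate these attributes so that some measure is maximized. The interest factor is maximized in order to discover more interesting rules; in addition, the optimized rule may contain one uninstantiated numeric attribute in each of the antecedent and the consequent. A simple algorithm for discovering such optimized rules can be obtained by directly extending the algorithms that handle a single numeric attribute. This approach performs very poorly, however, so a heuristic method that finds approximately optimal rules is proposed to improve performance. Experiments on synthetic data sets show that, when the optimized rule contains two numeric attributes, rules obtained by optimizing the interest factor are more interesting than rules obtained by optimizing confidence. Experiments on a real data set show that the algorithm has approximately linear scalability and good accuracy.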
  2. With the rapid extension of data size, parallel technology is a very important means of improving the efficiency of data mining. SLIQ uses novel pre-sorting and breadth-first techniques to build a decision tree quickly and accurately on a large data set, and can handle both categorical and numeric attributes. However, the original algorithm involves heavy computation over attributes and records.

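    This paper first analyzes the principles and characteristics of the serial SLIQ algorithm and proposes some improvements to address its shortcomings. It then parallelizes the algorithm in a PVM-based environment and analyzes its time complexity and speedup. The work improves the efficiency of the SLIQ algorithm and has both theoretical significance and practical value.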
  3. By using the classification information provided by the decision attribute, this method not only avoids complex clustering operations but also produces a result with high data consistency. The thesis then discusses an extension of rough set theory for dealing with incomplete information systems.

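    By using the classification information provided by the decision attribute to discretize the attribute value space, this method not only avoids complex clustering operations but also keeps the discretization result highly consistent with the data.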
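
For readers unfamiliar with the terms in example 1, the naive search it mentions can be sketched roughly as follows. This is only an illustration under stated assumptions, not the algorithm from the quoted paper: "interest" is taken here to mean the lift ratio P(A and B) / (P(A) P(B)), which is one common definition and may differ from the paper's; only the antecedent has a numeric attribute (the paper allows one on each side); and the names `rule_measures`, `best_interval`, and the sample `rows` are hypothetical.

```python
def rule_measures(rows, lo, hi, target):
    """Support, confidence and interest (assumed here to be lift) of the rule
    `lo <= x <= hi  =>  label == target`, over rows of (x, label) pairs."""
    n = len(rows)
    labels_in_range = [label for x, label in rows if lo <= x <= hi]
    n_a = len(labels_in_range)                                   # antecedent holds
    n_b = sum(1 for _, label in rows if label == target)         # consequent holds
    n_ab = sum(1 for label in labels_in_range if label == target)  # both hold
    if n_a == 0 or n_b == 0:
        return 0.0, 0.0, 0.0
    support = n_ab / n
    confidence = n_ab / n_a
    interest = confidence / (n_b / n)
    return support, confidence, interest


def best_interval(rows, target, min_support):
    """Naive instantiation: try every interval whose endpoints are observed
    values and keep the one with the highest interest above min_support."""
    values = sorted({x for x, _ in rows})
    best = None
    for i, lo in enumerate(values):
        for hi in values[i:]:
            support, _, interest = rule_measures(rows, lo, hi, target)
            if support >= min_support and (best is None or interest > best[0]):
                best = (interest, lo, hi)
    return best


# Hypothetical sample data: (numeric attribute value, class label).
rows = [(20, "no"), (25, "no"), (30, "yes"), (35, "yes"),
        (40, "yes"), (45, "no"), (50, "no"), (55, "no")]
print(best_interval(rows, "yes", min_support=0.3))
```

The search tries a quadratic number of intervals and rescans the data for each one, which is in line with the example's remark that the straightforward extension performs poorly.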