更改

跳到导航 跳到搜索
添加723字节 、 2020年8月24日 (一) 20:40
第169行: 第169行:  
or a simplified process such as (1) Pre-processing, (2) Data Mining, and (3) Results Validation.
 
or a simplified process such as (1) Pre-processing, (2) Data Mining, and (3) Results Validation.
   −
或一个简化的过程,如(1)预处理,(2)数据挖掘,和(3)结果验证。
+
或一个简化的过程,如(1)预处理,(2)数据挖掘,(3)结果验证。
      第200行: 第200行:     
* [[Anomaly detection]] (outlier/change/deviation detection) – The identification of unusual data records, that might be interesting or data errors that require further investigation.
 
* [[Anomaly detection]] (outlier/change/deviation detection) – The identification of unusual data records, that might be interesting or data errors that require further investigation.
 +
 +
'''<font color="#ff8000">异常检测 Anomaly detection</font>'''(异常值/变化/偏差检测):识别异常数据记录,可能是有趣的或需要进一步调查的数据错误。
    
* [[Association rule learning]] (dependency modeling) – Searches for relationships between variables. For example, a supermarket might gather data on customer purchasing habits. Using association rule learning, the supermarket can determine which products are frequently bought together and use this information for marketing purposes. This is sometimes referred to as market basket analysis.
 
* [[Association rule learning]] (dependency modeling) – Searches for relationships between variables. For example, a supermarket might gather data on customer purchasing habits. Using association rule learning, the supermarket can determine which products are frequently bought together and use this information for marketing purposes. This is sometimes referred to as market basket analysis.
 +
 +
'''<font color="#ff8000">关联规则学习 Association rule learning</font>'''(依赖关系建模):搜索变量之间的关系。例如,超市可能会收集顾客购买习惯的数据。使用关联规则学习,超市可以确定哪些产品经常一起购买,并将这些信息用于营销目的。这有时被称为市场篮子分析。
    
* [[Cluster analysis|Clustering]] – is the task of discovering groups and structures in the data that are in some way or another "similar", without using known structures in the data.
 
* [[Cluster analysis|Clustering]] – is the task of discovering groups and structures in the data that are in some way or another "similar", without using known structures in the data.
 +
 +
'''<font color="#ff8000">聚类 Clustering</font>''':是指在数据中发现以某种方式或其他方式“相似”的组和结构,而不使用数据中已知的结构。
    
* [[Statistical classification|Classification]] – is the task of generalizing known structure to apply to new data. For example, an e-mail program might attempt to classify an e-mail as "legitimate" or as "spam".
 
* [[Statistical classification|Classification]] – is the task of generalizing known structure to apply to new data. For example, an e-mail program might attempt to classify an e-mail as "legitimate" or as "spam".
463

个编辑

导航菜单