
Pattern Evaluation

Posted: 2015-02-17 22:25:38

Tags: pattern, evaluation


@(Pattern Discovery in Data Mining)
This post covers pattern mining in data mining, and how to evaluate whether the patterns and rules it produces are sound.

Limitation of Support-Confidence Framework

Pattern-mining will generate a large set of patterns/rules. However, not all the generated patterns/rules are interesting.

Interestingness measures: objective vs. subjective
* Objective interestingness measures
  * Support, confidence, correlation, …
* Subjective interestingness measures: one man’s trash could be another man’s treasure
  * Query-based: relevant to a user’s particular request
  * Against one’s knowledge base: unexpectedness, freshness, timeliness
  * Visualization tools: multi-dimensional, interactive examination

An example of limitations:
[Figure not preserved]
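
Since the example figure is missing, here is a minimal sketch of the kind of case where support and confidence mislead. The counts are made-up illustrative numbers (an assumption, not the data from the lost figure), and the helper names `support` and `confidence` are hypothetical.

```python
# Illustrative counts only; these numbers are assumptions, not data from the original figure.
N_TOTAL = 1000   # all transactions
N_B = 600        # transactions containing B
N_C = 750        # transactions containing C
N_BC = 400       # transactions containing both B and C


def support(count: int, total: int) -> float:
    """Fraction of all transactions that contain the itemset."""
    return count / total


def confidence(count_both: int, count_antecedent: int) -> float:
    """Fraction of antecedent transactions that also contain the consequent."""
    return count_both / count_antecedent


s_bc = support(N_BC, N_TOTAL)      # support of the rule B -> C
c_b_to_c = confidence(N_BC, N_B)   # confidence of the rule B -> C
s_c = support(N_C, N_TOTAL)        # baseline: how common C is overall

# The rule looks strong, yet knowing B makes C less likely than the baseline.
rule_is_misleading = c_b_to_c < s_c

print(f"s(B -> C) = {s_bc:.2f}, c(B -> C) = {c_b_to_c:.2f}, s(C) = {s_c:.2f}")
print("misleading:", rule_is_misleading)
```

With these counts the rule B → C passes typical support and confidence thresholds, even though C is more common overall than it is among transactions containing B.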

Interestingness Measures: Lift and χ²

  1. Lift

    • Measure of dependent/correlated events: lift

      $$\mathrm{lift}(B, C) = \frac{c(B \rightarrow C)}{s(C)} = \frac{s(B \cup C)}{s(B) \times s(C)}$$

    • Lift(B, C) may tell how B and C are correlated
      • Lift(B, C) = 1: B and C are independent
      • Lift(B, C) > 1: positively correlated
      • Lift(B, C) < 1: negatively correlated

Example:

[Figure not preserved]

Thus, B and C are negatively correlated since lift(B, C) < 1, but B and ¬C are positively correlated since lift(B, ¬C) > 1.
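
The lift formula is easy to check by hand. The sketch below uses illustrative counts (assumed, not recovered from the lost figures) and a hypothetical helper `lift`; it reproduces the same pattern of lift(B, C) < 1 and lift(B, ¬C) > 1.

```python
# Illustrative counts only; assumptions, not data from the original figures.
N_TOTAL = 1000      # all transactions
N_B = 600           # transactions containing B
N_C = 750           # transactions containing C
N_BC = 400          # transactions containing both B and C

N_NOT_C = N_TOTAL - N_C     # transactions without C
N_B_NOT_C = N_B - N_BC      # transactions with B but without C


def lift(n_xy: int, n_x: int, n_y: int, n_total: int) -> float:
    """lift(X, Y) = s(X and Y) / (s(X) * s(Y)), computed from raw counts."""
    s_xy = n_xy / n_total
    s_x = n_x / n_total
    s_y = n_y / n_total
    return s_xy / (s_x * s_y)


lift_b_c = lift(N_BC, N_B, N_C, N_TOTAL)
lift_b_not_c = lift(N_B_NOT_C, N_B, N_NOT_C, N_TOTAL)

print(f"lift(B, C)     = {lift_b_c:.2f}")      # below 1: negatively correlated
print(f"lift(B, not C) = {lift_b_not_c:.2f}")  # above 1: positively correlated
```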

  2. χ²

    • Measure to test correlated events

      $$\chi^2 = \sum \frac{(\text{Observed} - \text{Expected})^2}{\text{Expected}}$$

    • General rules:
      • χ² = 0: independent
      • χ² > 0: correlated, either positively or negatively, so an additional test is needed to determine the direction

Example:
[Figure not preserved]
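
As a rough sketch of the χ² computation on a 2×2 contingency table: the observed counts are illustrative assumptions (not the values from the lost figures), and `chi_square` is a hypothetical helper. Expected counts come from the row and column totals under independence, and comparing one observed cell with its expected value serves as the additional test for direction.

```python
def chi_square(observed):
    """Pearson chi-square statistic and expected counts for a contingency table."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    n = sum(row_totals)

    expected = [
        [row_totals[i] * col_totals[j] / n for j in range(len(col_totals))]
        for i in range(len(row_totals))
    ]
    chi2 = sum(
        (observed[i][j] - expected[i][j]) ** 2 / expected[i][j]
        for i in range(len(row_totals))
        for j in range(len(col_totals))
    )
    return chi2, expected


# Rows: C, not C. Columns: B, not B. Illustrative counts only (assumed).
observed = [
    [400, 350],
    [200, 50],
]

chi2, expected = chi_square(observed)

# chi2 > 0 only says "correlated"; the sign comes from observed vs. expected.
negatively_correlated = observed[0][0] < expected[0][0]

print(f"chi-square = {chi2:.2f}")
print(f"observed(B, C) = {observed[0][0]}, expected(B, C) = {expected[0][0]:.0f}")
print("B and C negatively correlated:", negatively_correlated)
```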

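  3. Null transactions (¬A¬B)
    • Note: Lift and χ² are not always good measures, because both are sensitive to the number of null transactions (transactions that contain neither A nor B)
      [Figure not preserved]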
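
To see why null transactions are a problem, the sketch below holds the transactions that contain A or B fixed and changes only the number of null transactions. All counts are illustrative assumptions, and `lift_from_cells` is a hypothetical helper.

```python
def lift_from_cells(n_ab: int, n_a_only: int, n_b_only: int, n_null: int) -> float:
    """Lift of A and B from the four cells of a 2x2 contingency table."""
    n_total = n_ab + n_a_only + n_b_only + n_null
    s_ab = n_ab / n_total
    s_a = (n_ab + n_a_only) / n_total
    s_b = (n_ab + n_b_only) / n_total
    return s_ab / (s_a * s_b)


# Same A/B behaviour in both datasets; only the null-transaction count differs.
# Illustrative counts only (assumed).
lift_few_nulls = lift_from_cells(n_ab=100, n_a_only=1000, n_b_only=1000, n_null=0)
lift_many_nulls = lift_from_cells(n_ab=100, n_a_only=1000, n_b_only=1000, n_null=100_000)

print(f"lift with no null transactions:      {lift_few_nulls:.3f}")   # below 1
print(f"lift with 100,000 null transactions: {lift_many_nulls:.3f}")  # above 1
```

The relationship between A and B is identical in both cases, yet lift flips from "negatively correlated" to "positively correlated" purely because of transactions that contain neither item.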

Null Invariance Measures

  • Null invariance: the value of the measure does not change with the number of null transactions.
  • Why is null invariance crucial for the analysis of massive transaction data? Because many transactions may contain neither milk nor coffee!

[Figure not preserved]
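
The table of measures is missing here, so the sketch below falls back on the usual textbook definitions of five null-invariant measures (all-confidence, Jaccard, cosine, Kulczynski, max-confidence). Treat it as an assumption that these are the ones the lost figure listed. The counts and the helper name `null_invariant_measures` are illustrative.

```python
import math


def null_invariant_measures(n_ab: int, n_a_only: int, n_b_only: int, n_null: int) -> dict:
    """Compute five null-invariant measures from the cells of a 2x2 table."""
    n_total = n_ab + n_a_only + n_b_only + n_null
    s_ab = n_ab / n_total
    s_a = (n_ab + n_a_only) / n_total
    s_b = (n_ab + n_b_only) / n_total

    p_b_given_a = s_ab / s_a
    p_a_given_b = s_ab / s_b
    return {
        "all_conf": s_ab / max(s_a, s_b),
        "jaccard": s_ab / (s_a + s_b - s_ab),
        "cosine": s_ab / math.sqrt(s_a * s_b),
        "kulczynski": 0.5 * (p_b_given_a + p_a_given_b),
        "max_conf": max(p_b_given_a, p_a_given_b),
    }


# Illustrative counts only (assumed); the two datasets differ only in null transactions.
few_nulls = null_invariant_measures(n_ab=100, n_a_only=1000, n_b_only=1000, n_null=0)
many_nulls = null_invariant_measures(n_ab=100, n_a_only=1000, n_b_only=1000, n_null=100_000)

for name in few_nulls:
    print(f"{name:>10}: {few_nulls[name]:.4f} vs {many_nulls[name]:.4f}")
```

Each measure is a ratio in which the total transaction count cancels out, which is why adding null transactions leaves the values unchanged (up to floating-point rounding).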

Comparison of Null-invariance Measures

[Figure not preserved]

Use the Imbalance Ratio (IR) to measure the imbalance of two itemsets A and B in rule implications.
[Figure not preserved]
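
The IR formula itself was in the lost image. The sketch below assumes the common textbook definition, IR(A, B) = |s(A) − s(B)| / (s(A) + s(B) − s(A ∪ B)), and pairs it with the Kulczynski measure. The counts and function names are illustrative, not taken from the original figures.

```python
def imbalance_ratio(n_ab: int, n_a_only: int, n_b_only: int) -> float:
    """IR(A, B) = |s(A) - s(B)| / (s(A) + s(B) - s(A and B)), using raw counts.

    The total transaction count cancels out, so counts can stand in for supports.
    """
    n_a = n_ab + n_a_only
    n_b = n_ab + n_b_only
    return abs(n_a - n_b) / (n_a + n_b - n_ab)


def kulczynski(n_ab: int, n_a_only: int, n_b_only: int) -> float:
    """Kulc(A, B) = 0.5 * (P(A|B) + P(B|A))."""
    n_a = n_ab + n_a_only
    n_b = n_ab + n_b_only
    return 0.5 * (n_ab / n_a + n_ab / n_b)


# Illustrative counts only (assumed).
balanced = dict(n_ab=1000, n_a_only=1000, n_b_only=1000)
skewed = dict(n_ab=1000, n_a_only=100, n_b_only=10_000)

ir_balanced, kulc_balanced = imbalance_ratio(**balanced), kulczynski(**balanced)
ir_skewed, kulc_skewed = imbalance_ratio(**skewed), kulczynski(**skewed)

print(f"balanced: Kulc = {kulc_balanced:.2f}, IR = {ir_balanced:.2f}")
print(f"skewed:   Kulc = {kulc_skewed:.2f}, IR = {ir_skewed:.2f}")
```

Both datasets give a neutral Kulczynski value of about 0.5, but IR separates them: it is 0 when A and B are equally frequent and approaches 1 as one itemset dominates the other.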


Original post: http://blog.csdn.net/rk2900/article/details/43867993
