=============第一章:DM介绍=================
Data mining的范畴:
- data collection and database creation
- data management (including data storage and retrieval, and database transaction processing)
- advanced data analysis (involving data warehousing and data mining).
Data mining的步骤:
- Data cleaning (to remove noise and inconsistent data)
- Data integration (where multiple data sources may be combined)
- Data selection (where data relevant to the analysis task are retrieved fromthe database)
- Data transformation (where data are transformed or consolidated into forms appropriate for mining by performing summary or aggregation operations, for instance)
- Data mining (an essential process where intelligent methods are applied in order to extract data patterns)
- Pattern evaluation (to identify the truly interesting patterns representing knowledge based on some interestingness measures)
- Knowledge presentation (where visualization and knowledge representation techniques are used to present the mined knowledge to the user)
Data来源:db;dw;交易数据;文本;多媒体数据;流数据;web数据。
Data mining的分类——2大类 Descriptive mining 和 Predictive mining:
- Concept/Class Description: Characterization and Discrimination
- Mining Frequent Patterns, Associations, and Correlations
- Classification and Prediction
- Cluster Analysis
- Outlier Analysis
- Evolution Analysis
有意义的pattern:
- easily understood by humans
- valid on new or test data with some degree of certainty
- potentially useful
- novel
DM任务的要素(书本中用DMQL来描述这些要素)
- The set of task-relevant data to be mined
- The kind of knowledge to be mined
- The background knowledge to be used in the discovery process
- The interestingness measures and thresholds for pattern evaluation
- The expected representation for visualizing the discovered patterns
相关推荐
Data Mining Concepts and Techniques 3rd Edition(数据挖掘概念与技术第三版)
數據挖掘:概念與技術(原書第三版英文版)
Data Mining Concepts and Techniques 3rd Edition(数据挖掘概念与技术第三版)英文原版
清晰版(非扫描),并且带书签,经典书籍,5分应该很值!
Data Mining Concepts and Techniques.pdf
Data Mining - Concepts and Techniques Third Edition Jiawei Han University of Illinois at Urbana–Champaign Micheline Kamber Jian Pei Simon Fraser University
Not only does the third of edition of Data Mining: Concepts and Techniques continue the tradition of equipping you with an understanding and application of the theory and practice of discovering ...
韩家炜的《数据挖掘:概念与技术》是数据挖掘方面学习的入门经典,但中文版的翻译较差,难于理解作者本义。 网上已有的英文原版资源要么是第二版,要么是第三版的整理版,现特别奉献原书第二版与第三版的高清PDF版本...
2011年出版的数据挖掘领域知名入门教材《数据挖掘:概念与技术》第3版英文电子版。同第2版相比增加了06年以后不少新内容。内容清晰。
经典数据挖掘著作,重新分享!有需要的来吧!
Data Mining: Concepts and Techniques (3rd ed.) Jiawei Han, Micheline Kamber, and Jian PeiUniversity of Illinois at Urbana-Champaign &Simon; Fraser University©2013 Han, Kamber & Pei.
数据挖掘的经典教材《Data Mining.Concepts and Techniques》.
DataMiningConceptsAndTechniques
数据挖掘概念与技术 pdf part1 解压密码:DataMining 用7z压缩,不清楚别的方式能不能打开 打不开的请: 7-Zip 官方首页/7z下载 http://www.7-zip.org/ 中文首页 http://7z.sparanoid.com/
Data Mining Concepts And technology 3End Data Mining Concepts And technology 3End Data Mining Concepts And technology 3End Data Mining Concepts And technology 3End Data Mining Concepts And technology ...
《数据挖掘概念与技术-3rd英文版》韩家炜
经典著作的最新版,识货的来下!