DataMininginBioinformatics.ppt

  1. 1、本文档共31页,可阅读全部内容。
  2. 2、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。
  3. 3、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载
  4. 4、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
查看更多
DataMininginBioinformatics

* Cross Validation: Example One-tier cross validation Train on different data than test data Two-tier cross validation The score from one-tier cross validation is used by the bias optimizer to select the best learning algorithm parameters (# of control points) . The more you optimize the more you over-fit. The second tier is to measure the level of over-fit (unbiased measure of accuracy). Useful for comparing learning algorithms with control parameters that are optimized. Number of folds is not optimized. Computational complexity: #folds of top tier X #folds of bottom tier X #control points X CPU of algorithm * Summary Microarray problem Computational biology Major objective of microarray technology Input and output of data analysis Data mining and image analysis steps Image normalization, grid alignment, feature construction Data mining techniques Prior knowledge Expected results of data mining Validation Issues Cross validation techniques Overviewheader Peter Bajcsy, PhD Automated Learning Group National Center for Supercomputing Applications University of Illinois pbajcsy@ncsa.uiuc.edu January 31, 2002 优秀精品课件文档资料 Data Mining in Bioinformatics * Outline Introduction Overview of Microarray Problem Image Analysis Data Mining Validation Summary * Introduction: Recommended Literature 1. Bioinformatics – The Machine Learning Approach by P. Baldi S. Brunak, 2nd edition, The MIT Press, 2001 2. Data Mining – Concepts and Techniques by J. Han M. Kamber, Morgan Kaufmann Publishers, 2001 3. Pattern Classification by R. Duda, P. Hart and D. Stork, 2nd edition, John Wiley Sons, 2001 * Introduction: Microarray Problem in Bioinformatics Domain Problems in Bioinformatics Domain Data production at the levels of molecules, cells, organs, organisms, populations Integration of structure and function data, gene expression data, pathway data, phenotypic and clinical data, … Prediction of Molecular Function and Structure Computational biology: synthesis (simulations) a

文档评论(0)

shenland + 关注
实名认证
内容提供者

该用户很懒,什么也没介绍

1亿VIP精品文档

相关文档