书籍 Machine Learning in Action的封面

Machine Learning in Action

Peter Harrington

出版时间

2012-04-18

ISBN

9781617290183

评分

★★★★★
书籍介绍

It's been said that data is the new "dirt"—the raw material from which and on which you build the structures of the modern world. And like dirt, data can seem like a limitless, undifferentiated mass. The ability to take raw data, access it, filter it, process it, visualize it, understand it, and communicate it to others is possibly the most essential business problem for the coming decades.

"Machine learning," the process of automating tasks once considered the domain of highly-trained analysts and mathematicians, is the key to efficiently extracting useful information from this sea of raw data. By implementing the core algorithms of statistical data processing, data analysis, and data visualization as reusable computer code, you can scale your capacity for data analysis well beyond the capabilities of individual knowledge workers.

Machine Learning in Action is a unique book that blends the foundational theories of machine learning with the practical realities of building tools for everyday data analysis. In it, you'll use the flexible Python programming language to build programs that implement algorithms for data classification, forecasting, recommendations, and higher-level features like summarization and simplification.

As you work through the numerous examples, you'll explore key topics like classification, numeric prediction, and clustering. Along the way, you'll be introduced to important established algorithms, such as Apriori, through which you identify association patterns in large datasets and Adaboost, a meta-algorithm that can increase the efficiency of many machine learning tasks.

Peter Harrington holds Bachelors and Masters Degrees in Electrical Engineering. He worked for Intel Corporation for seven years in California and China. Peter holds five US patents and his work has been published in three academic journals. He is currently the chief scientist for Zillabyte Inc. Peter spends his free time competing in programming competitions, and building 3D pr...

(展开全部)

目录
Part 1: Classification
1 Machine learning basics
2 Classifying with k-nearest neighbors
3 Splitting datasets one feature at a time: decision trees
4 Classifying with probability distributions: Na�ve Bayes

显示全部
用户评论
python和机器学习浅显地入门材料。很有趣。
超级赞的入门好书,很多之前模糊的概念都通过本书中的例子弄明白了
一般般
随便翻翻,当复习Python和相关库了。适合初学者。
入门书籍。。超多python代码..
业界的书接地气
第三章决策树,说好的要用决策树测试一个实例,结果根本就没有测试的内容,让人失望。
一般吧,简单入门书
这本书适合理论功底强,而实践能力弱的人读。算法的理论部分极其鸡肋,有些算法根本讲不清楚,需要自己查阅资料去学习。作为一本重实践的书,可以没有公式推导,但书中出现的公式要求描述清楚不过分吧?另外最近几年机器学习飞速发展,本书显然略显老旧,书中代码是Python2的,很多需要安装的module并没有Python3的版本。还有一些抓取网站数据的,网站已经不存在了。一些近年来火爆的算法并没有包含,比如深度学习。但是作为新人的入门书籍,本书还是很好读的,可以了解一些机器学习的基本问题,基本算法以及基本面貌。最主要的是,学习一些作者好的代码习惯。
算法