Learning Apache Mahout
Acquire practical skills in Big Data Analytics and explore data science with Apache Mahout
About This Book
- Learn to use Apache Mahout for Big Data Analytics
- Understand machine learning concepts and algorithms and their implementation in Mahout
- A comprehensive guide with numerous code examples and end-to-end case studies on Customer Analytics and Text Analytics
Who This Book Is For
If you are a Java developer who wants to use Mahout and machine learning to solve Big Data Analytics use cases, then this book is for you. Familiarity with shell scripts is assumed, but no prior experience is required.
What You Will Learn
- Configure Mahout on Linux systems and set up the development environment
- Become familiar with the Mahout command line utilities and Java APIs
- Understand the core concepts of machine learning and the classes that implement them
- Integrate Apache Mahout with newer platforms such as Apache Spark
- Solve classification, clustering, and recommendation problems with Mahout
- Explore frequent pattern mining and topic modeling, two important application areas of machine learning
- Understand feature extraction, reduction, and the curse of dimensionality
In the past few years, the generation of data and our capability to store and process it have grown exponentially. There is a need for scalable analytics frameworks and for people with the right skills to extract the information needed from this Big Data. Apache Mahout is one of the first and most prominent Big Data machine learning platforms. It implements machine learning algorithms on top of distributed processing platforms such as Hadoop and Spark.
Starting with the basics of Mahout and machine learning, you will explore prominent algorithms and their implementation in Mahout. You will learn about Mahout's building blocks, address feature extraction, reduction, and the curse of dimensionality, and delve into classification use cases with the random forest and Naive Bayes classifiers, as well as item-based and user-based recommendation. You will then work with clustering in Mahout using the K-means algorithm and implement Mahout without MapReduce. Finish with a flourish by exploring end-to-end use cases on customer analytics and text analytics to gain real-life, practical know-how of analytics projects.
Formats for this Ebook
|Required Software|Any PDF Reader, Apple Preview|
|Supported Devices|Windows PC/PocketPC, Mac OS, Linux OS, Apple iPhone/iPod Touch|
|# of Devices|Unlimited|
|Flowing Text / Pages|Pages|