Deep Learning with Hadoop

Front Cover
Packt Publishing Ltd, Feb 20, 2017 - Computers - 206 pages

Build, implement and scale distributed deep learning models for large-scale datasets

About This BookGet to grips with the deep learning concepts and set up Hadoop to put them to useImplement and parallelize deep learning models on Hadoop's YARN frameworkA comprehensive tutorial to distributed deep learning with HadoopWho This Book Is For

If you are a data scientist who wants to learn how to perform deep learning on Hadoop, this is the book for you. Knowledge of the basic machine learning concepts and some understanding of Hadoop is required to make the best use of this book.

What You Will LearnExplore Deep Learning and various models associated with itUnderstand the challenges of implementing distributed deep learning with Hadoop and how to overcome itImplement Convolutional Neural Network (CNN) with deeplearning4jDelve into the implementation of Restricted Boltzmann Machines (RBM)Understand the mathematical explanation for implementing Recurrent Neural Networks (RNN)Get hands on practice of deep learning and their implementation with Hadoop.In Detail

This book will teach you how to deploy large-scale dataset in deep neural networks with Hadoop for optimal performance.

Starting with understanding what deep learning is, and what the various models associated with deep neural networks are, this book will then show you how to set up the Hadoop environment for deep learning. In this book, you will also learn how to overcome the challenges that you face while implementing distributed deep learning with large-scale unstructured datasets. The book will also show you how you can implement and parallelize the widely used deep learning models such as Deep Belief Networks, Convolutional Neural Networks, Recurrent Neural Networks, Restricted Boltzmann Machines and autoencoder using the popular deep learning library deeplearning4j.

Get in-depth mathematical explanations and visual representations to help you understand the design and implementations of Recurrent Neural network and Denoising AutoEncoders with deeplearning4j. To give you a more practical perspective, the book will also teach you the implementation of large-scale video processing, image processing and natural language processing on Hadoop.

By the end of this book, you will know how to deploy various deep neural networks in distributed systems using Hadoop.

Style and approach

This book takes a comprehensive, step-by-step approach to implement efficient deep learning models on Hadoop. It starts from the basics and builds the readers' knowledge as they strengthen their understanding of the concepts. Practical examples are included in every step of the way to supplement the theory.

 

Contents

Preface
1
Introduction to Deep Learning
8
Distributed Deep Learning for LargeScale Data
39
Convolutional Neural Network
71
Recurrent Neural Network
100
Restricted Boltzmann Machines
122
Autoencoders
145
Miscellaneous Deep Learning Operations using Hadoop
165
References
175
Index
183
Copyright

Other editions - View all

Common terms and phrases

About the author (2017)

Dipayan Dev has completed his M.Tech from National Institute of Technology, Silchar with a first class first and is currently working as a software professional in Bengaluru, India. He has extensive knowledge and experience in non-relational database technologies, having primarily worked with large-scale data over the last few years. His core expertise lies in Hadoop Framework. During his postgraduation, Dipayan had built an infinite scalable framework for Hadoop, called Dr. Hadoop, which got published in top-tier SCI-E indexed journal of Springer (http://link.springer.com/article/10.1631/FITEE.1500015). Dr. Hadoop has recently been cited by Goo Wikipedia in their Apache Hadoop article. Apart from that, he registers interest in a wide range of distributed system technologies, such as Redis, Apache Spark, Elasticsearch, Hive, Pig, Riak, and other NoSQL databases. Dipayan has also authored various research papers and book chapters, which are published by IEEE and top-tier Springer Journals. To know more about him, you can also visit his LinkedIn profile https://www.linkedin.com/in/dipayandev.

Bibliographic information