Apache Flume: Distributed Log Collection for Hadoop - Second Edition 2nd Edition

Ebook Details


Steve Hoffman

Year 2015
Pages 175
Publisher Packt Publishing
Language en
ISBN 9781784392178
File Size 1.82 MB
File Format EPUB
Download Counter 1,045
Amazon Link

Ebook Description

Apache Flume is a distributed, reliable, and available service used to efficiently collect, aggregate, and move large amounts of log data. It is used to stream logs from application servers to HDFS for ad hoc analysis. This book starts with an architectural overview of Flume and its logical components. It explores channels, sinks, and sink processors, followed by sources and channels. By the end of this book, you will be fully equipped to construct a series of Flume agents to dynamically transport your stream data and logs from your systems into Hadoop.