Apache Flume Online Training

Apache Flume

Apache Flume is a tool for ingesting data from various data producers, such as web servers, into Hadoop. It is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of streaming data into the Hadoop Distributed File System (HDFS). It is also simple, robust, flexible, and extensible.

LaraOnlineTraining provides instructor-led, live training that gives participants an in-depth understanding of Flume.

Features of Apache Flume:

  • It is a robust, fault-tolerant, and highly available service.
  • It is a distributed system with tunable reliability mechanisms for fail-over and recovery.
  • It is horizontally scalable.
  • It supports complex data flows such as multi-hop flows, fan-in flows, fan-out flows, and contextual routing.
  • It supports a large set of sources, channels, and sinks.
  • It can efficiently ingest log data from various servers into a centralized repository.
  • With Flume, we can collect data from different web servers in real time as well as in batch mode.
  • We can import large volumes of data generated by social networking sites and e-commerce sites into HDFS using Apache Flume.
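
To make the source, channel, and sink model behind these features more concrete, here is a minimal Python sketch. It is a toy model only and does not use Flume's actual Java API or configuration format. The names `MemoryChannel`, `run_source`, and `run_sink` are hypothetical and chosen for illustration, and the fan-out shown is a simple replication of each event to every channel.

```python
from collections import deque


class MemoryChannel:
    """Toy stand-in for a channel: a bounded FIFO buffer of events."""

    def __init__(self, capacity=100):
        self.capacity = capacity
        self.events = deque()

    def put(self, event):
        # Refuse new events once the buffer is full.
        if len(self.events) >= self.capacity:
            raise OverflowError("channel is full")
        self.events.append(event)

    def take(self):
        # Return the oldest event, or None when the channel is empty.
        return self.events.popleft() if self.events else None


def run_source(lines, channels):
    """Toy source: wrap each line as an event and copy it to every channel (fan-out)."""
    for line in lines:
        event = {"headers": {}, "body": line}
        for channel in channels:
            channel.put(event)


def run_sink(channel, store):
    """Toy sink: drain the channel into a destination list and return the event count."""
    delivered = 0
    while (event := channel.take()) is not None:
        store.append(event["body"])
        delivered += 1
    return delivered


if __name__ == "__main__":
    primary, audit = MemoryChannel(), MemoryChannel()
    run_source(["GET /index.html", "POST /login"], [primary, audit])
    primary_store, audit_store = [], []
    print(run_sink(primary, primary_store), primary_store)
    print(run_sink(audit, audit_store), audit_store)
```

In a real Flume deployment these roles are filled by configured components (for example a NetCat source, a memory channel, and an HDFS sink) rather than hand-written classes. The sketch only shows how a channel decouples the component that produces events from the component that delivers them.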

 

By the end of this training, participants will be able to:

  • Enhance their knowledge of Apache Flume features
  • Understand the architecture and data flow in Apache Flume
  • Apply what they have learned to real-world use cases and scenarios
  • Use Flume to collect, combine, and transfer large amounts of log data to a centralized datastore

Target audience

  • Students
  • Big Data and Hadoop Professionals
  • IT professionals

Prerequisites

IT professionals and students who are eager to learn can take this training.

Why Should I Choose Lara Online Training?

  • We provide in-depth course material for Apache Flume Online Training, with real-time scenarios and solutions for each topic.
  • At Lara, we provide case studies from real-time applications with professional explanations.
  • We schedule sessions at your convenience with our highly qualified trainers and real-time experts.
  • Class recordings are available immediately after each session for future reference.
  • We offer Normal Track, Fast Track, and Weekend batches for Apache Flume Online Training.
  • We offer cost-effective and flexible payment schemes.
  • At Lara, we provide placement assistance.
  • We provide assessments and mock interviews.

 

Introduction and Understanding the Fundamentals of Apache Flume

1. About Apache Flume
2. Understanding How Flume Works
3. Overview of the Important Components of Apache Flume
4. Architecture of Apache Flume
5. Data Flow Model
6. Reliability
7. Recoverability

Setting Up Apache Flume

1. Setting Up and Configuring an Agent
2. Starting an Agent
3. Using Environment Variables
4. Logging Raw Stream of Data
5. Installing Third-Party Plugins

Ingesting Data from External Resources

1. Using Avro RPC Mechanism
2. Executing Commands
3. Exploring Network Streams

Setting Multi-Agent Flow

Consolidating Events into a Single Channel

Defining a Flow Multiplexer

Flow Configuration

1. Defining the Flow
2. Setting Up Individual Components
3. Adding Multiple Flows in an Agent
4. Setting Up a Multi-Tier Flow
5. Fanning Out the Flow from a Single Source to Multiple Channels

Implementing a Flume Source

1. Using Avro Source
2. Using Thrift Source
3. Using Exec Source
4. Using JMS Source
5. Using Spooling Directory Source
6. Using Taildir Source
7. Using Twitter 1% Firehose Source
8. Using Kafka Source
9. Using NetCat TCP Source
10. Using NetCat UDP Source
11. Using Sequence Generator Source
12. Using Syslog TCP Source
13. Using Multiport Syslog TCP Source
14. Using Syslog UDP Source
15. Using HTTP Source
16. Using Stress Source
17. Using Legacy Sources
18. Using Custom Source
19. Using Scribe Source

Implementing a Flume Sink

1. Using HDFS Sink
2. Using Hive Sink
3. Using Logger Sink
4. Using Avro Sink
5. Using Thrift Sink
6. Using IRC Sink
7. Using File Roll Sink
8. Using Null Sink
9. Using HBaseSinks
10. Using MorphlineSolrSink
11. Using ElasticSearchSink
12. Using Kite Dataset Sink
13. Using Kafka Sink
14. Using HTTP Sink
15. Using Custom Sink

Implementing a Flume Channel Interface

1. Using Memory Channel
2. Using JDBC Channel
3. Using Kafka Channel
4. Using File Channel
5. Using Spillable Memory Channel
6. Using Pseudo Transaction Channel
7. Using a Custom Channel

Using Flume Channel Selectors

1. Using the Replicating Channel Selector
2. Using the Multiplexing Channel Selector
3. Using a Custom Channel Selector

Implementing Flume Sink Processors

1. Using the Default Sink Processor
2. Using the Failover Sink Processor
3. Using the Load Balancing Sink Processor
4. Using a Custom Sink Processor

Using Event Serializers

Using Flume Interceptors

1. Using the Timestamp Interceptor
2. Using the Host Interceptor
3. Using the Static Interceptor
4. Using the Remove Header Interceptor
5. Using the UUID Interceptor
6. Using the Morphline Interceptor
7. Using the Search and Replace Interceptor
8. Using the Regex Filtering Interceptor
9. Using the Regex Extractor Interceptor

Understanding Flume Properties

Security Configurations on Apache Flume

Monitoring and Reporting in Apache Flume

Using Tools in Apache Flume

1. Using the File Channel Integrity Tool
2. Using the Event Validator Tool

Understanding Topology Design Considerations

Handling Agent Failures

Handling Compatibility

Troubleshooting

Enrolled: 2557 students
Duration: 35
Lectures: 79
