Flume

Ingest data in HDFS(Solr, HBase) from different sources.

Configuring an agent

Have a configuration file flume.conf (java properties file) Configuration example for file feed and output to logger.

agent1.sources = source1
agent1.sinks = sink1
agent1.channels = channel1

agent1.sources.source1.channels = channel1
agent1.sources.source1.type = spooldir
agent1.sources.source1.spoolDir = /tmp/spooldir

agent1.sinks.sink1.channel = channel1
agent1.sinks.sink1.type = logger

agent1.channels.channel1.type = file

Create the source folder

mkdir /tmp/spooldir

Start an agent

flume-ng agent --name <agent_name> --conf <config_dir> --conf-file <config_file_path.conf> [-Dflume.root.logger=INFO,console]

Configure a memory channel

agent1.channels.channel1.type = memory
agent1.channels.channel1.capacity = 1000 # nb events

Last updated

Was this helpful?