Flume Environment Setup_Five Cases
http://flume.apache.org/FlumeUserGuide.html
A simple example
Here, we give an example configuration file, describing a single-node Flume deployment. This configuration lets a user generate events and subsequently logs them to the console.
# example.conf: A single-node Flume configuration
# Name the components on this agent
a1.sources = r1
a1.sinks = k1
a1.channels = c1
# Describe/configure the source
a1.sources.r1.type = netcat
a1.sources.r1.bind = localhost
a1.sources.r1.port = 44444
# Describe the sink
a1.sinks.k1.type = logger
# Use a channel which buffers events in memory
a1.channels.c1.type = memory
a1.channels.c1.capacity = 1000
a1.channels.c1.transactionCapacity = 100
# Bind the source and sink to the channel
a1.sources.r1.channels = c1
a1.sinks.k1.channel = c1
This configuration defines a single agent named a1. a1 has a source that listens for data on port 44444, a channel that buffers event data in memory, and a sink that logs event data to the console. The configuration file names the various components, then describes their types and configuration parameters. A given configuration file might define several named agents; when a given Flume process is launched a flag is passed telling it which named agent to manifest.
Given this configuration file, we can start Flume as follows:
$ bin/flume-ng agent --conf conf --conf-file example.conf --name a1 -Dflume.root.logger=INFO,console
Note that in a full deployment we would typically include one more option: --conf=<conf-dir>. The <conf-dir> directory would include a shell script flume-env.sh and potentially a log4j properties file. In this example, we pass a Java option to force Flume to log to the console and we go without a custom environment script.
From a separate terminal, we can then telnet port 44444 and send Flume an event:
$ telnet localhost 44444
Trying 127.0.0.1...
Connected to localhost.localdomain (127.0.0.1).
Escape character is '^]'.
Hello world! <ENTER>
OK
The original Flume terminal will output the event in a log message.
12/06/19 15:32:19 INFO source.NetcatSource: Source starting
12/06/19 15:32:19 INFO source.NetcatSource: Created serverSocket:sun.nio.ch.ServerSocketChannelImpl[/127.0.0.1:44444]
12/06/19 15:32:34 INFO sink.LoggerSink: Event: { headers:{} body: 48 65 6C 6C 6F 20 77 6F 72 6C 64 21 0D Hello world!. }
Congratulations - you’ve successfully configured and deployed a Flume agent! Subsequent sections cover agent configuration in much more detail.
The concrete setup steps for each case follow below.
Flume Setup_Case 1: A Single Flume
Install on node2.
1. Upload the package to /home/tools, extract it, and move the extracted directory under /home.
2. Rename the directory and edit flume-env.sh (see the shell sketch below).
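Neither the exact commands nor the flume-env.sh change are shown in these notes; the following is a minimal shell sketch of steps 1-2, assuming an apache-flume-1.6.0 binary tarball and a JDK under /usr/java (both hypothetical, adjust to your environment):

# Step 1: extract under /home/tools and move to /home (tarball name is an assumption)
cd /home/tools
tar -zxvf apache-flume-1.6.0-bin.tar.gz
mv apache-flume-1.6.0-bin /home/flume
# Step 2: create flume-env.sh from the shipped template and point it at the JDK
cd /home/flume/conf
cp flume-env.sh.template flume-env.sh
# add to flume-env.sh (JDK path is an assumption):
#   export JAVA_HOME=/usr/java/jdk1.8.0_121
# optional: put flume-ng on the PATH so the later commands work as written
#   export PATH=$PATH:/home/flume/bin

For the single-agent case itself, the netcat -> memory -> logger example.conf quoted above can be used unchanged and started with the flume-ng command shown there.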


Flume Setup_Case 2: Two Flume Agents as a Cluster

MemoryChannel configuration
capacity: the maximum number of events the channel can hold; the default is 100.
transactionCapacity: the maximum number of events taken from the source or given to the sink in a single transaction; the default is also 100.
keep-alive: the time allowed for adding an event to the channel or removing one from it.
byteCapacity (and the other byte* settings): a limit on the total byte size of the events in the channel, counting only the event body.
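The two configuration files referenced by the start commands below, avro.conf on node02 and simple.conf2 on node01, are not reproduced in these notes. What follows is a hedged sketch of what they might contain, assuming node01 listens on a netcat source and forwards events over Avro to node02, which logs them to the console; hostnames, ports, and file contents are assumptions. The channel lines also illustrate the MemoryChannel parameters described above.

# avro.conf (on node02): Avro source -> memory channel -> logger sink
a1.sources = r1
a1.sinks = k1
a1.channels = c1
a1.sources.r1.type = avro
a1.sources.r1.bind = node02
a1.sources.r1.port = 60000
a1.sinks.k1.type = logger
a1.channels.c1.type = memory
a1.channels.c1.capacity = 1000
a1.channels.c1.transactionCapacity = 100
a1.channels.c1.keep-alive = 3
a1.sources.r1.channels = c1
a1.sinks.k1.channel = c1

# simple.conf2 (on node01): netcat source -> memory channel -> Avro sink pointing at node02
a1.sources = r1
a1.sinks = k1
a1.channels = c1
a1.sources.r1.type = netcat
a1.sources.r1.bind = node01
a1.sources.r1.port = 44444
a1.sinks.k1.type = avro
a1.sinks.k1.hostname = node02
a1.sinks.k1.port = 60000
a1.channels.c1.type = memory
a1.channels.c1.capacity = 1000
a1.channels.c1.transactionCapacity = 100
a1.sources.r1.channels = c1
a1.sinks.k1.channel = c1

Once both agents are running (node02 first, as below), telnet node01 44444 and any line you type should appear in node02's console log.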




Start the Flume agent on node02 first:
flume-ng agent -n a1 -c conf -f avro.conf -Dflume.root.logger=INFO,console
flume-ng agent -n a1 -c conf -f /home/test_flume/flume22 -Dflume.root.logger=INFO,console
Then start the Flume agent on node01:
flume-ng agent -n a1 -c conf -f simple.conf2 -Dflume.root.logger=INFO,console
flume-ng agent -n a1 -c conf -f /home/test_flume/flume21 -Dflume.root.logger=INFO,console


Flume Setup_Case 3: How do you monitor changes to a file?
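The exec.conf referenced below is not included in these notes; a minimal sketch, assuming an exec source that tails a log file and a logger sink that prints every new line to the console (the file path /home/test_flume/access.log is hypothetical):

# exec.conf: exec source tailing a file -> memory channel -> logger sink
a1.sources = r1
a1.sinks = k1
a1.channels = c1
a1.sources.r1.type = exec
a1.sources.r1.command = tail -F /home/test_flume/access.log
a1.sinks.k1.type = logger
a1.channels.c1.type = memory
a1.channels.c1.capacity = 1000
a1.channels.c1.transactionCapacity = 100
a1.sources.r1.channels = c1
a1.sinks.k1.channel = c1

After starting the agent as shown below, appending to the file (echo "new line" >> /home/test_flume/access.log) should make the line show up in the Flume console.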


Start Flume:
flume-ng agent -n a1 -c conf -f exec.conf -Dflume.root.logger=INFO,console
flume-ng agent -n a1 -c conf -f /home/test_flume/flume3 -Dflume.root.logger=INFO,console


Flume Setup_Case 4: How do you monitor changes to a directory of files?
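No configuration or start command survives for this case in the notes; a hedged sketch using Flume's Spooling Directory source, which ingests whole files dropped into a watched directory and renames them with a .COMPLETED suffix once consumed (the file name spool.conf and the directory /home/test_flume/spool are hypothetical):

# spool.conf: spooling-directory source -> memory channel -> logger sink
a1.sources = r1
a1.sinks = k1
a1.channels = c1
a1.sources.r1.type = spooldir
a1.sources.r1.spoolDir = /home/test_flume/spool
a1.sources.r1.fileHeader = true
a1.sinks.k1.type = logger
a1.channels.c1.type = memory
a1.channels.c1.capacity = 1000
a1.channels.c1.transactionCapacity = 100
a1.sources.r1.channels = c1
a1.sinks.k1.channel = c1

It would be started the same way as the other cases, e.g.
flume-ng agent -n a1 -c conf -f spool.conf -Dflume.root.logger=INFO,console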


Flume Setup_Case 5: How do you define an HDFS sink?

Flume Setup_Case 5_Reading the configuration options


Each row below is property | default | description, as in the HDFS sink table of the Flume User Guide.
1-3. When does Flume roll the current file over to a new one (after how many seconds, at what file size, and after how many events)?
hdfs.rollInterval | 30 | Number of seconds to wait before rolling current file (0 = never roll based on time interval)
hdfs.rollSize | 1024 | File size to trigger roll, in bytes (0: never roll based on file size)
hdfs.rollCount | 10 | Number of events written to file before it rolled (0 = never roll based on number of events)
4. After how long with no writes does Flume close the current temporary file and turn it into a finished file?
hdfs.idleTimeout | 0 | Timeout after which inactive files get closed (0 = disable automatic closing of idle files)
5. How often is a new directory created? (e.g. a new directory every 10 seconds)
The "rounding" here only ever rounds down, never up: with roundValue = 5 and roundUnit = minute, minute 57 is bucketed as 55, so minutes 5-9 land in one directory and minutes 10-14 in the next.
hdfs.round | false | Should the timestamp be rounded down (if true, affects all time based escape sequences except %t)
hdfs.roundValue | 1 | Rounded down to the highest multiple of this (in the unit configured using hdfs.roundUnit), less than current time
hdfs.roundUnit | second | The unit of the round down value - second, minute or hour
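The actual case-5 configuration file is not included in these notes; the sketch below shows how the rolling and rounding options from the table might be combined in an HDFS sink (the agent layout, netcat source, NameNode address, and paths are assumptions):

# hdfs.conf: netcat source -> memory channel -> HDFS sink
a1.sources = r1
a1.sinks = k1
a1.channels = c1
a1.sources.r1.type = netcat
a1.sources.r1.bind = node01
a1.sources.r1.port = 44444
a1.sinks.k1.type = hdfs
# %Y%m%d/%H%M%S are time escape sequences; with the rounding below, a new directory is created roughly every 10 seconds
a1.sinks.k1.hdfs.path = hdfs://node01:8020/flume/%Y%m%d/%H%M%S
a1.sinks.k1.hdfs.fileType = DataStream
# netcat events carry no timestamp header, so let the sink use local time for the escapes
a1.sinks.k1.hdfs.useLocalTimeStamp = true
# roll a new file every 30 s, or at 1024 bytes, or after 10 events (the defaults listed above)
a1.sinks.k1.hdfs.rollInterval = 30
a1.sinks.k1.hdfs.rollSize = 1024
a1.sinks.k1.hdfs.rollCount = 10
# close an idle temporary file after 60 s without writes
a1.sinks.k1.hdfs.idleTimeout = 60
# round the timestamp down to the nearest multiple of 10 seconds
a1.sinks.k1.hdfs.round = true
a1.sinks.k1.hdfs.roundValue = 10
a1.sinks.k1.hdfs.roundUnit = second
a1.channels.c1.type = memory
a1.channels.c1.capacity = 1000
a1.channels.c1.transactionCapacity = 100
a1.sources.r1.channels = c1
a1.sinks.k1.channel = c1

Started like the other cases (flume-ng agent -n a1 -c conf -f hdfs.conf -Dflume.root.logger=INFO,console), the sink writes files under /flume/<date>/<rounded time> on HDFS and rolls them according to the three roll* settings.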

