Configuring Flume to Monitor File Contents

Posted by wsc449 on 2017-11-23

Use Case

In the earlier post on installing Flume on a fully distributed Hadoop cluster, we tested Flume watching a directory: whenever a new file was added to the folder, Flume immediately collected it. That covers one scenario, but what if we want to collect the contents of a file? For example, suppose there is a file in a Linux directory to which content is continuously appended; how can those new lines be written to HDFS in real time?

Solution

The configuration in that post watches a folder: when a file is added under the Linux directory, it is automatically collected into HDFS. To monitor a specific file instead, so that Flume picks up data as it is appended to that file, modify flume-conf.properties as follows and leave everything else unchanged:

 # cd /opt/flume1.7.0/conf
 # vim flume-conf.properties

# flume-conf.properties: a single-node Flume configuration
# Name the components on this agent
a1.sources = r1
a1.sinks = k1
a1.channels = c1

# Describe/configure the source: run tail -F so lines appended
# to the file are emitted as Flume events
a1.sources.r1.type = exec
a1.sources.r1.command = tail -F /opt/log/exec.text
# Note: fileHeader and deserializer.* are Spooling Directory Source
# options; the exec source ignores them, but they are kept here as in
# the original configuration
a1.sources.r1.fileHeader = true
a1.sources.r1.deserializer.outputCharset = UTF-8

# Describe the sink: write plain text into HDFS
a1.sinks.k1.type = hdfs
a1.sinks.k1.hdfs.path = hdfs://hadoop0:9000/log
a1.sinks.k1.hdfs.fileType = DataStream
a1.sinks.k1.hdfs.writeFormat = Text
a1.sinks.k1.hdfs.maxOpenFiles = 1
a1.sinks.k1.hdfs.rollCount = 0
a1.sinks.k1.hdfs.rollInterval = 0
a1.sinks.k1.hdfs.rollSize = 1000000
a1.sinks.k1.hdfs.batchSize = 100000

# Use a channel which buffers events in memory
a1.channels.c1.type = memory
a1.channels.c1.capacity = 1000000
a1.channels.c1.transactionCapacity = 100000

# Bind the source and sink to the channel
a1.sources.r1.channels = c1
a1.sinks.k1.channel = c1
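
With hdfs.rollCount and hdfs.rollInterval both set to 0, rolling by event count and by time is disabled, so the sink rolls output files purely by size (hdfs.rollSize = 1000000 bytes, about 1 MB); fileType = DataStream together with writeFormat = Text writes the raw lines without SequenceFile wrapping. Now start the agent from the Flume home directory:
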
 # cd /opt/flume1.7.0/
 # bin/flume-ng agent --conf conf --conf-file conf/flume-conf.properties --name a1 -Dflume.root.logger=INFO,console
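
To verify the pipeline end to end, append some lines to the watched file and inspect the HDFS output. This is a minimal check, assuming the agent above is running and the hdfs client is available on this machine; FlumeData is the HDFS sink's default file prefix. Note that with hdfs.batchSize = 100000, a handful of lines may not be flushed to HDFS right away, so append enough data or lower batchSize while testing:

 # for i in $(seq 1 1000); do echo "test line $i" >> /opt/log/exec.text; done
 # hdfs dfs -ls /log
 # hdfs dfs -cat /log/FlumeData.*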
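
One caveat: the exec source does not record how far it has read, so if the agent restarts, tail -F starts over and events can be lost or duplicated. Flume 1.7, the version used here, also ships a TAILDIR source that stores its read offset in a position file and resumes from it after a restart. A sketch of the source section only, keeping the same agent and channel names (the positionFile path is an arbitrary choice):

a1.sources.r1.type = TAILDIR
a1.sources.r1.positionFile = /opt/flume1.7.0/taildir_position.json
a1.sources.r1.filegroups = f1
a1.sources.r1.filegroups.f1 = /opt/log/exec.text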

