Understanding Elasticsearch in Depth - Filebeat: config and mechanism
-
How Filebeat Works
Filebeat consists of two main components: inputs and harvesters.
-
Harvester
A harvester is responsible for reading the content of a single file.
The harvester reads each file, line by line, and sends the content to the output.
One harvester is started for each file. The harvester is responsible for opening and closing the file, which means that the file descriptor remains open while the harvester is running.
If a file is removed or renamed while it’s being harvested, Filebeat continues to read the file. This has the side effect that the space on your disk is reserved until the harvester closes.
By default, Filebeat keeps the file open until close_inactive is reached.
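close_inactive can be tuned per input in filebeat.yml. The snippet below is a minimal sketch; the path and the 5-minute value are illustrative, not taken from this article:

filebeat.inputs:
  - type: log
    paths:
      - /var/log/app/*.log    # illustrative path
    close_inactive: 5m        # close the file handle after 5 minutes without new lines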
-
Input
An input is responsible for managing the harvesters and finding all sources to read from.
Each input runs its own Go routine.
New lines are only picked up if the size of the file has changed since the harvester was closed.
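As a sketch of how inputs map to harvesters, the following filebeat.yml fragment defines two inputs, each running in its own Go routine and starting one harvester per matched file (the paths are illustrative):

filebeat.inputs:
  - type: log                  # first input: application logs
    paths:
      - /var/log/app/*.log
  - type: log                  # second input: nginx logs, managed independently
    paths:
      - /var/log/nginx/*.log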
-
How does Filebeat keep the state of files?
Filebeat keeps the state of each file and frequently flushes the state to disk in the registry file. The state is used to remember the last offset a harvester was reading from and to ensure all log lines are sent.
If the output, such as Elasticsearch or Logstash, is not reachable, Filebeat keeps track of the last lines sent and will continue reading the files as soon as the output becomes available again.
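Where the registry lives is configurable. A minimal sketch, assuming a Filebeat 7.x-style configuration (the directory and flush interval are illustrative):

filebeat.registry.path: /var/lib/filebeat/registry   # directory where file states are persisted
filebeat.registry.flush: 1s                          # how often the state is flushed to disk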
-
Configure
Filebeat modules provide the fastest getting started experience for common log formats. You can configure modules in the modules.d directory (recommended), or in the Filebeat configuration file. Because Filebeat modules contain default configurations, Elasticsearch ingest node pipeline definitions, and Kibana dashboards to help you implement and deploy a log monitoring solution, make sure you also set up the environment to use Kibana dashboards before running Filebeat with modules enabled.
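For instance, a file in modules.d might look like the following sketch, assuming the nginx module is used (the paths are illustrative overrides of the module defaults):

# modules.d/nginx.yml
- module: nginx
  access:
    enabled: true
    var.paths: ["/var/log/nginx/access.log*"]   # illustrative path override
  error:
    enabled: true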
See Regular expression support for a list of supported regexp patterns.
Filebeat regular expression support is based on RE2.
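Regular expressions show up in input options such as include_lines, exclude_lines, and multiline.pattern. A minimal sketch with illustrative RE2 patterns:

filebeat.inputs:
  - type: log
    paths:
      - /var/log/service/*.log            # illustrative path
    include_lines: ['^ERR', '^WARN']      # keep only lines starting with ERR or WARN
    exclude_lines: ['^DEBUG']             # drop debug lines
    multiline.pattern: '^[[:space:]]+'    # lines starting with whitespace ...
    multiline.negate: false
    multiline.match: after                # ... are appended to the previous line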
By default, Filebeat identifies files based on their inodes and device IDs.
The path section of the filebeat.yml config file contains configuration options that define where Filebeat looks for its files.
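A sketch of the path settings; the locations shown are typical package-install defaults, used here only as an illustration:

path.home: /usr/share/filebeat    # base directory for everything else unless overridden
path.config: /etc/filebeat        # where filebeat.yml and modules.d live
path.data: /var/lib/filebeat      # registry and other persistent data
path.logs: /var/log/filebeat      # Filebeat's own logs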
-
Autodiscover
When you run applications on containers, they become moving targets to the monitoring system.
Autodiscover allows you to track them and adapt settings as changes happen.
Autodiscover providers work by watching for events on the system and translating those events into internal autodiscover events with a common format.
The Docker autodiscover provider watches for Docker containers to start and stop.
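A minimal sketch of a Docker autodiscover configuration, assuming nginx containers should be picked up (the image name and log path pattern are illustrative):

filebeat.autodiscover:
  providers:
    - type: docker
      templates:
        - condition:
            contains:
              docker.container.image: nginx          # match containers by image name
          config:
            - type: container
              paths:
                - /var/lib/docker/containers/${data.docker.container.id}/*.log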
-
Internal Queue
Filebeat uses an internal queue to store events before publishing them.
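The default is an in-memory queue, which can be tuned in filebeat.yml. A minimal sketch with illustrative values:

queue.mem:
  events: 4096            # maximum number of events buffered in memory
  flush.min_events: 512   # publish once at least this many events are queued ...
  flush.timeout: 5s       # ... or after this timeout, whichever comes first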
-
Modules
Filebeat modules simplify the collection, parsing, and visualization of common log formats.
A typical module is composed of one or more filesets. A fileset contains the following (a layout sketch follows this list):
- Filebeat input configurations, which contain the default paths where to look for the log files.
- Elasticsearch Ingest Node pipeline definition, which is used to parse the log lines.
- Fields definitions, which are used to configure Elasticsearch with the correct types for each field.
- Sample Kibana dashboards.
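A module shipped with Filebeat is typically laid out roughly like this; the module and fileset names below are illustrative, and exact file names vary between versions:

module/nginx/
  module.yml                  # references the sample Kibana dashboards
  access/                     # one fileset
    manifest.yml              # fileset variables and which config/pipeline to use
    config/nginx-access.yml   # Filebeat input configuration template
    ingest/pipeline.yml       # Elasticsearch Ingest Node pipeline definition
    _meta/fields.yml          # field definitions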
-
Processor
You can use processors to filter and enhance data before sending it to the configured output.
To define a processor, you specify the processor name, an optional condition, and a set of parameters:

processors:
  - <processor_name>:
      when:
        <condition>
      <parameters>

  - <processor_name>:
      when:
        <condition>
      <parameters>
<processor_name> specifies a processor that performs some kind of action, such as selecting the fields that are exported or adding metadata to the event.
<condition> specifies an optional condition. If the condition is present, then the action is executed only if the condition is fulfilled. If no condition is set, then the action is always executed.
<parameters> is the list of parameters to pass to the processor.
The supported processors are listed in the Filebeat documentation.
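As a concrete sketch, the following drops a couple of fields from debug-level events and adds host metadata to every event (the field names and the condition are illustrative):

processors:
  - drop_fields:
      when:
        equals:
          log.level: debug                  # only act on debug events
      fields: ["agent.ephemeral_id", "ecs.version"]
  - add_host_metadata: ~                    # no condition: always runs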