SkyWalking 是一個應用效能監控系統,特別為微服務、雲原生和基於容器(Docker, Kubernetes, Mesos)體系結構而設計。除了應用指標監控以外,它還能對分散式呼叫鏈路進行追蹤。類似功能的元件還有:Zipkin、Pinpoint、CAT等。
上幾張圖,看看效果,然後再一步一步搭建並使用
1. 概念與架構
SkyWalking是一個開源監控平臺,用於從服務和雲原生基礎設施收集、分析、聚合和視覺化資料。SkyWalking提供了一種簡單的方法來維護分散式系統的清晰檢視,甚至可以跨雲檢視。它是一種現代APM,專門為雲原生、基於容器的分散式系統設計。
SkyWalking從三個維度對應用進行監視:service(服務), service instance(例項), endpoint(端點)
服務和例項就不多說了,端點是服務中的某個路徑或者說URI
SkyWalking allows users to understand the topology relationship between Services and Endpoints, to view the metrics of every Service/Service Instance/Endpoint and to set alarm rules.
SkyWalking允許使用者瞭解服務和端點之間的拓撲關係,檢視每個服務/服務例項/端點的度量,並設定警報規則。
1.1. 架構
SkyWalking邏輯上分為四個部分:Probes(探針), Platform backend(平臺後端), Storage(儲存), UI
這個結構就很清晰了,探針就是Agent負責採集資料並上報給服務端,服務端對資料進行處理和儲存,UI負責展示
2. 下載與安裝
SkyWalking有兩中版本,ES版本和非ES版。如果我們決定採用ElasticSearch作為儲存,那麼就下載es版本。
https://skywalking.apache.org/downloads/
https://archive.apache.org/dist/skywalking/
agent目錄將來要拷貝到各服務所在機器上用作探針
bin目錄是服務啟動指令碼
config目錄是配置檔案
oap-libs目錄是oap服務執行所需的jar包
webapp目錄是web服務執行所需的jar包
接下來,要選擇儲存了,支援的儲存有:
- H2
- ElasticSearch 6, 7
- MySQL
- TiDB
- InfluxDB
作為監控系統,首先排除H2和MySQL,這裡推薦InfluxDB,它本身就是時序資料庫,非常適合這種場景
但是InfluxDB我不是很熟悉,所以這裡先用ElasticSearch7
https://github.com/apache/skywalking/blob/master/docs/en/setup/backend/backend-storage.md
2.1. 安裝ElasticSearch
https://www.elastic.co/guide/en/elasticsearch/reference/7.10/targz.html
# 啟動 ./bin/elasticsearch -d -p pid # 停止 pkill -F pid
ElasticSearch7.x需要Java 11以上的版本,但是如果你設定了環境變數JAVA_HOME的話,它會用你自己的Java版本
通常,啟動過程中會報以下三個錯誤:
[1]: max file descriptors [4096] for elasticsearch process is too low, increase to at least [65535] [2]: max virtual memory areas vm.max_map_count [65530] is too low, increase to at least [262144] [3]: the default discovery settings are unsuitable for production use; at least one of [discovery.seed_hosts, discovery.seed_providers, cluster.initial_master_nodes] must be configured
解決方法:
在 /etc/security/limits.conf 檔案中追加以下內容:
* soft nofile 65536 * hard nofile 65536 * soft nproc 4096 * hard nproc 4096
可通過以下四個命令檢視修改結果:
ulimit -Hn ulimit -Sn ulimit -Hu ulimit -Su
修改 /etc/sysctl.conf 檔案,追加以下內容:
vm.max_map_count=262144
修改es配置檔案 elasticsearch.yml 取消註釋,保留一個節點
cluster.initial_master_nodes: ["node-1"]
為了能夠ip:port方式訪問,還需修改網路配置
network.host: 0.0.0.0
修改完是這樣的:
至此,ElasticSearch算是啟動成功了
接下來,在 config/application.yml 中配置es地址即可
storage: selector: ${SW_STORAGE:elasticsearch7} elasticsearch7: clusterNodes: ${SW_STORAGE_ES_CLUSTER_NODES:192.168.100.19:9200}
2.2. 安裝Agent
https://github.com/apache/skywalking/blob/v8.2.0/docs/en/setup/service-agent/java-agent/README.md
將agent目錄拷貝至各服務所在的機器上
scp -r ./agent chengjs@192.168.100.12:~/
這裡,我將它拷貝至各個服務目錄下
plugins是探針用到各種外掛,SkyWalking外掛都是即插即用的,可以把optional-plugins中的外掛放到plugins中
修改 agent/config/agent.config 配置檔案,也可以通過命令列引數指定
主要是配置服務名稱和後端服務地址
agent.service_name=${SW_AGENT_NAME:user-center} collector.backend_service=${SW_AGENT_COLLECTOR_BACKEND_SERVICES:192.168.100.17:11800}
當然,也可以通過環境變數或系統屬性的方式來設定,例如:
export SW_AGENT_COLLECTOR_BACKEND_SERVICES=127.0.0.1:11800
最後,在服務啟動的時候用命令列引數 -javaagent 來指定探針
java -javaagent:/path/to/skywalking-agent/skywalking-agent.jar -jar yourApp.jar
例如:
java -javaagent:./agent/skywalking-agent.jar -Dspring.profiles.active=dev -Xms512m -Xmx1024m -jar demo-0.0.1-SNAPSHOT.jar
3. 啟動服務
修改 webapp/webapp.yml 檔案,更改埠號及後端服務地址
server: port: 8080 collector: path: /graphql ribbon: ReadTimeout: 10000 # Point to all backend's restHost:restPort, split by , listOfServers: 127.0.0.1:12800
啟動服務
bin/startup.sh
或者分別依次啟動
bin/oapService.sh bin/webappService.sh
檢視logs目錄下的日誌檔案,看是否啟動成功
瀏覽器訪問 http://127.0.0.1:8080
4. 告警
編輯 alarm-settings.yml 設定告警規則和通知
https://github.com/apache/skywalking/blob/v8.2.0/docs/en/setup/backend/backend-alarm.md
重點說下告警通知
為了使用釘釘機器人通知,接下來,新建一個專案
<?xml version="1.0" encoding="UTF-8"?> <project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 https://maven.apache.org/xsd/maven-4.0.0.xsd"> <modelVersion>4.0.0</modelVersion> <parent> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-parent</artifactId> <version>2.4.0</version> <relativePath/> <!-- lookup parent from repository --> </parent> <groupId>com.wt.monitor</groupId> <artifactId>skywalking-alarm</artifactId> <version>1.0.0-SNAPSHOT</version> <name>skywalking-alarm</name> <properties> <java.version>1.8</java.version> </properties> <dependencies> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-web</artifactId> </dependency> <dependency> <groupId>com.aliyun</groupId> <artifactId>alibaba-dingtalk-service-sdk</artifactId> <version>1.0.1</version> </dependency> <dependency> <groupId>commons-codec</groupId> <artifactId>commons-codec</artifactId> <version>1.15</version> </dependency> <dependency> <groupId>com.alibaba</groupId> <artifactId>fastjson</artifactId> <version>1.2.75</version> </dependency> <dependency> <groupId>org.projectlombok</groupId> <artifactId>lombok</artifactId> <optional>true</optional> </dependency> </dependencies> <build> <plugins> <plugin> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-maven-plugin</artifactId> </plugin> </plugins> </build> </project>
可選依賴(不建議引入)
<dependency <groupId>org.apache.skywalking</groupId> <artifactId>server-core</artifactId> <version>8.2.0</version> </dependency>
定義告警訊息實體類
package com.wt.monitor.skywalking.alarm.domain; import lombok.Data; import java.io.Serializable; /** * @author ChengJianSheng * @date 2020/12/1 */ @Data public class AlarmMessageDTO implements Serializable { private int scopeId; private String scope; /** * Target scope entity name */ private String name; private String id0; private String id1; private String ruleName; /** * Alarm text message */ private String alarmMessage; /** * Alarm time measured in milliseconds */ private long startTime; }
傳送釘釘機器人訊息
package com.wt.monitor.skywalking.alarm.service; import com.dingtalk.api.DefaultDingTalkClient; import com.dingtalk.api.DingTalkClient; import com.dingtalk.api.request.OapiRobotSendRequest; import com.taobao.api.ApiException; import lombok.extern.slf4j.Slf4j; import org.apache.commons.codec.binary.Base64; import org.springframework.beans.factory.annotation.Value; import org.springframework.stereotype.Service; import javax.crypto.Mac; import javax.crypto.spec.SecretKeySpec; import java.io.UnsupportedEncodingException; import java.net.URLEncoder; import java.security.InvalidKeyException; import java.security.NoSuchAlgorithmException; /** * https://ding-doc.dingtalk.com/doc#/serverapi2/qf2nxq * @author ChengJianSheng * @data 2020/12/1 */ @Slf4j @Service public class DingTalkAlarmService { @Value("${dingtalk.webhook}") private String webhook; @Value("${dingtalk.secret}") private String secret; public void sendMessage(String content) { try { Long timestamp = System.currentTimeMillis(); String stringToSign = timestamp + "\n" + secret; Mac mac = Mac.getInstance("HmacSHA256"); mac.init(new SecretKeySpec(secret.getBytes("UTF-8"), "HmacSHA256")); byte[] signData = mac.doFinal(stringToSign.getBytes("UTF-8")); String sign = URLEncoder.encode(new String(Base64.encodeBase64(signData)),"UTF-8"); String serverUrl = webhook + "×tamp=" + timestamp + "&sign=" + sign; DingTalkClient client = new DefaultDingTalkClient(serverUrl); OapiRobotSendRequest request = new OapiRobotSendRequest(); request.setMsgtype("text"); OapiRobotSendRequest.Text text = new OapiRobotSendRequest.Text(); text.setContent(content); request.setText(text); client.execute(request); } catch (ApiException e) { e.printStackTrace(); log.error(e.getMessage(), e); } catch (NoSuchAlgorithmException e) { e.printStackTrace(); log.error(e.getMessage(), e); } catch (UnsupportedEncodingException e) { e.printStackTrace(); log.error(e.getMessage(), e); } catch (InvalidKeyException e) { e.printStackTrace(); log.error(e.getMessage(), e); } } }
AlarmController.java
package com.wt.monitor.skywalking.alarm.controller; import com.alibaba.fastjson.JSON; import com.wt.monitor.skywalking.alarm.domain.AlarmMessageDTO; import com.wt.monitor.skywalking.alarm.service.DingTalkAlarmService; import lombok.extern.slf4j.Slf4j; import org.springframework.beans.factory.annotation.Autowired; import org.springframework.web.bind.annotation.PostMapping; import org.springframework.web.bind.annotation.RequestBody; import org.springframework.web.bind.annotation.RequestMapping; import org.springframework.web.bind.annotation.RestController; import java.text.MessageFormat; import java.util.List; /** * @author ChengJianSheng * @date 2020/12/1 */ @Slf4j @RestController @RequestMapping("/skywalking") public class AlarmController { @Autowired private DingTalkAlarmService dingTalkAlarmService; @PostMapping("/alarm") public void alarm(@RequestBody List<AlarmMessageDTO> alarmMessageDTOList) { log.info("收到告警資訊: {}", JSON.toJSONString(alarmMessageDTOList)); if (null != alarmMessageDTOList) { alarmMessageDTOList.forEach(e->dingTalkAlarmService.sendMessage(MessageFormat.format("-----來自SkyWalking的告警-----\n【名稱】: {0}\n【訊息】: {1}\n", e.getName(), e.getAlarmMessage()))); } } }
5. 文件
https://skywalking.apache.org/
https://skywalking.apache.org/zh/
https://github.com/apache/skywalking/tree/v8.2.0/docs
https://archive.apache.org/dist/
https://www.elastic.co/guide/en/elasticsearch/reference/master/index.html