混沌工程最佳實踐 - 尋交流

esnake0發表於2024-06-07

背景:
最近由於工作所需,開始學習瞭解混沌工程。
目前專案多用 AWS 的雲服務,由此展開了對官方混沌工程工具 AWS FIS 的調研。

調研結果做了一點筆記,包含工具的基本功能,可注入的缺陷型別,價格等
以下為筆記
Resource
https://docs.aws.amazon.com/fis/latest/userguide/what-is.html
https://docs.aws.amazon.com/zh_cn/wellarchitected/latest/reliability-pillar/rel_testing_resiliency_failure_injection_resiliency.html

Price
https://aws.amazon.com/cn/fis/pricing/?refid=12eea001-bcfd-40ce-9788-748f73400e32

The AWS FIS price is $0.10 per action-minute plus an additional $0.10 per action-minute for each additional account, for all regions except AWS GovCloud (US-East and US-West), where the price is $0.12 per action-minute, plus an additional $0.12 per action-minute for each additional account.
increasing CPU utilization on an instance or failing over a database is called an action.
Experiments include one or several actions that will be run in sequence or in parallel per your specification.
Charges are the same regardless of the action type, number of affected resources, or total experiment duration.

Feature
https://aws.amazon.com/cn/fis/features/

list of scenarios
https://docs.aws.amazon.com/fis/latest/userguide/scenario-library-scenarios.html

EC2:
- EC2 stress: instance failure
- EC2 stress: Disk
- EC2 stress: CPU
- EC2 stress: Memory
- EC2 stress: Network Latency

EKS:
- EKS stress: Pod Delete
- EKS stress: CPU
- EKS stress: Disk
- EKS stress: Memory
- EKS stress: Network latency

for multi-AZ and multi-Region
- AZ availability: Power interruption
- Cross-Region: Connectivity

目前看來能用上的大概是 EKS 的幾種缺陷注入能力,

想打聽一下咱們 testerhome 有沒有 混沌工程 方面的大佬?
對於 基於 AWS 雲的微服務架構系統 進行缺陷注入和混沌實驗有什麼建議?
對於混沌工程 領域新人常踩的坑有哪些?
在實際生產環境執行混沌實驗有哪些最佳實踐?
如何評估實驗成本?

希望能多多交流。

相關文章