使用describe命令進行Kubernetes pod錯誤排查

i042416發表於2018-11-20

原文網址 : http://blog.itpub.net/24475491/viewspace-2220728/

我有一個pod名叫another，用kubectl create建立後發現過了29分鐘，狀態還是處於ContainerCreating階段。

使用describe命令進行Kubernetes pod錯誤排查

使用kubectl describe命令檢查：

使用describe命令進行Kubernetes pod錯誤排查

從錯誤訊息發現是因為這個pod attach volume失敗：

FailedAttachVolume 2m1s (x22 over 31m) attachdetach-controller AttachVolume.Attach failed for volume "pvc-c4d41f5c-e7ed-11e8-8726-fe6d42bf075f" : googleapi: Error 400: RESOURCE_IN_USE_BY_ANOTHER_RESOURCE - The disk resource 'projects/sap-pi-coo-acdc-dev/zones/europe-west1-b/disks/shoot--k8s-train--shac-pvc-c4d41f5c-e7ed-11e8-8726-fe6d42bf075f' is already being used by 'projects/sap-pi-coo-acdc-dev/zones/europe-west1-b/instances/shoot--k8s-train--shacw46-worker-prvfv-z1-7844dc6744-ghd5m'

Warning FailedMount 31s (x14 over 29m) kubelet, shoot--k8s-train--shacw46-worker-prvfv-z1-7844dc6744-hhrmd Unable to mount volumes for pod "another_part-0110(13f15fa4-e819-11e8-8726-fe6d42bf075f)": timeout expired waiting for volumes to attach or mount for pod "part-0110"/"another". list of unmounted volumes=[content-storage]. list of unattached volumes=[content-storage default-token-6z5sk]

檢視這個pod的yaml檔案，果然發現有一個persistent volume的claim：

使用describe命令進行Kubernetes pod錯誤排查

用命令kubectl get pv, 發現當前所有的persistent volume都被佔用了（BOUND狀態）：

使用describe命令進行Kubernetes pod錯誤排查

解決方案有很多種，處於測試目的，我只是簡單地將另一個同樣宣告瞭nginx-pvc作為PersistentVolumeClaim的pod刪除，然後這個名為another的pod狀態就很快變成Running了：

使用describe命令進行Kubernetes pod錯誤排查

從describe命令生成的日誌裡也能清楚的觀察到這個成功mount volume的事件：

使用describe命令進行Kubernetes pod錯誤排查

Normal SuccessfulAttachVolume 84s attachdetach-controller AttachVolume.Attach succeeded for volume "pvc-c4d41f5c-e7ed-11e8-8726-fe6d42bf075f"

要獲取更多Jerry的原創文章，請關注公眾號"汪子熙":

來自 “ ITPUB部落格 ” ，連結：http://blog.itpub.net/24475491/viewspace-2220728/，如需轉載，請註明出處，否則將追究法律責任。

相關文章

通過describe命令學習Kubernetes的pod屬性詳解
2018-11-20
kubernetes 載入pod出現ErrImageNeverPull錯誤
2022-11-01
Kubernetes Pod OOM 排查日記
2020-08-07
OOM
Kubernetes 使用arthas進行除錯
2020-08-06
除錯
pod install命令後tool 'xcodebulid' required Xcode...錯誤
2018-07-22
XCodeUI
使用 KRAWL 掃描 Kubernetes 錯誤
2020-02-27
Kubernetes的Pod進階（十一）
2022-01-27
排查錯誤日誌
2020-06-09
使用 sudo 命令出現錯誤
2018-07-12
使用ErrorStack進行錯誤跟蹤及診斷
2018-12-19
Error
[20180302]使用find命令小錯誤.txt
2018-03-02
如何優雅的在 Kubernetes Pod 內進行網路抓包
2022-05-21
使用 Kubernetes 最容易犯的 10 個錯誤！
2020-09-29
使用錯誤的運算子進行字串比較缺陷漏洞
2021-10-12
字串
使用kubernetes的10個最常見錯誤 – pipetail Blog
2020-05-18
AI
Minikube：使用 Kubernetes 進行本地開發
2022-11-28
傲視Kubernetes(三)：Kubernetes中的Pod
2020-12-13
Kubernetes之Pod排程
2018-12-14
Kubernetes Pod驅逐策略
2020-11-02
kubernetes之pod中斷
2019-06-06
Kubernetes：Pod總結(二)
2022-02-10
Kubernetes Pod 全面知識
2021-11-29
Kubernetes部署單元-Pod
2022-04-11
Abp框架之執行Update-Database 命令系列錯誤
2018-10-24
框架Database
IIS 7.5 解析錯誤命令執行漏洞解決方案
2019-05-27
【常見錯誤】--Nltk使用錯誤
2018-09-13
使用pdb進行Python除錯
2021-06-30
Python除錯
Kubernetes 無法查詢到並且無法刪除pod例項的排查過程
2018-12-26
Kubernetes之Pod工作負載
2024-03-23
負載
Kubernetes：Pod 升級、回滾
2021-12-03
Kubernetes:28---pod託管（Job：任務型pod）
2020-12-28
使用 acme 指令碼給專案新增 https 證書排查錯誤一例
2021-12-23
ACM指令碼HTTP
利用errorstack事件進行錯誤跟蹤和診斷
2018-05-20
Error事件
淺析Node是如何進行錯誤處理的
2020-04-03
docker中使用systemctl命令時報Too many open files錯誤
2021-03-03
Docker
使用 C-Reduce 進行除錯
2019-03-03
除錯
使用exp進行SQL報錯注入
2020-08-19
SQL
使用IDEA進行遠端除錯
2024-06-27
Idea除錯