Troubleshooting Kubernetes Pod Errors with the describe Command

2021-12-03 23:50:00

I have a pod named another. 29 minutes after I created it with kubectl create, its status was still ContainerCreating.

I inspected the pod with kubectl describe.

The error messages show that the pod failed to attach a volume:

FailedAttachVolume 2m1s (x22 over 31m) attachdetach-controller AttachVolume.Attach failed for volume "pvc-c4d41f5c-e7ed-11e8-8726-fe6d42bf075f" : googleapi: Error 400: RESOURCE_IN_USE_BY_ANOTHER_RESOURCE - The disk resource 'projects/sap-pi-coo-acdc-dev/zones/europe-west1-b/disks/shoot--k8s-train--shac-pvc-c4d41f5c-e7ed-11e8-8726-fe6d42bf075f' is already being used by 'projects/sap-pi-coo-acdc-dev/zones/europe-west1-b/instances/shoot--k8s-train--shacw46-worker-prvfv-z1-7844dc6744-ghd5m'

Warning FailedMount 31s (x14 over 29m) kubelet, shoot--k8s-train--shacw46-worker-prvfv-z1-7844dc6744-hhrmd Unable to mount volumes for pod "another_part-0110(13f15fa4-e819-11e8-8726-fe6d42bf075f)": timeout expired waiting for volumes to attach or mount for pod "part-0110"/"another". list of unmounted volumes=[content-storage]. list of unattached volumes=[content-storage default-token-6z5sk]
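
The two events above name different worker nodes: the disk is reported as already in use by one instance, while the kubelet on another instance is waiting to mount it. As a rough illustration, the Python sketch below pulls both node names out of the event text. The helper names `disk_holder` and `mounting_node` and the regular expressions are my own assumptions based on the messages shown here, not a documented Kubernetes or GCP format.

```python
import re
from typing import Optional


def disk_holder(attach_message: str) -> Optional[str]:
    """Return the instance that the GCP error says is already using the disk."""
    match = re.search(r"already being used by\s+\S*?instances/([^\s'’]+)", attach_message)
    return match.group(1) if match else None


def mounting_node(event_source: str) -> Optional[str]:
    """Return the node name from an event source of the form 'kubelet, <node>'."""
    match = re.search(r"kubelet,\s*(\S+)", event_source)
    return match.group(1) if match else None


# Relevant parts of the two events from the describe output.
attach_message = (
    "googleapi: Error 400: RESOURCE_IN_USE_BY_ANOTHER_RESOURCE - The disk resource "
    "'projects/sap-pi-coo-acdc-dev/zones/europe-west1-b/disks/"
    "shoot--k8s-train--shac-pvc-c4d41f5c-e7ed-11e8-8726-fe6d42bf075f' "
    "is already being used by "
    "'projects/sap-pi-coo-acdc-dev/zones/europe-west1-b/instances/"
    "shoot--k8s-train--shacw46-worker-prvfv-z1-7844dc6744-ghd5m'"
)
mount_source = "kubelet, shoot--k8s-train--shacw46-worker-prvfv-z1-7844dc6744-hhrmd"

holder = disk_holder(attach_message)
wanted = mounting_node(mount_source)
print("disk attached to:", holder)
print("pod scheduled on:", wanted)
print("same node:", holder == wanted)  # prints "same node: False"
```

The node suffixes differ (ghd5m versus hhrmd), which matches the error: the disk cannot be attached to the pod's node while another instance still holds it.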

Looking at the pod's YAML file, I found that it does indeed declare a persistent volume claim.

Running kubectl get pv showed that every persistent volume was currently taken (status Bound).
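
To see which claim holds each volume, the JSON form of the same listing can be filtered. The following is a minimal sketch, assuming input shaped like the output of `kubectl get pv -o json`; the function name `bound_volumes` is hypothetical, and the sample data is illustrative (the second volume is invented for contrast, and the first is assumed to be the volume behind nginx-pvc).

```python
def bound_volumes(pv_list: dict) -> list:
    """Return (volume name, 'namespace/claim') for every PV whose phase is Bound."""
    result = []
    for pv in pv_list.get("items", []):
        if pv.get("status", {}).get("phase") == "Bound":
            ref = pv.get("spec", {}).get("claimRef", {})
            claim = f"{ref.get('namespace')}/{ref.get('name')}"
            result.append((pv["metadata"]["name"], claim))
    return result


# Illustrative data only; the "example-free-pv" entry is hypothetical.
sample_pv_list = {
    "items": [
        {
            "metadata": {"name": "pvc-c4d41f5c-e7ed-11e8-8726-fe6d42bf075f"},
            "spec": {"claimRef": {"namespace": "part-0110", "name": "nginx-pvc"}},
            "status": {"phase": "Bound"},
        },
        {
            "metadata": {"name": "example-free-pv"},
            "spec": {},
            "status": {"phase": "Available"},
        },
    ]
}

for name, claim in bound_volumes(sample_pv_list):
    print(f"{name} is bound to {claim}")
```

Against a real cluster, one option is to pipe `kubectl get pv -o json` into a script that reads it with `json.load(sys.stdin)` and passes the result to `bound_volumes`.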

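There are many ways to resolve this. Since this was only a test environment, I simply deleted the other pod that also declared nginx-pvc as its PersistentVolumeClaim, and the status of the pod named another quickly changed to Running.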
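
Finding the competing pod by hand means reading each pod's volumes. As a sketch of how that lookup could be automated, the function below scans input shaped like `kubectl get pods -n part-0110 -o json` for pods that reference a given claim. The function name `pods_using_claim` and the pod name `nginx-other` are hypothetical; the original post does not name the pod that was deleted.

```python
def pods_using_claim(pod_list: dict, claim_name: str) -> list:
    """Return the names of pods that mount the given PersistentVolumeClaim."""
    names = []
    for pod in pod_list.get("items", []):
        for volume in pod.get("spec", {}).get("volumes", []):
            pvc = volume.get("persistentVolumeClaim")
            if pvc and pvc.get("claimName") == claim_name:
                names.append(pod["metadata"]["name"])
                break  # one match per pod is enough
    return names


# Illustrative data only; "nginx-other" stands in for the unnamed second pod.
sample_pod_list = {
    "items": [
        {
            "metadata": {"name": "another"},
            "spec": {
                "volumes": [
                    {"name": "content-storage",
                     "persistentVolumeClaim": {"claimName": "nginx-pvc"}},
                    {"name": "default-token-6z5sk",
                     "secret": {"secretName": "default-token-6z5sk"}},
                ]
            },
        },
        {
            "metadata": {"name": "nginx-other"},
            "spec": {
                "volumes": [
                    {"name": "content-storage",
                     "persistentVolumeClaim": {"claimName": "nginx-pvc"}},
                ]
            },
        },
    ]
}

print(pods_using_claim(sample_pod_list, "nginx-pvc"))  # prints ['another', 'nginx-other']
```

Any pod in the result other than the stuck one is a candidate for holding the disk on a different node.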

The events printed by the describe command also clearly show the volume now being attached successfully:

Normal SuccessfulAttachVolume 84s attachdetach-controller AttachVolume.Attach succeeded for volume "pvc-c4d41f5c-e7ed-11e8-8726-fe6d42bf075f"
