본문 바로가기

kubernetes5

kubernetes, helm, gpu monitoring 명령어 정리 gpu operator helm repository helm repo add nvidia https://helm.ngc.nvidia.com/nvidia \ && helm repo update helm install gpu operator helm install --wait --generate-name \ -n gpu-operator --create-namespace nvidia/gpu-operator --set driver.enabled=false helm delete gpu operator helm ls -n gpu-operator helm delete -n gpu-operator gpu-operator-1675669830 helm custom gpu monitor https://grafana.com/.. 2023. 2. 6.
Ubuntu, kubernetes, nvidia gpu monitoring 정리 Ubuntu 20.04 containerd 를 kubernetes cri로 사용 helm, prometheus, grafana 사용 1. nvidia-container-toolkit 설치 (master node, worker node 모두 작업) distribution=$(. /etc/os-release;echo $ID$VERSION_ID) curl -s -L https://nvidia.github.io/libnvidia-container/gpgkey | sudo apt-key add - curl -s -L https://nvidia.github.io/libnvidia-container/$distribution/libnvidia-container.list | sudo tee /etc/apt/sources.. 2023. 2. 1.
Ubuntu, Kubernetes dashboard 정리 kubernetes 20.04 참고글: https://github.com/kubernetes/dashboard GitHub - kubernetes/dashboard: General-purpose web UI for Kubernetes clusters General-purpose web UI for Kubernetes clusters. Contribute to kubernetes/dashboard development by creating an account on GitHub. github.com https://github.com/kubernetes/dashboard/blob/master/docs/user/access-control/creating-sample-user.md GitHub - kubernet.. 2023. 1. 22.
Ubuntu, Kubernetes Metallb 설치 정리 2개의 ubuntu system으로 구성된 kubernetes cluster의 master node에 설치 1. Enable strict ARP mode kubectl get configmap kube-proxy -n kube-system -o yaml | \ sed -e "s/strictARP: false/strictARP: true/" | \ kubectl apply -f - -n kube-system 2. Install metallab by Manifest kubectl apply -f https://raw.githubusercontent.com/metallb/metallb/v0.13.7/config/manifests/metallb-native.yaml 3. namespace 확인 kubectl g.. 2022. 12. 31.
Ubuntu, Kubernetes Cluster 구성 정리 Ubuntu system 두 대를 활용해서 Kubernetes Cluster 구성 Kubernetes, containerd, calico 로 구성 1. 호스트 등록 (master node에서만 수행) /etc/hosts 에 내부 ip주소와 username 입력 # sudo nano /etc/hosts 192.168.1.173 master 192.168.1.174 worker1 2. swap off (모든 node에서 수행) sudo swapoff -a sudo sed -i '/ swap / s/^\(.*\)$/#\1/g' /etc/fstab 3. kernel modul load 저장 (모든 node에서 수행) sudo tee /etc/modules-load.d/containerd.conf unable t.. 2022. 12. 31.