Question 1

Node SelectorとTolerations付きGPUワークロードPod

Accepted Answer

## KubernetesでのGPUワークロード

GPUワークロードにはGPUハードウェアを持つノードに着陸するための特定のスケジューリングが必要です。Kubernetesはノードラベル、tolerations、拡張リソースを使用してGPUスケジューリングを管理します。

### 主要な設定

yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: ml-inference
  labels:
    app: "ml-inference"
    workload: "gpu"
spec:
  replicas: 1
  template:
    spec:
      containers:
        - name: inference
          image: my-ml-model:latest
          ports:
            - name: grpc
              containerPort: 8500
          resources:

Question 2

When is this useful?

Accepted Answer

Kubernetesクラスタの専用GPUノードで機械学習推論、モデルトレーニング、ビデオトランスコーディング、その他のGPUアクセラレーションワークロードを実行する。

機能	nodeSelector	nodeAffinity
構文	シンプルなキーバリュー	表現力のあるオペレーター
ソフトプリファレンス	いいえ	はい（preferredDuringScheduling）
複数条件	ANDのみ	ANDとOR
ユースケース	シンプルなGPUスケジューリング	複雑なマルチゾーンスケジューリング

Node SelectorとTolerations付きGPUワークロードPod

詳細な説明

KubernetesでのGPUワークロード

主要な設定

Node Selector vs Node Affinity

Tolerations

ユースケース

試してみる — K8s Pod Spec Builder

関連トピック