18 Apr 2026
docker部署llm

docker image准备

docker pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:latest
docker tag swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:latest vllm-openai:latest

启动llm

docker run --rm -d --gpus all `
    --shm-size=30g `
    -p 8000:8000 `
    --ipc=host `
    -v D:/ai-learning/model/DeepSeek-R1-Distill-Qwen-1.5B:/models/DeepSeek-R1-Distill-Qwen-1.5B `
    vllm-openai:latest `
    --model /models/DeepSeek-R1-Distill-Qwen-1.5B `
    --gpu-memory-utilization 0.9 `
    --swap-space 20 `
    --max-model-len 8192 `
    --dtype half `
    --max-num-seqs 10

验证

$ curl http://192.168.0.122:8000/v1/chat/completions -H "Content-Type: application/json" -d "{\"model\":\"/models/DeepSeek-R1-DistillQwen-1.5B\",\"messages\":[{\"role\":\"user\",\"content\":\"你好，请介绍一下自己\"}],\"max_tokens\":100}"
{
  "id": "chat-d8e563fc13094ce195f9b661a376a33e",
  "object": "chat.completion",
  "created": 1776502168,
  "model": "/models/DeepSeek-R1-Distill-Qwen-1.5B",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "<think>\n\n</think>\n\n你好！我是DeepSeek-R1，一个由深度求索公司开发的智能助手，我会尽我所能为您提供帮助。请问有什么可以为您服务的？",
        "tool_calls": [

        ]
      },
      "logprobs": null,
      "finish_reason": "stop",
      "stop_reason": null
    }
  ],
  "usage": {
    "prompt_tokens": 7,
    "total_tokens": 45,
    "completion_tokens": 38
  },
  "prompt_logprobs": null
}

总结

gpu驱动版本需要更新，否则vllm不支持
参数需要调整，否则gpu oom
大模型通过国内镜像下载
vllm通过国内镜像下载

LEo at 00:12

18 Apr 2026 docker部署llm

docker image准备

启动llm

验证

总结

18 Apr 2026
docker部署llm