about blog github

18 Apr 2026
docker部署llm

docker image准备

docker pull swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:latest
docker tag swr.cn-north-4.myhuaweicloud.com/ddn-k8s/docker.io/vllm/vllm-openai:latest vllm-openai:latest

启动llm

docker run --rm -d --gpus all `
    --shm-size=30g `
    -p 8000:8000 `
    --ipc=host `
    -v D:/ai-learning/model/DeepSeek-R1-Distill-Qwen-1.5B:/models/DeepSeek-R1-Distill-Qwen-1.5B `
    vllm-openai:latest `
    --model /models/DeepSeek-R1-Distill-Qwen-1.5B `
    --gpu-memory-utilization 0.9 `
    --swap-space 20 `
    --max-model-len 8192 `
    --dtype half `
    --max-num-seqs 10

验证

$ curl http://192.168.0.122:8000/v1/chat/completions -H "Content-Type: application/json" -d "{\"model\":\"/models/DeepSeek-R1-DistillQwen-1.5B\",\"messages\":[{\"role\":\"user\",\"content\":\"你好,请介绍一下自己\"}],\"max_tokens\":100}"
{
  "id": "chat-d8e563fc13094ce195f9b661a376a33e",
  "object": "chat.completion",
  "created": 1776502168,
  "model": "/models/DeepSeek-R1-Distill-Qwen-1.5B",
  "choices": [
    {
      "index": 0,
      "message": {
        "role": "assistant",
        "content": "<think>\n\n</think>\n\n你好!我是DeepSeek-R1,一个由深度求索公司开发的智能助手,我会尽我所能为您提供帮助。请问有什么可以为您服务的?",
        "tool_calls": [

        ]
      },
      "logprobs": null,
      "finish_reason": "stop",
      "stop_reason": null
    }
  ],
  "usage": {
    "prompt_tokens": 7,
    "total_tokens": 45,
    "completion_tokens": 38
  },
  "prompt_logprobs": null
}

总结

  • gpu驱动版本需要更新,否则vllm不支持
  • 参数需要调整,否则gpu oom
  • 大模型通过国内镜像下载
  • vllm通过国内镜像下载


LEo at 00:12

about blog github