Skip to content

Commit 429cf01

Browse files
update vllm version to 0.13.0 (#753)
* upgrade vllm to 0.13.0 * remove attenion-backend
1 parent ecc8ff4 commit 429cf01

4 files changed

Lines changed: 3 additions & 4 deletions

File tree

model-engine/model_engine_server/inference/vllm/Dockerfile.vllm

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# syntax=docker/dockerfile:1
2-
ARG VLLM_VERSION=0.11.1
2+
ARG VLLM_VERSION=0.13.0
33
ARG VLLM_BASE_REPO=vllm/vllm-openai
44
ARG VLLM_BASE_IMAGE=${VLLM_BASE_REPO}:v${VLLM_VERSION}
55
FROM ${VLLM_BASE_IMAGE} AS base

model-engine/model_engine_server/inference/vllm/build_and_upload_image.sh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -29,7 +29,7 @@ fi
2929
ACCOUNT=$1
3030
IMAGE_TAG=$2
3131
BUILD_TARGET=$3
32-
VLLM_VERSION=${VLLM_VERSION:-"0.10.2"}
32+
VLLM_VERSION=${VLLM_VERSION:-"0.13.0"}
3333
VLLM_BASE_REPO=${VLLM_BASE_REPO:-"vllm/vllm-openai"}
3434

3535
# if build target = vllm use vllm otherwise use vllm_batch
Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
vllm==0.11.0
1+
vllm==0.13.0

model-engine/model_engine_server/inference/vllm/vllm_server.py

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -82,7 +82,6 @@ def debug(sig, frame):
8282

8383
def parse_args(parser: FlexibleArgumentParser):
8484
parser = make_arg_parser(parser)
85-
parser.add_argument("--attention-backend", type=str, help="The attention backend to use")
8685
return parser.parse_args()
8786

8887

0 commit comments

Comments
 (0)