Running VLLM
Typical runs:
vllm serve --dtype=half --max_model_len 3424 Qwen/Qwen2.5-1.5B-Instruct
Typical runs:
vllm serve --dtype=half --max_model_len 3424 Qwen/Qwen2.5-1.5B-Instruct
From here you can search these documents. Enter your search terms below.
Keys | Action |
---|---|
? | Open this help |
n | Next page |
p | Previous page |
s | Search |