Running VLLM
Typical runs:
vllm serve --dtype=half --max_model_len 3424 Qwen/Qwen2.5-1.5B-Instruct
Typical runs:
vllm serve --dtype=half --max_model_len 3424 Qwen/Qwen2.5-1.5B-Instruct
From here you can search these documents. Enter your search terms below.
| Keys | Action |
|---|---|
| ? | Open this help |
| n | Next page |
| p | Previous page |
| s | Search |