Qwen3.5 0.8B
Alibaba's compact Qwen3.5 vision-language model for lightweight multimodal deployment
Serve the model
Start server
Choose module, then engine and optional parameters on the left, then copy the serve command by clicking the button on the right.
Command
·
No command for this module and engine in model data.
Call the model over Web API
Copy a client command below and paste it into your terminal to make a Web API request to the model you just served.
curl -s http://${JETSON_HOST}:8000/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{
"model": "Qwen/Qwen3.5-0.8B",
"messages": [{"role": "user", "content": "Hello!"}]
}' Model Details
Qwen3.5 0.8B is the smallest vision-language model in the Qwen3.5 lineup. It is designed for lightweight local multimodal inference, fast iteration, and efficient Jetson deployment.
Inputs and Outputs
Input: Text and images
Output: Text
Intended Use Cases
- Visual question answering: Ask questions about images and receive text responses
- Image understanding: Captioning, scene description, and visual analysis
- Tool calling: OpenAI-compatible tool use via vLLM
- Rapid prototyping: Quick local multimodal experiments
Additional Resources
- Hugging Face Model - Original checkpoint