Multimodal
Qwen3.5 4B
Alibaba's efficient Qwen3.5 4B vision-language model tuned for practical multimodal deployment
Parameters 4B
Modalities
Text Image
Context Length 256K
License Apache 2.0
Precision
AWQ 4-bit
Serve the model
Start server
Choose module, then engine and optional parameters on the left, then copy the serve command by clicking the button on the right.
Command
·
No command for this module and engine in model data.
Call the model over Web API
Copy a client command below and paste it into your terminal to make a Web API request to the model you just served.
curl -s http://${JETSON_HOST}:8000/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{
"model": "cyankiwi/Qwen3.5-4B-AWQ-4bit",
"messages": [{"role": "user", "content": "Hello!"}]
}' Model Details
Qwen3.5 4B offers a balanced point in the Qwen3.5 family for local multimodal instruction following, visual understanding, and agent-style workloads on Jetson.
Inputs and Outputs
Input: Text and images
Output: Text
Intended Use Cases
- Visual question answering: Multimodal prompting with image inputs
- Image understanding: Captioning, scene analysis, and grounded responses
- Tool calling: Structured tool use with vLLM
- Multilingual tasks: Translation and multilingual prompting
Additional Resources
- Original Model - Base Qwen3.5 4B checkpoint
- AWQ Checkpoint - Quantized checkpoint used here