Qwen-Image
Qwen-Image is a 20B MMDiT text-to-image model that produces generally appealing, coherent images with good overall composition but a distinctly “polished / dreamy” aesthetic rather than a strictly photographic one. It is strong at: - Global composition and scene coherence, even with long, detailed prompts.
Speed
Price
Inputs
Text
Outputs
Image
Lab
Alibaba Cloud
Qwen-Image sample outputs
Captured directly from our eval suite. Click any tile to inspect the full render.
More from the Qwen family
Explore other options from the same family.