Model summary
Alibaba Qwen image editing model. Moat is money: strong resource backing; Chinese foundation model labs are not far behind global peers.
Qwen-Image Edit is a 20B MMDiT image editing model focused on prompt‑driven and reference‑based edits. It performs particularly well on product-oriented tasks and multi-angle product edits, showing strong adherence to reference images and generally good prompt following for placement and composition. However, its style transfer capabilities are currently weak: it struggles to preserve subjects’ facial identity and fails to accurately capture specific artistic styles, often producing generic painterly looks with overly bright, cheerful colors and poor texture detail. When editing people, it tends to over-smooth skin and clothing, giving an airbrushed, somewhat unrealistic appearance, and hair (especially beards) can look synthetic. It may also slightly distort product scale, sometimes making objects larger than is realistic. Overall, Qwen-Image Edit is well-suited for product renders and general image edits where ultra-realistic texture, perfect identity preservation, or highly faithful artistic style transfer are not critical.