Alibaba’s Qwen team released a new image generation artificial intelligence (AI) model last week. Dubbed Qwen VLo, it is the successor to the Qwen 2.5 Vision Language model and comes with several upgrades over the older models. The latest AI image model supports both text-to-image and image-to-image generation. It also accepts text input in multiple languages, including English and Chinese. In addition to image generation, the AI model can make inline edits to input images as well as to the images it generates.
Qwen VLo accepts prompts in multiple languages
In a post on X (formerly known as Twitter), the official handle of the Qwen team announced the release of the new model. The model’s technical name is Qwen3-235B-A22B, and it is currently available for free on the company’s chat interface. Users can also use the model without logging in.
Gadgets 360 staff members tested the AI model and found its image generation capability to be on par with Google’s Imagen 2. Its instruction following and image output quality are slightly lower than those of Imagen 3 and OpenAI’s GPT-4o-powered image generation. However, its generation time is faster than both of them, and its rate limit is comparatively higher.
On its GitHub page, the company said Qwen VLo comes with improved image understanding, which lets it make better inline edits without distorting the structural integrity of the input image. It also improves the overall quality of the output. The model also understands vague and open-ended prompts, and can generate images that align with the user’s expectations.
In addition to image generation and editing, Qwen VLo can perform image analysis-related functions such as edge detection, segmentation, prediction mapping, and more. The company said that a future version of the model will also be able to accept multiple input images and combine them based on user requests.
Text rendering has also been improved in the latest AI image generator. In our testing of the model, we were able to generate accurate text in various fonts. Finally, Qwen VLo supports images with dynamic aspect ratios as input, including extremes such as 4:1 and 1:3. The company plans to soon add the ability to generate images in different aspect ratios.