Chinese AI start-up Shengshu unveils Vidu Q2 in challenge to OpenAI’s Sora

Chinese artificial intelligence start-up Shengshu AI on Tuesday rolled out Vidu Q2, a new video-generation model aimed at challenging OpenAI’s Sora, intensifying the global race to create realistic, controllable AI-generated videos.

The model allows creators to produce expressive and consistent videos using up to seven reference images to define faces, gestures, scenes or props. Shengshu said the “Vidu Q2 Reference-to-Video” feature can merge multiple unrelated elements – such as distinct characters, objects and backgrounds – into a single coherent clip, improving realism, cinematic motion and prompt comprehension.

“Vidu Q2 ‘Reference-to-Video’ marks a new chapter in AI video creation,” said Luo Yihang, CEO of Shengshu Technology. A former head of AI solutions and product lines at ByteDance’s cloud unit Volcano Engine, Luo said the model “goes beyond basic video creation; it’s about teaching AI to act and tell stories alongside creators”.

Vidu Q2, along with its application programming interface (API) for enterprises, is available globally through Shengshu’s platform. It supports reference-to-video, image-to-video and text-to-video generation, producing clips of up to eight seconds in various aspect ratios, and also features synchronised dialogue and sound effects.

Membership tiers range from free access to a flagship plan costing up to 600 yuan (US$84) a month, which offers faster output and more generation capacity.

Shengshu AI co-founder and president Tang Jiayu during a product demonstration. Photo: Ben Jiang

Founded in March 2023, the Beijing-based company focuses on multimodal AI models through its Vidu video platform, which includes the previously released Vidu 1.5, Vidu 2.0 and Vidu Q1 versions.
