Chinese short video platform Kuaishou on Monday launched a new artificial intelligence video generation model to challenge OpenAI’s Sora and start-up Runway in the global market for AI content creation.
Kuaishou, which competes with TikTok’s Chinese sibling Douyin in China, said Kling O1 was the first unified multimodal video model in the industry, based on an architecture that integrated diverse video creation tasks – generation, precise and controllable editing and understanding – into a single platform, providing a “seamless, end-to-end workflow for the creative industry”, according to a statement.
Kling O1 was hailed as the “Nano Banana for AI video”, according to Alvaro Cintas-Canto, an assistant professor of AI and cybersecurity at Marymount University in the US. He lauded the video tool’s versatility in handling text-to-video, content editing and maintaining video character consistency in complex scenes in a post published to his X account on Monday.
Advertisement
Nano Banana is Google’s image generation and editing model, which is known for its precise manipulation of visual elements in a photo. Kuaishou claimed to be the first in the industry to build the same advanced editing capabilities into a video generation model, paving the way for its use in real-world settings across industries.

Kuaishou credited Kling O1’s multimodal capabilities and semantic comprehension for allowing it to take in and understand different content formats from text, images, video and visual elements. This lent it the ability to “see and understand” different parts and perspectives of an image, video or character that users upload as a reference when generating videos, while retouching video content in a more precise fashion.
Advertisement

