KeLing AI officially enters the 2.0 era! Stronger semantic response, better dynamic quality, and more beautiful visual texture

KUAISHOU's Keling AI 2.0 has significantly improved generation effects in semantic response, dynamic quality, and visual aesthetics. Keling 2.0 Master Edition has fully upgraded controllable generation and editing capabilities for video and image creation, launching a new multimodal video editing feature that can flexibly understand user intent and support adding, deleting, and modifying video content

On April 15th, KUAISHOU's Keling AI announced another upgrade to its base model, officially releasing the Keling 2.0 video generation model and the Ketu 2.0 image generation model.

KUAISHOU Senior Vice President and Head of Community Science Line, Gai Kun, introduced at the Keling 2.0 model launch that the Keling 2.0 Master Edition has significantly improved generation effects in semantic response, dynamic quality, and visual aesthetics. The Keling 2.0 Master Edition has fully upgraded controllable generation and editing capabilities for video and image creation, launching a new multimodal video editing feature that can flexibly understand user intent and support adding, deleting, and modifying video content.

How "smart" is Keling AI 2.0? Let's explore!

Significant Improvement in Semantic Response Ability

Keling 2.0 has made significant progress in semantic response. It can more accurately understand the text commands input by users and generate video or image content that closely matches them. This means that users can guide AI creation with more natural and complex language descriptions, resulting in works that better meet their expectations.

For example, users can describe the atmosphere of a scene, the actions and emotions of characters in detail, and Keling 2.0 can accurately incorporate these elements into the generated content.

“The man first laughs happily, then suddenly becomes angry, slamming his hand on the table and standing up.”

Dynamic Quality Optimization

In terms of dynamic quality, Keling 2.0 has achieved a qualitative leap. The generated videos show significant improvements in motion smoothness, temporal coherence, and camera effects. Whether it's complex action scenes or delicate emotional expressions, Keling 2.0 can present them in a more natural and realistic way.

“A dinosaur charges towards the camera, with motion blur and camera shake.”

Visual Texture Upgrade

Keling 2.0 has also undergone comprehensive optimization in visual texture. The generated images and videos excel in color, lighting effects, and detail representation. The Ketu 2.0 image generation model has also significantly improved in instruction adherence, cinematic quality, and artistic style performance, capable of generating images with cinematic quality.

“A girl transitions from sitting quietly on a park bench to slowly walking out of the frame, with the morning light gradually shifting to the midday sun and then transitioning to dusk, the sky's colors changing from pink-orange to deep blue and then to purple-red, with passing pedestrians forming flowing shadowy trails in a fixed shot, highlighting the slow crawl of light and shadow on the wooden texture of the bench, while fallen leaves accumulate under the bench and are swept up by the wind.”

It is understood that current video generation mainly consists of text-to-video and image-to-video types. KUAISHOU Vice President and Keling AI Head, Zhang Di, disclosed that 85% of video creation is completed through image-to-video.

In the demonstration by Gai Kun, users can convey multidimensional complex ideas in their minds to AI through the MVL method, combining multimodal information such as image references and video clips, rather than just text prompts.

"A bard cat poet, singing his own story in a tavern, while his hand strums the guitar chords."

"A dive that looks professional but is actually a rookie."

"A girl just finished a performance and sincerely bowed to everyone."

"First-person perspective, driving, is real driving."

"A softly crying alien."

Some netizens commented:

"After watching the Keling 2.0 launch event, I have seen radical and conservative factions form around me. The radicals believe Keling 2.0 is already world-leading, while the conservatives think the radicals are too conservative..."