“Kling 3.0 is built on a deeply unified training platform, enabling truly native multimodal input and output. Through seamless audio integration and advanced consistency control, the model brings a stronger sense of realism and coherence to generated content,” the company said in its announcement.

The model combines multiple capabilities: converting text, images, and reference materials into video; adding or removing content; and modifying or transforming existing clips.

Introducing the Kling 3.0 Model: Everyone a Director. It’s Time. X

Video length has been extended to 15 seconds. Other improvements include more flexible shot control and more accurate prompt adherence. Overall realism has also been enhanced, with character movements becoming more expressive and dynamic.

Kling Video 3.0 vs. Kling Video 2.6 comparison. Source: Kling AI

A new Multi-Shot feature analyzes prompts to determine scene structure and shot types, automatically adjusting camera angles and composition.

The model supports a wide range of editing styles—from classic shot–reverse-shot dialogues to parallel storytelling and scenes with voice-over narration.

“There’s no longer a need for tedious cutting and editing—one generation is enough to create a cinematic video and make complex audiovisual formats accessible to all creators,” the announcement said.

In addition to standard image-to-video generation, Kling 3.0 supports multiple image references and video inputs as scene elements.

Kling Video 3.0 vs. Kling Video 2.6 comparison. Source: Kling AI
Kling 3.0 is truly "one giant leap for AI video generation"! X

The model locks in the characteristics of characters, objects, and scenes. Regardless of camera movement or narrative development, key elements remain stable and consistent throughout the video.

Native audio has also been upgraded: speech is synchronized more accurately with facial expressions, and in dialogue scenes users can manually specify the speaker.

The list of supported languages has expanded to include Chinese, English, Japanese, Korean, and Spanish, with improved handling of dialects and accents.

In addition, the team upgraded its multimodal O1 model to Video 3.0 Omni.

Source: Kling AI.

Users can upload audio clips containing at least three seconds of speech to extract a voice, or provide three- to eight-second video clips of a character to capture its core attributes.

Competitors put pressure on Sora

OpenAI introduced its video generation model Sora in February 2024. The tool sparked excitement on social media, but a public release only followed in December.

With that December release, nearly a year after the preview, users gained access to text-to-video generation, image animation, and video extension features.

The iOS version of Sora, released in September, quickly attracted attention, surpassing 100,000 downloads on its first day. It reached 1 million installs faster than ChatGPT, despite being invite-only.

However, the trend soon reversed. In December, downloads fell by 32% month over month, and the decline continued in January, with 1.2 million installs recorded.

Source: Appfigures

The slowdown was driven by several factors. First, competition intensified: Google’s Nano Banana model strengthened Gemini’s position, Sora came under pressure from Meta AI and its Vibes feature, and Runway’s Gen-4.5 model raised the bar in independent tests.

Second, OpenAI encountered copyright challenges. Users generated videos featuring popular characters such as SpongeBob and Pikachu, forcing the company to tighten restrictions.

In December, the situation stabilized after OpenAI reached an agreement with Disney, allowing users to generate videos featuring the studio’s characters. However, this did not lead to renewed download growth.