AltHunter Updates |

China’s $9 AI Video Tool Kling 2.1 Adds Audio—Can It Beat Google’s $250 Veo 3?

by AltHunter
June 17, 2025
in All About News, Latest News

In brief

  • Chinese AI tool Kling 2.1 now generates videos with synchronized audio, including footsteps, rain, and ambient effects.
  • At about $9 a month, Kling costs less than one-twentieth as much as the $250-a-month subscription required for Google’s Veo 3.
  • We tested both tools head-to-head: Kling shines on pricing and flexibility, but Veo still leads in dialogue and sound design quality.

Chinese short video platform Kuaishou has added an audio generation feature to Kling 2.1, its AI-powered video creation tool, enabling users to produce clips with synchronized sound effects such as footsteps, rainfall, and ambient noise.

The feature, which launched quietly last week, is available in Kling’s image-to-video mode, where users upload a still image and the platform animates it with both motion and audio generated by artificial intelligence.

The timing pits Kling against Google’s Veo 3, which launched with integrated audio capabilities from day one.

Early users on X praised Kling’s seamless audio-visual synchronization, with creator Roberto Nickson calling it “one of the most useful models on the market” for producing generative video content.

The feature is free during initial rollout, accessible through Kling’s website and mobile app.

Kling 2.1 one of the most useful models on the market

— Roberto Nickson (@rpnickson) June 12, 2025

Kling 2.1 generates 5- to 10-second clips at up to 1080p resolution, utilizing what the company describes as “3D spatiotemporal attention mechanisms” to synchronize sounds with visuals.

The audio tool currently generates sound effects only, with no dialogue or music. When a prompt involves speech, it produces something that resembles a tonal Southeast Asian language but is completely unintelligible. But that by itself isn’t enough to crown Google the undisputed king of generative video.

We tested Kling 2.1’s new audio features against Google’s Veo 3 to see how the upstart stacks up.

The Price of Creation

The price gap between the two platforms turns out to be massive.

Kling 2.1’s audio feature is only compatible with the standard version, not the higher-end Master edition. Even so, at regular prices, users can generate more than 20 videos on Kling for every single Veo 3 creation.

For example, using Freepik’s credit system, one generation with Google Veo 3 is currently on sale for 4,000 credits (with the normal price being 8,000 credits per video), whereas Kling 2.1 costs 300 credits per video.

Google’s model runs exclusively through its $250-per-month Ultra subscription. Kling is available on its official site, offering some free generations, with subscriptions starting at around $9 per month.

Even with Google’s current promotional pricing, Veo 3 remains more than ten times as expensive as Kling: 4,000 credits versus 300, or roughly 13 to 1.
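The ratios behind these comparisons can be checked with a few lines of arithmetic. The sketch below uses only the figures quoted in this article; the variable and function names are illustrative, and the subscription comparison is rough, since the two plans do not necessarily include the same number of generations.

```python
# Figures as quoted in the article; all names here are illustrative only.
VEO3_CREDITS_SALE = 4_000      # Freepik credits per Veo 3 video (promotional)
VEO3_CREDITS_REGULAR = 8_000   # Freepik credits per Veo 3 video (normal price)
KLING_CREDITS = 300            # Freepik credits per Kling 2.1 video
VEO3_MONTHLY_USD = 250         # Google's Ultra subscription
KLING_MONTHLY_USD = 9          # Approximate entry-level Kling subscription


def cost_ratio(expensive: float, cheap: float) -> float:
    """Return how many times more the expensive option costs."""
    return expensive / cheap


ratios = {
    "freepik_sale": cost_ratio(VEO3_CREDITS_SALE, KLING_CREDITS),
    "freepik_regular": cost_ratio(VEO3_CREDITS_REGULAR, KLING_CREDITS),
    "subscription": cost_ratio(VEO3_MONTHLY_USD, KLING_MONTHLY_USD),
}

for label, ratio in ratios.items():
    print(f"{label}: Veo 3 costs about {ratio:.1f}x as much as Kling 2.1")
```

On these numbers, the promotional Freepik price works out to roughly 13 to 1, while the regular Freepik price and the subscription prices both land above 20 to 1.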

For creators who know video generation involves plenty of trial and error, with failure rates that frustrate even patient users, Kling’s economics make experimentation feasible.

The Premium plan on Kling unlocks 1080p resolution, improving overall video quality while still maintaining the cost advantage.

Audio Capabilities

But you get what you pay for. Veo 3 offers sophisticated sound generation, accurately synthesizing speech and matching complex audio elements to visual scenes.

Its understanding of spatial audio and contextual sounds surpasses Kling’s by a wide margin.

Kling 2.1 can’t compete on that front but, in fairness, it aims at something different: ambient sounds and background effects, with no dialogue and no music. So forget about those viral AI street interviews for now. Attempts to generate speech produce gibberish.

Yet for scenes or videos requiring atmospheric audio, its results were serviceable.

2. An off-road SUV drives through rocky, muddy, and wet forest terrain.

You hear the crunch, the splash, the growl of the engine. Felt like a real shoot. pic.twitter.com/S0gVhCAQjk

— ZOYA ✪ (@Zoya_ai) June 12, 2025

The platform’s new ability to add sound effects to existing silent videos gives it an edge that Veo 3 can’t match.

Users can upload finished videos and retrofit them with appropriate soundscapes, a workflow that Google’s model doesn’t support. Weirdly, Veo can create videos, but it can’t edit them.

Besides the ability to create sounds for silent videos, Kling also offers a lip-syncing feature.

Users can upload a photo and, separately, a recording of speech or dialogue, and the model will make a video in which the subjects move their lips naturally, as if they were speaking the uploaded audio.

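【Kling AI (@Kling_ai)】Lip sync update!! 📢
A lip-sync editing feature has been added that lets you select a character appearing in the video, choose which person is speaking, and adjust the timing of the audio.… pic.twitter.com/brvGUOgLKs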

— SEIIIRU😈動画生成AI×AfterEffects (@seiiiiiiiiiiru) June 10, 2025

The twenty-to-one generation ratio means creators can experiment with different audio approaches on Kling, while Veo 3 users have to nail their sound design in far fewer attempts.

For hobbyists and those learning generative video, Kling’s approach offers more room for trial and error.

But professional creators needing precise audio-visual synchronization and dialogue will find Veo 3’s sophisticated sound engine worth the premium.

Video Generation Quality

Video quality testing produced unexpected results. In a test scene featuring a woman fleeing from a giant spider, Kling 2.1’s standard version outperformed both Veo 3 and its own Master edition.

The standard model accurately represented the scene dynamics, exhibiting fluid motion and proper directional movement. Veo 3 inexplicably generated the woman running toward the spider instead of away from it.

The Master edition typically produces sharper, crisper visuals, but the standard version demonstrated superior scene comprehension and more fluid movement.

This is odd, since the higher-end edition would be expected to deliver better results, but the problem may have boiled down to prompt technique or simply bad luck in the generation.

That said, Kling 2.1 standard with 1080p generations is a great model that holds its own against Google Veo 3 here.

Platform Workflows and Limitations

Platform limitations shape each tool’s workflow differently. Kling 2.1’s audio feature works only with image-to-video generation. Text-to-video remains exclusive to the Master edition, which has no audio support. Yes, this is odd, but it is what it is.

The best workaround is using Kolors, Kuaishou’s image generator, to create starting frames before converting them to video with synchronized audio. Kolors produces highly realistic images that serve as excellent starting points for video generation.

However, you might find that models including Reve, MidJourney, Recraft, Flux, and even ChatGPT are easier to prompt.

Veo 3 takes the opposite approach, offering only text-to-video generation without any image-to-video option.

This forces users to rely entirely on prompt engineering, with no way to control the starting visual.

Google’s decision also seems particularly odd given that the previous Veo 2 does actually support image-to-video through its separate Flow platform.

The lack of visual control means users have to generate videos blindly, hoping their text prompts will produce the desired starting frames.

Content Moderation Approaches

Content moderation revealed contrasting philosophies. Veo 3 employs aggressive keyword filtering and post-generation checks, blocking content that violates Google’s policies.

The system flags potentially problematic prompts before generation and analyzes completed videos for policy violations.

Kling applies more liberal restrictions, allowing content that Veo will block outright.

However, the model’s training data naturally excluded explicit content—the model generates figures without anatomical details and violence without gore.

As a result, users can generate some content that Veo’s keyword filters would block outright, while the output itself still stays within safety boundaries.

Both platforms refund credits when post-generation censorship blocks a video, but Kling’s lighter touch allows more creative freedom within boundaries.

Conclusions

Veo 3 might still be the king, but Kling 2.1 is definitely close to a populist on a mission to overthrow the monarchy.

Its audio feature is pretty revolutionary when you consider it’s a $9 tool competing against a $250 subscription.

The atmospheric sounds work, the rain sounds like rain, footsteps match the movement most of the time, and you can generate twenty attempts while Veo users carefully craft their single shot.

That retrofit feature, where you add sound to finished videos, is something Google doesn’t offer, and it’s genuinely useful for salvaging silent clips.

Things will look completely different if your primary goal is speech. Kling’s gibberish won’t fool anyone.

For this kind of specific requirement, Google Veo 3 is the obvious and only choice. The king is (almost) dead. Long live the Kling!

Edited by Josh Quittner and Sebastian Sinclair

Copyright © 2025 AltHunter Updates.
