The Alibaba stand on the World Synthetic Intelligence Convention on the Shanghai World Expo Exhibition Middle in Shanghai, China, on July 5, 2024.
Nurphoto | Nurphoto | Getty Photographs
Whereas U.S. markets have been targeted on the influence of Anthropic and Altruist’s instruments on software program and monetary providers, China’s tech giants have launched AI fashions this week which have proven developments in robotics and video era.
Alibaba, TikTok creator ByteDance and short-video platform Kuaishou, have all launched new AI fashions that underscore how Chinese language corporations are maintaining with these within the U.S.
It comes after Google DeepMind boss Demis Hassabis informed CNBC that Chinese language AI fashions are simply “months” behind Western rivals.
These fashions from China are immediately competing with video era fashions corresponding to OpenAI’s Sora, in addition to robotics fashions from Nvidia and Google.
This is a rundown of the fashions.

Alibaba’s RynnBrain
Alibaba’s DAMO Academy unveiled RynnBrain this week, an AI mannequin designed to assist robots comprehend the bodily world round them and establish objects.
In a video demo, Alibaba confirmed a robotic with pincers for arms that appeared to have the ability to depend oranges, choose them up and place them in a basket. It was additionally proven taking milk out of a fridge.
Fashions require in depth coaching to allow them to establish on a regular basis objects to work together with, which signifies that easy duties like selecting up fruit might be difficult in robotics.
RynnBrain now places Alibaba in competitors with the likes of Nvidia and Google that are creating their very own AI fashions for robots.
“Considered one of its key improvements is built-in time and house consciousness,” Adina Yakefu, a researcher at Hugging Face, informed CNBC.
“As an alternative of merely reacting to rapid inputs, the robotic can bear in mind when and the place occasions occurred, monitor activity progress, and proceed throughout a number of steps. This makes it extra dependable and coherent in advanced real-world environments.”
Yakefu added that Alibaba’s “broader ambition” was to “set up a foundational intelligence layer for embodied methods.”
ByteDance’s Seedance 2.0
Seedance 2.0 is a video era AI mannequin able to producing a practical video from only a textual content immediate from a person. However prompts may include different movies and pictures.
Movies created with Seedance 2.0 and reviewed by CNBC seem to indicate fairly real looking imagery and video that has been absolutely created with AI.
Billy Boman, who is predicated in Stockholm, Sweden, and runs a artistic promoting company that produces AI-generated content material, has used Seedance 2.0.
He stated AI video era has made important strides over the previous two years, with fast enhancements throughout the trade.

“Again in 2023 … it was tough to get somebody to run or to stroll. Any sort of realism was [limited to] very quick clips, every part was very gradual, unhealthy textures, no pores and skin textures, missing element. Now the script has flipped. Now I can do something. It has been nothing wanting distinctive, the technological developments,” Boman informed CNBC in an interview.
Hugging Face’s Yakefu, added that the Seedance 2.0 mannequin has proven progress from earlier generations in “controllability, pace and manufacturing effectivity.”
“Seedance 2.0 is among the most well-rounded video era fashions I’ve examined to date. It genuinely shocked me by delivering satisfying outcomes on the primary strive, even with a easy immediate. The visuals, music, and cinematography come collectively in a means that feels polished relatively than experimental,” Yakefu stated.
Nonetheless, whereas customers have praised the know-how, Seedance has run into bother. Native Chinese language media reported that Seedance has suspended a characteristic that allowed the AI to generate the voice of an individual based mostly on an image they uploaded. It got here after a blogger in China raised considerations in regards to the voice era happening with out consent.
ByteDance was not instantly accessible for remark when contacted by CNBC.
Kuaishou’s Kling 3.0
Launched final week, Kuaishou’s Kling 3.0 is one other video era mannequin to rival ByteDance’s.
Kling 3.0 “options main upgrades in consistency, photorealistic output, prolonged video period as much as 15s, and native audio era throughout a number of languages, dialects, and accents.
The mannequin is simply accessible to paying subscribers however can be accessible to the general public quickly, Kuaishou stated.
Kuaishou’s success with its Kling fashions has been a key issue behind its greater than 50% share value rise during the last yr.
Kuaishou shares year-to-date
Different key AI mannequin releases
Zhipu AI — which trades as Data Atlas Expertise in Hong Kong — noticed its shares surge on Thursday after it launched GLM-5, an open-source large-language mannequin with enhanced coding capabilities and long-running agent duties.
The corporate stated the mannequin approaches Anthropic’s Claude Opus 4.5 in coding benchmarks whereas surpassing Google’s Gemini 3 Professional on some assessments. CNBC couldn’t confirm these claims.
Shares of MiniMax additionally jumped Thursday after it launched its up to date M2.5 open-source mannequin with enhanced AI agent instruments. “Brokers” or “agentic AI” refers to AI instruments designed to automate duties.
— CNBC’s Anniek Bao and Dylan Butts contributed to this report.