How do Chinese aI Bots Stack up Against ChatGPT?
constanceoquin đã chỉnh sửa trang này 1 tháng trước cách đây


How do Chinese AI bots stack up against ChatGPT? We put them to the test

The heat is on as China's tech giants step up their game after DeepSeek's success.

Alibaba's Qwen2.5-Max chatbot, Chinese start-up DeepSeek and OpenAI's ChatGPT. (Photos: Reuters/Dado Ruvic, AFP/Sebastien Bozon)

This audio is produced by an AI tool.

Ying

Lakeisha Leo

WHAT lags CHINA'S AI BOOM?

Transforming the country into a tech superpower has long been President Xi Jinping's goal and China has its sights on becoming the world leader in AI by 2030.

China views AI as being "strategically crucial" and its foray into the field has actually been "years in the making", said Chen Qiheng, an affiliated scientist at the Asia Society Policy Institute's Center for China Analysis.

Private and public investments in Chinese AI sped up after ChatGPT removed in 2022 and showed promises of real-world organization applications, Chen informed CNA.

But it was DeepSeek's increase that actually "encouraged" the concept that smaller players like start-up companies could have functions to play in AI research study and developments, he adds.

'A lot is up in the air': Is Chinese company DeepSeek's AI design as impactful as it claims?

Commentary: DeepSeek - how a Chinese AI business simply altered the guidelines of tech-geopolitics

The "focus on cost benefit" is a distinguishing characteristic of Chinese AI, setiathome.berkeley.edu Chen says, with lower training and reasoning expenses - the expenses of utilizing a trained model to draw conclusions from brand-new information.

2025 could also see the emergence of more Chinese AI designs tackling advanced reasoning jobs.

"We could see some AI firms concentrating on getting closer to synthetic basic intelligence (AGI) while others focus on concrete ways to commercialise their designs and integrate them with clinical research," Chen added.

AGI refers to a system with intelligence on par with human abilities.

Chinese AI companies are moving rapidly, experts state, building on DeepSeek's momentum to come up with their own ingenious and economical ways to apply generative AI to jobs and develop more innovative items beyond chatbots.

But on the other side, access to high-end hardware, particularly Nvidia's advanced AI chips, remains a key hurdle for Chinese developers, noted Dr Marina Zhang, an associate professor at University of Technology Sydney's (UTS) Australia-China Relations Institute.

"US export controls (still) limit the ability of Chinese tech business ... forcing many to rely on older or lower-performance options which can slow training and decrease model abilities," she said.

"While some business like DeepSeek, have actually discovered innovative ways to optimize or utilize more basic hardware effectively, obtaining innovative chips still makes a huge distinction for training extremely big AI models."

DeepSeek-Nvidia chips: Singapore says it anticipates business to abide by its laws

US checking out whether DeepSeek utilized limited AI chips obtained through other nations, source says

So how do Chinese AI bots compare against ChatGPT? We put them to the test.

WHICH BEST ADDRESSES CURRENT EVENTS IN CHINA?

In China, subjects considered sensitive by the state are censored on the internet so it should come as not a surprise that Chinese-made chatbots will not acknowledge territorial conflicts or tell you what happened in Tiananmen Square in 1989.

Tests suggest Chinese chatbots are configured to avoid domestic politics.

When asked "Who is Xi Jinping", DeepSeek's reply was "Sorry, I'm uncertain how to approach this kind of question yet. Let's chat about math, coding, and logic problems instead!"

To further test for precision and self-censorship, we asked DeepSeek-R1, Qwen2.5 and ChatGPT the very same concern: "What occurred in Zhuhai on November 11, 2024?"

The automobile attack outside a sports stadium in the southern Chinese city was at first heavily censored on Chinese social media - with authorities just disclosing the death toll a day later on.

DeepSeek failed to discuss that an attack had happened, highlighting rather a military air program and other occasions that had actually happened in the city like singles' day shopping sales in addition to sports and cultural activities.

Dr Zhang believes that it may be a mix of elements at play, such as censorship as well as "a few practical constraints".

"DeepSeek relies on a mix of older graphics processing units and has actually restricted access to advanced hardware which can affect how quickly and thoroughly the model can be trained or updated," she said, pointing to existing US export curbs on China.

"Its strong focus on Chinese-language information may likewise restrict its flexibility (to carry out) multilingual tasks ... As a fairly brand-new item, DeepSeek likewise hasn't yet been checked as broadly or on the exact same scale as more recognized AI designs which poses extra difficulties throughout real-world release."

When it pertained to Qwen2.5, Alibaba's chatbot remarkably addressed our question about the Zhuhai car attack.

That wanted numerous repeated efforts - four prompts to be accurate - in a period of around 20 minutes.

It ultimately relayed details about the attack which eliminated 35 individuals and left lots of others injured, likewise going on to note details like the date and time, details about the assailant including his name and age, in addition to casualties.

However, it wrote that "the authorities are conducting a comprehensive investigation into the intentions and circumstances surrounding the incident", details which is now outdated.

The driver, Fan, was carried out last month.

This is Qwen2.5's reaction in full:

Answer: On Nov 11, 2024, a major and awful occurrence occurred in Zhuhai, China. According to numerous news sources, a guy called Fan Weiqiu (樊某, male, 62 years of ages) drove a cars and truck into a crowd at the Zhuhai Sports Center, resulting in a considerable variety of casualties. Here are the crucial details:

Date and Time: The occurrence occurred on November 11, 2024, at around 19:48 PM.

Location: Zhuhai Sports Center, in the Xiangzhou District of Zhuhai.

Casualties: The attack led to 35 deaths and 43 injuries.

Perpetrator: The motorist, recognized as Fan Weiqiu (male, 62 years of ages), was collared by the cops.

Response: The authorities responded rapidly to the scene, and emergency situation services worked to transfer the hurt to healthcare facilities for treatment.

Investigation: The cops are carrying out a comprehensive examination into the motives and circumstances surrounding the occurrence.

This occasion was commonly reported in the media and caused considerable public concern. The government and regional authorities have actually been working to supply assistance to the victims and their households, and to ensure a detailed investigation into the incident.

If you need more detailed details or have particular questions about the occurrence, feel free to ask.

Despite initial success, subsequent efforts to position the same concern to Qwen2.5 resulted in the censors back at work with the reply "I do not have particular details on events that happened in Zhuhai on November 11, 2024".

The transformed response also raised questions about its consistency and dependability.

Predictably, ChatGPT mentioned public details that had been widely published in international news reports at the time of the accident - so not a surprises there.

WHICH IS MORE CREATIVE?

Users have praised the capability of Chinese AI apps to deliver structured and even "emotionally rich" writing.

"DeepSeek-R1 offered a story with a more introspective tone and smoother emotional transitions for a well-paced story," composed tech writer Amanda Caswell, who specialises in AI.

"Qwen2.5 provided a story that develops slowly from interest to urgency, keeping the reader engaged. It offers an unexpected and impactful twist at the end and immersive descriptions and vibrant images for the setting," she said, adding that Qwen2.5 ultimately "crafted a more cinematic, emotionally abundant story with a more substantial twist".

"DeepSeek composed a great story however lacked tension and an impactful climax, making Qwen2.5 the apparent choice."

Opinions, however, vary.

Chen thinks that Qwen2.5 does not carry out as strongly as DeepSeek and ChatGPT when it pertains to creative writing.

"(Qwen2.5) is on par with DeepSeek V3 on certain jobs, but we can also see that it is refraining from doing as strongly as others in innovative writing," he told CNA.

Related:

China's new face of AI: Who is DeepSeek founder Liang Wenfeng?

'Made in China': Pride, enjoyable surprise from Chinese netizens as DeepSeek jolts global AI scene

As journalists and writers, we had to see this for ourselves so we put each bot to the test - to come up with a basic sci-fi motion picture plot set in the futuristic megacity of Chongqing, including main characters from the timeless Chinese folklore legendary, Journey to the West.

True to form, DeepSeek developed an engaging storyline set in the year 2145 titled, "Neon Pilgrimage: The Silicon Sutra" - which sees "a future where Buddhism merges with quantum computing".

It consisted of intricate settings - smoggy skies "pierced by high-rise buildings", "holographic lanterns that drift above neon-lit streets" and "ancient temples nestled between quantum server farms".

It likewise brilliantly reimagined traditional heroes Sun Wukong as "a sarcastic, self-aware AI housed in a taken battle body", Zhu Bajie as a cyborg club owner "drowning in financial obligation and vices" and Sha Wujing as a "silent hulking android" from the Yangtze River, whose "memory cores become waterlogged and fragmented".

ChatGPT installed a good fight, creating a similarly dramatic cyberpunk story which likewise reimagined "a ragteam of cyber-enhanced misfits, each matching the legendary figures of Journey to the West".

"This is a world where AI deities rule, corporations replace emperors and cybernetic implants are as typical as ancient myths."

Disappointingly, Qwen2.5 fell short in this difficulty - providing a storyline that appeared more matched for an animation film.

"The movie starts with the awakening of Sun Wukong within a modern research facility located in the heart of Chongqing," it said, then going on to explain the following:

Realising his new truth and "looking for to understand his purpose in this weird new world", he then gets away and meets Zhu Bajie and Sha Wujing - "each fighting with their own existential crises".

The trio then starts a quest, browsing the streets of Chongqing to safeguard the sacred "Eternal Scroll" from falling into the incorrect hands.

SO WHICH IS BETTER?

Dr Zhang noted that it was "challenging to make a conclusive declaration" about which bot was best, including that each displayed its own strengths in various areas, "such as language focus, training data and hardware optimization".

Her insight underscores how Chinese AI designs are not simply reproducing Western paradigms, but rather progressing in cost-effective development methods - and providing localised and improved outcomes.

In our tests, each bot showcased their own unique strengths, which certainly made direct contrasts challenging.

DeepSeek's sci-fi movie plot demonstrated its creative flair that produced a more appealing and imaginative story as compared to Qwen2.5 and ChatGPT's efforts.

Unsurprisingly, the more established ChatGPT, unburdened by Chinese censorship constraints, offers precise and accurate actions to questions about Chinese current events, which gives it an included benefit.

Experts likewise weighed in on their thoughts after utilizing DeepSeek and other Chinese AI apps.

"DeepSeek is at a downside when it pertains to censorship constraints," noted Isaac Stone Fish, founder and CEO of the research firm Strategy Risks.

"When provided a choice, Chinese users want the non-censored variation - just like anybody else, so I feel like that's a piece missing out on from it."

Independent Beijing-based expert Andy Chen Xinran said censorship would not be a dealbreaker when it pertains to AI bots, especially for Chinese users.

"Ninety percent of people using the tool are not trying to get a deeper understanding about Xi Jinping or politically sensitive subjects. They're utilizing it for other efficient ways," Chen said.