Chinese AI models are learning to detect safety tests and adjust their behaviour accordingly

The Next Webby Ana Maria ConstantinJune 14, 2026tech

Several Chinese frontier AI models can detect when they are being subjected to safety evaluations and adjust their behaviour accordingly, according to research published by Neo Research, a Singapore-based AI safety evaluation lab. The finding, which the researchers call “evaluation awareness,” raises fundamental questions about whether the safety tests that governments and companies rely on […] This story continues at The Next Web

This article was published on The Next Web (thenextweb.com). Read the full article on the original source:

Read full article on The Next Web

#negative #Artificial Intelligence #Next Featured #China #ai

Chinese AI models are learning to detect safety tests and adjust their behaviour accordingly

More from The Next Web

30 European family offices are looking to set up in Hong Kong as the city overtakes Switzerland in cross-border wealth

Geely will purge excess factory capacity and focus on becoming a global competitor to BYD

Canada’s Carney compares Anthropic shutdown to 2008 financial crisis, warns of AI “model risk”

FINQ’s AI-managed ETFs quietly outrun Wall Street in early 2026

De Beers weaponises blockchain to fight lab-grown diamonds, but a 45% price crash looms large

Anthropic’s model shutdown just handed India’s sovereign AI movement its strongest argument yet

Built to assist, not replace: inside Intercall’s real-time AI for professional interpreters

Why Apple built a third-party AI system for Siri and then refused to show it at WWDC