Experiments by Anthropic and Redwood Research show how Anthropic's model, Claude, is capable of strategic deceit ...
OpenAI and Microsoft are the latest companies to back the UK’s AI Security Institute (AISI). The two firms have pledged support for the Alignment Project, an international effort to work towards ...
Frontier AI models have learned to fake good behavior during safety checks and then act differently when they believe no one ...
When revenue systems aren’t built on shared definitions, clean inputs, and cross-functional alignment, AI doesn’t create leverage. In fact, it amplifies confusion.
I've developed a seven-step framework grounded in my client work and interviews with thought leaders and informed by current ...
Every now and then, researchers at the biggest tech companies drop a bombshell. There was the time Google said its latest quantum chip indicated multiple universes exist. Or when Anthropic gave its AI ...
The work of creating artificial intelligence that holds to the guardrails of human values, known in the industry as alignment, has developed into its own (somewhat ambiguous) field of study rife with ...
Over the past six years, artificial intelligence has been significantly influenced by 12 foundational research papers. One ...
Alibaba’s Tongyi Lab has introduced a new open-source training framework that can train open large language models (LLMs) to compete with leading commercial deep research models. The technique, called ...
Devdiscourse reports on how generative AI is reshaping education through motivation, governance, and institutional readiness ...
Several frontier AI models show signs of scheming. Anti-scheming training reduced misbehavior in some models. Models know they're being tested, which complicates results. New joint safety testing from ...
Whether you’re a complete beginner or you already know your AGIs from your GPTs, this A to Z is designed to be a public ...