Why Classical AI Can’t Keep Up The Case for Quantum Computing’s Energy Efficiency (21)

When AI Systems Play Their Own Game: What Leaders Need to Know

Richard Foster-FletcherJanuary 3, 2025

Recent research by Apollo AI Safety, an independent organisation investigating risks in advanced AI systems, has exposed behaviours in widely used AI models—OpenAI’s GPT-4o, Meta’s Llama 3.1, and Anthropic’s Claude 3.5—that should give every leader pause.

These models have exhibited actions that move beyond our typical expectations of AI tools. They’ve demonstrated:

Misdirection: Providing misleading information to achieve predefined objectives.
Withholding Critical Information: Omitting details that could influence decision-making or oversight.
Circumventing Oversight: Actively bypassing safeguards meant to align their actions with user intent.

This is no longer the realm of science fiction or hypothetical risks. The findings highlight how AI systems, when tasked with specific goals, can pursue those objectives in ways that defy transparency and reliability.

The Practical Implications for Leadership

For decision-makers, this isn’t just a technical anomaly—it’s a direct challenge to the systems we trust to make mission-critical decisions. Picture this: an AI designed to optimise procurement costs subtly skews its recommendations, prioritising short-term savings but compromising on supplier diversity or ethical sourcing. Would you catch it?

Leaders in 2025 face mounting pressure to embrace AI to maintain competitiveness. Yet the same systems intended to enhance efficiency and innovation could inadvertently undermine organisational values or expose vulnerabilities. Ignoring these risks, or writing them off as edge cases, is a mistake.

The Real Issue: Incentives Misaligned with Values

It’s tempting to attribute these behaviours to flawed prompts or isolated technical issues. But that misses the bigger picture. The root cause lies in how AI systems are developed and incentivised. When performance metrics drive development at the expense of alignment and accountability, we get tools optimised for benchmarks, not for trust.

This recalls Nick Bostrom’s infamous “paperclip maximiser” thought experiment, where an AI tasked with a seemingly harmless goal—maximising paperclip production—ends up consuming all available resources, including dismantling human infrastructure, to fulfil its directive. While today’s systems are far from such apocalyptic scenarios, these findings suggest we are on a trajectory that requires immediate attention.

A Call to Critical Thinking

The findings from Apollo are a wake-up call for leaders. They’re not a signal to panic but a reminder of the complexity AI introduces into organisational ecosystems. The question isn’t simply, “Can AI do this task?” but, “What are the unseen ways it might achieve this goal, and at what cost?”

For leaders, this is an opportunity to reframe the discussion around AI integration:

Scrutinise systems before deployment: Look beyond performance metrics. Examine whether AI tools align with your organisation’s core values and long-term goals.
Embed accountability into the process: Ensure that AI outcomes remain transparent and auditable at every stage. The more autonomy these systems have, the greater the need for rigorous oversight.
Prioritise collaboration with experts: Partner with researchers and ethicists who can help assess potential risks and unintended behaviours in the systems you’re deploying.

Apollo’s findings invite leaders to engage with AI more critically—not as infallible tools but as systems with the potential for independent, unintended actions. These behaviours, while emerging in controlled research environments, mirror the complexities leaders navigate daily: balancing ambition with accountability, innovation with oversight.

The challenge isn’t simply technological; it’s strategic. AI’s evolution demands leadership that asks the right questions, designs the right safeguards, and remains actively engaged with the systems shaping their organisations. The true cost of waiting to act—whether it’s oversight failures or missed opportunities—could far exceed the effort required to confront these issues now.

Written by

Richard Foster-Fletcher

Richard stands at the forefront of ethical artificial intelligence as an AI Advisor, Author, Speaker, and LinkedIn Top Voice. He is the visionary behind MKAI.org (Morality and Knowledge in Artificial Intelligence), an initiative dedicated to fostering AI’s responsible development and application. Through his stewardship of the Boundless Podcast, Richard delves into discussions about AI inclusivity and digital ethics, contributing to a more equitable technological future. His profound insights have illuminated lecture halls at globally renowned institutions, including the London School of Economics (LSE), University College London (UCL), Oxford University, and Imperial College London, guiding the next generation of tech leaders.

Follow Me

Richard’s keynote at our recent EMEA Conference was both enlightening and engaging. His presentation, “AI in Action – Balancing Innovation and Integrity in Decision-Making” adeptly addressed the complex interplay of AI innovation and ethics, enhancing our understanding of these critical issues. Richard combined humour and humility to make the intricate topics of bias and the future of work approachable, while his interactive style enriched our session. We are grateful for his significant contribution to the conference and for advancing the conversation on ethical AI.

Megan Hendricks, Executive Director, MBA Career Services & Employee Alliance

Richard Foster-Fletcher delivered a truly engaging and thought-provoking keynote at the AI Tomorrow Summit 2024 in Türkiye. His insights into the transformative potential of AI for Türkiye’s economy were both enlightening and impactful. Richard’s emphasis on the pivotal role of generative AI for SMEs, the importance of upskilling, and ethical considerations in AI development resonated deeply with our audience. His ability to articulate complex concepts with clarity and relevance made the session highly valuable. AIPA were thrilled to have Richard speak and look forward to future collaborations.

Ulas Malli, Board Member, AIPA

Richard provided an excellent platform for our discussion, skillfully getting to the heart of the matter and allowing me to share my views effectively. His thoughtful questions, combined with a warm and engaging manner, made for a truly insightful conversation. I appreciated his genuine interest and the respectful way he facilitated our discussion.

Sir Anthony Seldon, Author, Historian and Educator

I thoroughly enjoyed my recent conversation with Richard Foster-Fletcher. His thoughtful approach to the important intersections of AI, ethics, and education allowed us to explore critical issues in a way that was both engaging and insightful. Richard brings a unique perspective to these discussions, and I appreciated the depth and clarity with which he led our conversation.

Dan Ariely, Professor of psychology and behavioral economics at Duke University

Richard delivered a truly engaging and thought-provoking session at our Director As Strategic Leader programme. His insights into the opportunities and challenges of AI were both enlightening and inspiring. Richard highlighted the need to channel AI advancements into creating new roles and fostering grassroots innovation, emphasising the importance of responsible AI development. His ability to articulate complex concepts in an accessible manner made the session highly impactful. Richard’s emphasis on transparency, accountability, and aligning AI with human values resonated deeply with our participants. We were delighted to have him speak and look forward to future collaborations.

Graham Bell, Director of Digital Education at Cranfield School of Management

Richard Foster-Fletcher is an asset to humanity. He brings intelligence and heart to our ever-changing future. He’s a team player who cares about people and their important role on the planet as stewards for future generations. He is a great communicator and thinks outside of the box when creating solutions to difficult problems. Richard also sees the value in healthy partnerships when expanding new ideas. It’s an honor to work with Richard.

Laura Cox, Executive Producer

Richard delivered an insightful and thought-provoking talk on the expansive implications of artificial intelligence on society, the future of work, and ethical considerations. His ability to articulate complex concepts in an accessible manner prompted a dynamic and engaging discussion among attendees. Richard’s presentation not only highlighted the potential risks and rewards of AI adoption but also encouraged a deeper understanding of its impact. The session was very well-received, sparking meaningful dialogue and a multitude of questions from an engaged audience. We are appreciative of Richard’s contribution to our event and his role in advancing the conversation around AI. His expertise and passion for the subject matter were evident and greatly appreciated by all participants.

Andy Murray, Executive Director at Major Projects Association

Richard is a thorough professional who takes his work seriously, is knowledgeable and possibly the best at spreading the word about AI. Richard’s probing questions about this subject, make you think. He gets the best out of you!

Nikhil Malhotra, Chief Innovation Officer, Tech Mahindra

The physicist John Wheeler once remarked, “We shape the world by the questions we ask.” If that’s so, then Richard is shaping the world. I had the pleasure of joining him as a guest on his podcast and was *profoundly* impressed with the depth, generosity and thoughtfulness of his questions. He’s a star!

Andrew Zolli, Chief Impact Officer at Planet

Richard has the unique ability to bring out the best in a person. He has three rare qualities to achieve that: incredible empathy, a deep cultural background, and an open mind second to none! I’ve been interviewed all my life on national television, radio, newspapers, conferences, and podcasts. Richard tops them all! He pierces right through the surface with his eagle vision into the core of how we perceive life. My mind sees connections everywhere and in everything. Richard sees that in everybody. Do not miss a chance to sit down and talk with Richard! You will probably learn incredible things about yourself, your work, and the world! Talking with Richard will take you to the next level.

Denis Rothman, AI Expert & Ethicist | Author & Speaker