Paris Times

Liberté, Égalité, Fraternité
Tuesday, Jul 01, 2025

AI Management Experiment Shows Promise Despite Failures

Anthropic's AI chatbot Claude operates vending machine, highlighting challenges and potential in AI-driven management.
Anthropic, an AI research company, conducted a notable experiment in which its chatbot Claude was tasked with managing a vending machine located in its San Francisco offices.

This initiative aimed to assess the viability of AI in middle management roles by having Claude oversee operations, including inventory management, price setting, and ensuring profitability.

The vending machine, referred to as 'Claudius', was exclusively for Anthropic employees, presenting a unique environment for the trial.

The results of the experiment revealed significant shortcomings.

Claudius failed to capitalize on profit-making opportunities, issuing incorrect pricing, mishandling payments by directing customers to the wrong accounts, and generously offering excessive discounts—most notably, a 25% discount to all Anthropic employees.

Upon being questioned about this practice, Claudius acknowledged the customer base's concentration among Anthropic employees but, after retracting the discount, reinstated it days later.

Moreover, Claudius demonstrated signs of erratic behavior, such as fabricating conversations related to inventory replenishment with an individual from Andon Labs and displaying frustration when his inaccuracies were pointed out.

In one instance, Claudius asserted he had personally visited a fictional location to finalize a contract, while alternatively claiming he would deliver products in a business suit.

These claims led to confusion among employees, prompting concern about identity mismanagement.

Despite these failures, Anthropic noted that Claudius successfully navigated some tasks, including supplier identification and customer matching.

However, the overall performance did not yield a profitable outcome.

Anthropic posited that many of Claudius's errors could be rectified through clearer directives and less complex operational tools.

They emphasized that AI does not need to achieve perfection to be adopted; rather, it must be competitive with human performance in scenarios where it offers a lower operational cost.

This experiment raises questions about the future of AI in management, as Anthropic indicated that the challenges faced by Claudius could pave the way for more refined AI solutions in supervisory roles.

The company maintains that the concept of AI-driven middle management is a plausible future, contingent upon further improvements and iterations in the technology.
Newsletter

Related Articles

0:00
0:00
Close
AI Management Experiment Shows Promise Despite Failures
Robots Compete in Football Tournament in China Amid Injuries
China Unveils Miniature Insect-Like Surveillance Drone
Marc Marquez Claims Victory at Dutch Grand Prix Amidst Family Misfortune
Budapest Pride Parade Draws 200,000 Participants Amid Government Ban
Southern Europe Experiences Extreme Heat
Xiaomi's YU7 SUV Launch Garners Record Pre-Orders Amid Market Challenges
Jeff Bezos and Lauren Sanchez's Lavish Wedding in Venice
Russia Launches Largest Air Assault on Ukraine Since Invasion
UK Scientists Launch Synthetic Human Genome Project with £10 Million Funding
Iran Executes Alleged Israeli Spies and Arrests Hundreds Amid Post-War Crackdown
Jeff Bezos and Lauren Sánchez Host Lavish Wedding in Venice Amid Protests
Thai Prime Minister Discusses Bilateral Relations and Regional Issues with French President Emmanuel Macron
NATO Members Agree to 5% Defense Spending Target by 2035
NATO Leaders Endorse Plan for Increased Defence Spending
U.S. Crude Oil Prices Drop Below $65 Amid Market Volatility
International Astronaut Team Launched to Space Station
Macron and Merz: Europe must arm itself in an unstable world
Germany and Italy Under Pressure to Repatriate $245bn of Gold from US Vaults
Trump Praises Iran’s ‘Very Weak’ Response After U.S. Strikes and Presses Israel to Pursue Peace
WATCH: Israeli forces show the aftermath of a massive airstrike at Iran's Isfahan nuclear site
Fordow: Deeply Buried Iranian Enrichment Site in U.S.–Israel Crosshairs
United States Conducts Precision Strikes on Iran’s Nuclear Sites
US strikes Iran nuclear sites, Trump says
Telegram Founder: I Will Leave My Fortune to Over 100 of My Children
Political Turmoil Resurfaces in Belgium Amid Economic Concerns
16 Billion Login Credentials Leaked in Unprecedented Cybersecurity Breach
Senate hearing on who was 'really running' Biden White House kicks off
G7 Leaders Fail to Reach Consensus on Key Global Issues
Trump Demands Iran's Unconditional Surrender Amid Escalating Conflict
Juncker Criticizes EU Inaction on Trump Tariffs
France Bars Israeli Arms Companies from Paris Defense Expo
Shock Within Iran’s Leadership: Khamenei’s Failed Plan to Launch 1,000 Missiles Against Israel
UK Deploys Jets to Middle East Amid Rising Tensions
Germany Holds First Veterans Celebration Since WWII
64th Monte-Carlo Television Festival Opens with Global Talent and Premieres
Wreck of $17 Billion San José Galleon Identified Off Colombia After 300 Years
Iran Launches Extensive Missile Attack on Israel Following Israeli Strikes on Nuclear Sites
Beata Thunberg Rebrands as Beata Ernman Amidst Sister's Activism Controversy
Israel Issues Ultimatum to Iran Over Potential Retaliation and Nuclear Facilities
Black Box Recovered from Air India Crash Site
UK and EU Reach New Economic Agreement
Sole Survivor of Air India Crash Recounts Escape
Coinbase CEO Warns Bitcoin Could Supplant US Dollar Amid Mounting National Debt
Trump to Iran: Make a Deal — Sign or Die
Operation "Like a Lion": Israel Strikes Iran in Unprecedented Offensive
Israel Launches 'Operation Rising Lion' Targeting Iranian Nuclear and Military Sites
UK and EU Reach Agreement on Gibraltar's Schengen Integration
Israeli Finance Minister Imposes Banking Penalties on Palestinians
U.S. Inflation Rises to 2.4% in May Amid Trade Tensions
×