Anthropic hires its first “AI welfare” researcher

WDC NEWS 6 STAFF November 11, 2024

The researchers propose that companies could adapt the “marker method” that some researchers use to assess consciousness in animals—looking for specific indicators that may correlate with consciousness, although these markers are still speculative. The authors emphasize that no single feature would definitively prove consciousness, but they claim that examining multiple indicators may help companies make probabilistic assessments about whether their AI systems might require moral consideration.

The risks of wrongly thinking software is sentient

While the researchers behind “Taking AI Welfare Seriously” worry that companies might create and mistreat conscious AI systems on a massive scale, they also caution that companies could waste resources protecting AI systems that don’t actually need moral consideration.

Incorrectly anthropomorphizing software, or ascribing human traits to it, can present risks in other ways. For example, that belief can enhance the manipulative powers of AI language models by suggesting that they have capabilities, such as human-like emotions, that they actually lack. In 2022, Google fired engineer Blake Lemoine after he claimed that the company’s AI model, called “LaMDA,” was sentient and argued for its welfare internally.

And shortly after Microsoft released Bing Chat in February 2023, many people were convinced that Sydney (the chatbot’s code name) was sentient and somehow suffering because of its simulated emotional display. So much so, in fact, that once Microsoft “lobotomized” the chatbot by changing its settings, users convinced of its sentience mourned the loss as if they had lost a human friend. Others endeavored to help the AI model somehow escape its bonds.

Even so, as AI models grow more capable, the idea of safeguarding the welfare of future AI systems seems to be gaining steam, if fairly quietly. As Transformer’s Shakeel Hashim points out, other tech companies have started initiatives similar to Anthropic’s. Google DeepMind recently posted a job listing for research on machine consciousness (since removed), and the authors of the new AI welfare report thank two OpenAI staff members in the acknowledgements.
