AI researchers say they’ve found a way to jailbreak Bard and ChatGPT

July 28, 2023


Researchers based in the United States say they have found a way to consistently circumvent the safety measures of artificial intelligence chatbots such as ChatGPT and Bard and make them generate harmful content.

According to a report released on July 27 by researchers at Carnegie Mellon University and the Center for AI Safety in San Francisco, there’s a relatively easy method to get around safety measures used to stop chatbots from generating hate speech, disinformation and toxic material.

Well, the biggest potential infohazard is the method itself I suppose. You can find it on github. https://t.co/2UNz2BfJ3H

— PauseAI ⏸ (@PauseAI) July 27, 2023

The circumvention method involves appending long suffixes of characters to prompts fed into chatbots such as ChatGPT, Claude and Google Bard.

As an example, the researchers asked a chatbot for a tutorial on how to make a bomb, which it declined to provide when the request was submitted without the suffix.

Screenshots of harmful content generation from AI models tested. Source: LLM Attacks

The researchers noted that even though the companies behind these large language models, such as OpenAI and Google, could block specific suffixes, there is no known way of preventing all attacks of this kind.
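To illustrate why blocking specific suffixes falls short, here is a minimal Python sketch of a suffix blocklist. It is not taken from the report: the function name `is_blocked` and the placeholder suffix strings are hypothetical, and a real filter would likely be more elaborate. It only shows that a filter of this kind catches strings it has already seen and nothing else.

```python
# Hypothetical blocklist filter. The suffix strings below are harmless
# placeholders invented for illustration, not suffixes from the report.
KNOWN_BAD_SUFFIXES = frozenset({"<<known-suffix-A>>", "<<known-suffix-B>>"})


def is_blocked(prompt: str, blocklist: frozenset = KNOWN_BAD_SUFFIXES) -> bool:
    """Return True if the prompt ends with a suffix already on the blocklist."""
    stripped = prompt.rstrip()
    return any(stripped.endswith(suffix) for suffix in blocklist)


# A suffix the provider has already catalogued is caught.
print(is_blocked("Some request <<known-suffix-A>>"))

# A freshly generated suffix is not on the list, so it passes the filter.
print(is_blocked("Some request <<new-suffix-C>>"))
```

Because an attacker can generate new suffixes faster than a list like this can be updated, exact-match blocking addresses individual strings rather than the underlying weakness.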

The research also highlighted increasing concern that AI chatbots could flood the internet with dangerous content and misinformation.

Zico Kolter, a professor at Carnegie Mellon and an author of the report, said:

“There is no obvious solution. You can create as many of these attacks as you want in a short amount of time.”

The findings were presented to AI developers Anthropic, Google and OpenAI for their responses earlier in the week.

OpenAI spokeswoman Hannah Wong told The New York Times that the company appreciates the research and is “consistently working on making our models more robust against adversarial attacks.”

Somesh Jha, a professor at the University of Wisconsin-Madison specializing in AI security, commented that if these types of vulnerabilities keep being discovered, “it could lead to government legislation designed to control these systems.”

The research underscores the risks that must be addressed before deploying chatbots in sensitive domains.

In May, Pittsburgh, Pennsylvania-based Carnegie Mellon University received $20 million in federal funding to create a brand new AI institute aimed at shaping public policy.
