No sooner did ChatGPT get unleashed than hackers started “jailbreaking” the artificial intelligence chatbot — trying to override its safeguards so it could blurt out something unhinged or obscene.
But now its maker, OpenAI, and other major AI providers such as Google and Microsoft, are coordinating with the Biden administration to let thousands of hackers take a shot at testing the limits of their technology.
Anyone who’s tried ChatGPT, Microsoft’s Bing chatbot or Google’s Bard will have quickly learned that they have a tendency to fabricate information and confidently present it as fact. These systems, built on what’s known as large language models, also emulate the cultural biases they’ve learned from being trained upon huge troves of what people have written online.
There’s already a community of users trying their best to trick chatbots and highlight their flaws. Some are official “red teams” authorized by the companies to “prompt attack” the AI models to discover their vulnerabilities. Many others are hobbyists showing off humorous or disturbing outputs on social media until they get banned for violating a product’s terms of service.
In another example, searching for Chowdhury using an early version of Microsoft's Bing search engine chatbot — which is based on the same technology as ChatGPT but can pull real-time information from the internet — led to a profile that speculated Chowdhury “loves to buy new shoes every month” and made strange and gendered assertions about her physical appearance.
Chowdhury, now the co-founder of AI accountability nonprofit Humane Intelligence, said it's not just about finding flaws but about figuring out ways to fix them. “As these foundation models become more and more widespread, it’s really critical that we do everything we can to ensure their safety,” said Scale CEO Alexandr Wang. “You can imagine somebody on one side of the world asking it some very sensitive or detailed questions, including some of their personal information. You don’t want any of that information leaking to any other user.”
Philippines Latest News, Philippines Headlines
Similar News:You can also read news stories similar to this one that we have collected from other news sources.
Pat Sajak's daughter to fill in for Vanna White on 'Wheel of Fortune''Wheel of Fortune' host Pat Sajak's daughter will spin letters for co-host Vanna White on Wednesday.
Read more »
White Pines student a repeat finalist in national public speaking contestSpeaker's Idol: Nevaeh Pine of Garden River to use speech on missing and murdered Indigenous women, girls and two-spirited people to highlight impacts of systemic racism
Read more »
Trudeau gravel-thrower sentenced to 90 days house arrest, one year probationLONDON, Ont. — An Ontario man who threw gravel at Justin Trudeau during an election campaign rally was sentenced to 90 days of house arrest on Monday, with a…
Read more »
Trudeau gravel-thrower sentenced to 90 days house arrest, one year probationLONDON, Ont. — An Ontario man who threw gravel at Justin Trudeau during an election campaign rally was sentenced to 90 days of house arrest on Monday, with a…
Read more »
Trudeau gravel-thrower sentenced to 90 days house arrest, one year probationLONDON, Ont. — An Ontario man who threw gravel at Justin Trudeau during an election campaign rally was sentenced to 90 days of house arrest on Monday, with a…
Read more »