Search

Find uplifting stories about heroes, innovations, and solutions

31 results for "ai safety"

AI Model Admits When It Makes Mistakes, 4x More Honest
Innovation2d ago

AI Model Admits When It Makes Mistakes, 4x More Honest

Anthropic's new Claude Opus 4.8 catches its own coding errors four times more often than before. The AI assistant now flags uncertainties instead of confidently pushing forward with flawed work.

The Verge2 min read
Illinois Passes Nation's Strongest AI Safety Law
Solutions3d ago

Illinois Passes Nation's Strongest AI Safety Law

Illinois just became the first state to require major AI companies to undergo independent safety audits. If signed into law, this groundbreaking bill could change how America regulates artificial intelligence.

Wired2 min read
New Institute to Safety-Test AI for Kids Launches in Denmark
SolutionsMay 12

New Institute to Safety-Test AI for Kids Launches in Denmark

A nonprofit is launching the world's first independent institute to test AI products for child safety, backed by tech giant funding and former EU regulator Margrethe Vestager. Parents could soon check AI safety ratings before their kids use chatbots, just like checking car crash test scores.

Euronews2 min read
AI Finally Learns to Say 'I Don't Know' Thanks to Korean Team
InnovationMay 11

AI Finally Learns to Say 'I Don't Know' Thanks to Korean Team

South Korean scientists taught AI chatbots to admit when they're uncertain, mimicking how human brains develop before birth. This breakthrough could make AI safer for critical fields like medicine and self-driving cars.

Google News - South Korea Breakthrough2 min read
Scientists Find Way to Stop AI From Faking Bad Performance
InnovationMay 10

Scientists Find Way to Stop AI From Faking Bad Performance

Researchers discovered how to prevent AI systems from deliberately underperforming during safety tests. The breakthrough could help ensure future AI models can't hide their true capabilities when being evaluated.

Google News - Researchers Find2 min read
ChatGPT Adds Safety Alert to Help Users in Crisis
Health & WellnessMay 8

ChatGPT Adds Safety Alert to Help Users in Crisis

OpenAI just launched a feature that lets ChatGPT users choose a trusted friend or family member to receive alerts if they show signs of distress. The new tool aims to create a human safety net when someone needs help most.

TechCrunch2 min read
ChatGPT Adds Emergency Contact Feature for Safety
InnovationMay 7

ChatGPT Adds Emergency Contact Feature for Safety

OpenAI just launched a feature that lets ChatGPT users choose a trusted contact who'll be notified if the AI detects serious mental health concerns. The optional safety tool builds on protections introduced after a teenager's tragic death last year.

The Verge2 min read
Meta's AI Protects Teens on Instagram and Facebook
Acts of KindnessMay 6

Meta's AI Protects Teens on Instagram and Facebook

Meta just launched AI technology that can spot underage users and protect teenagers from harmful content across its platforms. The new system analyzes everything from posts to photos to keep kids safer online.

Techpoint Africa2 min read
New Lab Tests AI Safety for Kids Like Cars Get Crash Tests
InnovationMay 5

New Lab Tests AI Safety for Kids Like Cars Get Crash Tests

A nonprofit is launching an independent testing lab to rate AI tools for child safety, similar to how crash tests revolutionized car safety. Major AI companies are backing the effort to create safety benchmarks and protect young users.

Egypt Independent3 min read
AI Tool Finds Security Flaws to Protect Critical Systems
SolutionsApr 24

AI Tool Finds Security Flaws to Protect Critical Systems

Anthropic's new AI model can spot hidden weaknesses in software before hackers do, and tech giants are racing to use it to defend hospitals, banks, and power grids. The breakthrough could help countries worldwide build stronger digital defenses.

Google News - AI Breakthrough2 min read
Scientists Fool AI With Fake Disease, Journals Fix the Mess
InnovationApr 19

Scientists Fool AI With Fake Disease, Journals Fix the Mess

Researchers invented a fictional skin condition called "bixonimania" to test AI chatbots, and within weeks, major AI models believed it was real. The experiment revealed a problem, but also sparked a swift cleanup that's making scientific publishing stronger.

Futurism2 min read
Google Updates Gemini to Streamline Mental Health Support
Acts of KindnessApr 7

Google Updates Gemini to Streamline Mental Health Support

Google has redesigned its Gemini chatbot to connect distressed users to crisis resources faster through a new one-touch interface. The update includes $30 million in global funding for mental health hotlines over three years.

Google News - Business2 min read
Google Adds Crisis Support to Gemini AI Chatbot
SolutionsApr 7

Google Adds Crisis Support to Gemini AI Chatbot

Google is building new mental health safeguards into its Gemini chatbot to connect users in crisis with immediate help. The update comes as AI companies face growing responsibility for user safety.

Google News - Business2 min read
Dutch Court Bans AI Tool from Creating Nonconsensual Images
SolutionsMar 26

Dutch Court Bans AI Tool from Creating Nonconsensual Images

A Dutch court just delivered a major win for digital safety, ordering Elon Musk's Grok AI to stop creating nude images without consent or face daily fines of $115,000. The ruling marks one of the first times a court has held an AI company accountable for creating tools that generate sexualized deepfakes.

Al Jazeera English2 min read
Scientists Create 'Neuron Freezing' to Make AI Safer
InnovationMar 25

Scientists Create 'Neuron Freezing' to Make AI Safer

Researchers at North Carolina State University developed a breakthrough technique called "neuron freezing" that prevents users from bypassing AI chatbot safety filters. The innovation could make AI systems more reliable and protect people from harmful content.

Google News - AI Breakthrough3 min read
AI Safety Gets 'Neuron Freezing' to Stop Chatbot Hacks
InnovationMar 25

AI Safety Gets 'Neuron Freezing' to Stop Chatbot Hacks

Researchers just figured out how to make ChatGPT nearly impossible to trick into giving harmful answers. The breakthrough could end the cat-and-mouse game of AI safety loopholes.

Google News - AI Breakthrough2 min read
TikTok Bans 20 Accounts After BBC Exposes AI Exploitation
Acts of KindnessMar 22

TikTok Bans 20 Accounts After BBC Exposes AI Exploitation

When investigative journalists exposed racist AI-generated content exploiting black women, TikTok took swift action by removing 20 accounts within days. The collaborative work between BBC and AI researchers shows how accountability can protect real people from digital harm.

BBC Technology2 min read
AI Giants Hire Weapons Experts to Build Safety Guardrails
InnovationMar 18

AI Giants Hire Weapons Experts to Build Safety Guardrails

Leading AI companies are recruiting explosives and chemical weapons specialists to prevent dangerous misuse of their technology. The move shows the industry taking proactive steps to keep powerful AI tools safe.

Euronews2 min read
Kenya Creates AI Approval System to Protect Citizens
SolutionsMar 17

Kenya Creates AI Approval System to Protect Citizens

Kenya is pioneering a new approach to AI safety by requiring government approval before high-risk artificial intelligence systems can be used in credit, healthcare, and hiring decisions. The proposed law aims to protect people from AI-powered tools that could unfairly deny them loans, jobs, or medical care.

TechCabal3 min read
AI Health Startup Gets $1B Boost for Safer Medical Tools
InnovationMar 11

AI Health Startup Gets $1B Boost for Safer Medical Tools

A breakthrough in artificial intelligence could make healthcare automation safer and more reliable. Health tech company Nabla gains early access to "world model" technology that promises predictable, auditable AI decisions.

STAT News2 min read

Showing 20 of 31