Character.AI institutes new safety measures for AI chatbot conversations

Character.AI has rolled out new safety features and policies for building and interacting with the AI-powered virtual personalities it hosts. The new measures aim to make the platform safer for all users, but particularly younger people. The update includes more control over how minors engage with the AI chatbot, more content moderation, and better detection of the AI discussing topics like self-harm.

Though not cited in the blog post about the update, Character AI linked to the announcement in a post on X expressing condolences to the family of a 14-year-old who spent months interacting with one of Character.AI’s chatbots before taking his own life. His family has now filed a lawsuit against Character.AI for wrongful death, citing a lack of safeguards for the AI chatbots as a contributor to his suicide.

More stories

Microsoft exec promises big changes for Windows 11 will come ‘this month and every month this year’ â but don’t get too excited
December 31, 1969

NYT Strands hints and answers for Thursday, January 22 (game #690)
December 31, 1969

Intel Core Ultra 9 285K overclocked to 7.5GHz smashes a bunch of records â and of course AI was involved (it had to be, right?)
October 25, 2024

Survey says most believe generative AI is conscious, which may prove it’s good at making us hallucinate, too
July 16, 2024

We are heartbroken by the tragic loss of one of our users and want to express our deepest condolences to the family. As a company, we take the safety of our users very seriously and we are continuing to add new safety features that you can read about here:…October 23, 2024

AI chat guardrails

Character AI’s post laid out several new safety features for the platform. For instance, if the model detects keywords related to suicide or self-harm, it will display a pop-up urging the user to the National Suicide Prevention Lifeline and related resources. The AI will also be better at spotting and removing inappropriate content in a conversation, with a particular sensitivity to when users are under 18.

Presumably, minors would already have restricted content in conversations, but Character.AI may have upped that sensitivity further. In cases where that might not be enough, entire chatbots have been removed.

“We conduct proactive detection and moderation of user-created Characters, including using industry-standard and custom blocklists that are regularly updated. We proactively, and in response to user reports, remove Characters that violate our Terms of Service,” Character.AI explained in its post. “Users may notice that we’ve recently removed a group of Characters that have been flagged as violative, and these will be added to our custom blocklists moving forward.”

Other new features are more about helping ground users. So, you’ll see a notification when you have spent an hour on the platform asking if you want to keep going as a way of helping make sure you don’t lose track of time. You’ll also see more prominent disclaimers emphasizing that the AI is not a real person. There are already such disclaimers in the conversations, but Character.AI wants to make it impossible to ignore.

These safety features are the flipside of how Character.AI has made engaging with chatbots feel more like talking to a real person, including voices and the two-way voice conversations available with the Character Calls feature. Still, the company is likely keen to ensure its services are as safe as possible, and its moves could inform how others in the space shape their own AI chatbot characters.

Character.AI institutes new safety measures for AI chatbot conversations

Microsoft exec promises big changes for Windows 11 will come ‘this month and every month this year’ â but don’t get too excited

NYT Strands hints and answers for Thursday, January 22 (game #690)

Intel Core Ultra 9 285K overclocked to 7.5GHz smashes a bunch of records â and of course AI was involved (it had to be, right?)

Survey says most believe generative AI is conscious, which may prove it’s good at making us hallucinate, too

AI chat guardrails

You Might Also Like…

‘The only worry is for your budget’: Microsoft briefly says 32GB of RAM is the ‘no worries’ level for gaming on Windows 11, but users fear getting priced out

Quordle hints and answers for Sunday, May 3 (game #1560)

NYT Strands hints and answers for Sunday, May 3 (game #791)

8 new 4K Blu-rays from April 2026 to add to your collection

I canât believe Iâve found another RGB gadget for my home, but this flashy turntable is funky, affordable, automatic, and comes with some really neat touches

The S&P 500 and Nasdaq kept their record rallies going. Here are 3 key takeaways

‘The only worry is for your budget’: Microsoft briefly says 32GB of RAM is the ‘no worries’ level for gaming on Windows 11, but users fear getting priced out

NYT Strands hints and answers for Sunday, May 3 (game #791)

Quordle hints and answers for Sunday, May 3 (game #1560)