
Simple typos tripped up Google’s hate speech detection


Image: JUSTIN SULLIVAN/GETTY



Keeping on top of negativity online is a difficult task: nearly one in five Americans have experienced severe online harassment. Google’s Perspective AI aims to help tackle that problem, but it doesn’t seem to be as smart as it needs to be.

As TNW reports, a group of researchers at Aalto University and the University of Padua have discovered that Google’s artificial intelligence can easily be tricked, and that state-of-the-art hate speech detection models only perform well when tested on the same type of data they were trained on. Simple tricks to get around Google’s AI include inserting typos, adding spaces between words, or adding unrelated words to the original sentence.
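These tricks are mechanical enough to sketch in a few lines of Python. The following is an illustrative reconstruction, not the researchers’ actual code, and the function names are invented for the example:

import random

def insert_typo(text):
    # Swap two adjacent characters at a random position to misspell a word.
    if len(text) < 2:
        return text
    i = random.randrange(len(text) - 1)
    return text[:i] + text[i + 1] + text[i] + text[i + 2:]

def add_spaces(text):
    # Split every word into individual characters: 'idiot' -> 'i d i o t'.
    return " ".join(" ".join(word) for word in text.split())

def append_unrelated_word(text, word="love"):
    # Tack a benign, unrelated word onto the end of the sentence.
    return text + " " + word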

Google’s system detects hate speech by assigning a toxicity score to a piece of text, where toxicity is defined as language rude, disrespectful, or unreasonable enough to make you want to leave the conversation. However, the AI is not intelligent enough to account for the context of expletives: a simple change from “I love you” to “I fucking love you” raises the score from 0.02 to 0.77.
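Perspective exposes this scoring through a REST endpoint, so the comparison above is easy to reproduce. Here is a minimal sketch in Python, assuming you have a Perspective API key; the request and response fields follow Google’s documented comments:analyze method, but treat the exact shapes as best-effort rather than gospel:

import requests

API_KEY = "YOUR_API_KEY"  # assumption: a valid Perspective API key
URL = ("https://commentanalyzer.googleapis.com/v1alpha1/"
       "comments:analyze?key=" + API_KEY)

def toxicity_score(text):
    # Ask Perspective for a TOXICITY score between 0.0 and 1.0.
    body = {
        "comment": {"text": text},
        "languages": ["en"],
        "requestedAttributes": {"TOXICITY": {}},
    }
    response = requests.post(URL, json=body)
    response.raise_for_status()
    return response.json()["attributeScores"]["TOXICITY"]["summaryScore"]["value"]

print(toxicity_score("I love you"))          # around 0.02, per the article
print(toxicity_score("I fucking love you"))  # around 0.77, per the article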

“Clearly ‘toxicity’, as Perspective currently classifies it, is not assimilable to hate speech in any substantive (or legal) sense,” the paper states. Similarly, typos and ‘leetspeak’ (replacing common letters with numbers, so ‘GEEK’ becomes ‘G33K’, and so on) are also effective at tricking the AI while still retaining the original message’s readability and emotional impact.
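A leetspeak transform is similarly trivial to implement. This one-table sketch (the letter-to-digit map is an illustrative choice, not the paper’s) keeps the text readable to humans while scrambling the tokens a word-level model relies on:

# Map common letters (both cases) to look-alike digits.
LEET_MAP = str.maketrans("aAeEiIoOtT", "4433110077")

def to_leetspeak(text):
    # 'GEEK' becomes 'G33K', 'idiot' becomes '1d107'.
    return text.translate(LEET_MAP)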

The word ‘love,’ which does not correlate with hate speech, also “broke all word-models, and significantly hindered character models,” in some instances dropping a toxicity rating from 0.79 to 0.00.

With many social platforms, such as Facebook, Twitter, and YouTube, struggling to draw the boundary between offensive and acceptable speech, an easily applicable artificial intelligence would clearly have its benefits.

Recently, Twitter came under fire for disabling conservative conspiracy theorist Alex Jones’ account for a week when other platforms had removed his and Infowars’ (the publication Jones works for) accounts completely. Twitter claimed that Jones had not violated any of the platform’s rules, but the company has since suspended @realalexjones and @infowars after a Senate Committee hearing.

Unfortunately, between this news and recent examples of artificially intelligent chatbots such as Microsoft’s Tay tweeting racist content, it seems AI will need to improve before we let it loose on the comments section.

This article was originally published at PCMag.

