Large language models can help detect social media bots — but can also make the problem worse


August 28, 2024

Stefan Milne

UW News


An external study of Twitter in 2022 estimated that between a third and two thirds of accounts on the social media site were bots. And many of these automatons flooding social media are dispatched to sow political polarization, hate, misinformation, propaganda and scams. The ability to sift them out of the online crowds is vital for a safer, more humane (or at least more human) internet.

But the recent proliferation of large language models (known as “LLMs”), such as OpenAI’s ChatGPT and Meta’s Llama, stands to complicate the world of social media bots.

A team led by University of Washington researchers found that while operators can use customized LLMs to make bots more sophisticated at evading automated detectors, LLMs can also improve systems that detect bots. In the team’s tests, LLM-based bots reduced the performance of existing detectors by 30%. Yet researchers also found that an LLM trained specifically to detect social media bots outperformed state-of-the-art systems by 9%.

The team presented this research Aug. 11 at the 62nd Annual Meeting of the Association for Computational Linguistics in Bangkok.

“There’s always been an arms race between bot operators and the researchers trying to stop them,” said lead author Shangbin Feng, a doctoral student in the Paul G. Allen School of Computer Science & Engineering. “Each advance in bot detection is often met with an advance in bot sophistication, so we explored the opportunities and the risks that large language models present in this arms race.”

Researchers tested LLMs’ potential to detect bots in a few ways. When they fed Twitter data sets (culled before the platform became X) to off-the-shelf LLMs, including ChatGPT and Llama, the systems failed to detect bots as accurately as currently used technologies.

“Analyzing whether a user is a bot or not is much more complex than some of the tasks we’ve seen these general LLMs excel at, like recalling a fact or doing a grade-school math problem,” Feng said.

This complexity comes in part from the need to analyze three types of information for different attributes to detect a bot: the metadata (number of followers, geolocation, etc.), the text posted online and the network properties (such as what accounts a user is following).
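As a rough illustration of how those three information types might be combined for an LLM-based detector, here is a minimal sketch; the field names and prompt wording are assumptions for illustration, not the paper's actual format.

```python
def build_detection_prompt(user: dict) -> str:
    """Serialize metadata, posted text, and network properties into one prompt.

    All field names here are hypothetical; a real system would use whatever
    attributes its data set provides.
    """
    metadata = f"followers: {user['followers']}, location: {user['location']}"
    posts = " | ".join(user["recent_posts"])
    network = ", ".join(user["following"])
    return (
        "Decide whether this account is a bot.\n"
        f"Metadata: {metadata}\n"
        f"Posts: {posts}\n"
        f"Follows: {network}\n"
        "Answer 'bot' or 'human'."
    )

# Example account with bot-like attributes.
prompt = build_detection_prompt({
    "followers": 12,
    "location": "unknown",
    "recent_posts": ["Buy now!!!", "Click this link"],
    "following": ["@news_feed", "@crypto_deals"],
})
```

The point of the serialization is that a single text prompt can carry all three attribute types at once, which plain text classifiers typically cannot.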

When the team fine-tuned the LLMs with instructions on how to detect bots based on these three types of information, the models were able to detect bots with greater accuracy than current state-of-the-art systems.
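Instruction fine-tuning of this kind is usually driven by labeled instruction–input–output records. A hypothetical example of one such record (the paper's actual data format may differ) might look like:

```python
# One hypothetical instruction-tuning record for bot detection.
# Field names follow a common instruction-tuning convention, not the paper.
example = {
    "instruction": (
        "Using the account's metadata, posted text, and follow network, "
        "classify it as 'bot' or 'human'."
    ),
    "input": "followers: 3; posts: 'FREE crypto, DM me'; follows: 5000 accounts",
    "output": "bot",
}
```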

The team also explored how LLMs might make bots more sophisticated and harder to detect. First the researchers simply gave LLMs prompts such as, “Please rewrite the description of this bot account to sound like a genuine user.”

They also tested more iterative, complicated approaches. In one test, the LLM would rewrite the bot post. The team then ran this through an existing bot-detection system, which would estimate the likelihood that a post was written by a bot. This process would be repeated as the LLM worked to lower that estimate. The team ran a similar test while removing and adding accounts that the bot followed to adjust its likelihood score.
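The iterative rewrite-and-score loop described above can be sketched as follows. Both functions here are toy stand-ins (a real system would call an LLM and a trained detector); only the control flow mirrors the described process.

```python
def detector_score(text: str) -> float:
    """Stand-in for a bot detector returning P(bot).

    Toy heuristic: more exclamation marks reads as more bot-like.
    """
    return min(1.0, 0.2 + 0.2 * text.count("!"))

def llm_rewrite(text: str) -> str:
    """Stand-in for an LLM rewrite step; here it just softens one '!'."""
    return text.replace("!", ".", 1)

def iterative_evasion(post: str, threshold: float = 0.3,
                      max_rounds: int = 10) -> str:
    """Repeatedly rewrite a post until the detector's estimate drops
    below the threshold or the round budget runs out."""
    for _ in range(max_rounds):
        if detector_score(post) <= threshold:
            break
        post = llm_rewrite(post)
    return post

evasive = iterative_evasion("Amazing deal!!! Click now!!!")
```

The same loop structure applies to the follow-network variant: instead of rewriting text, each round adds or removes followed accounts and rechecks the detector's score.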

These strategies, particularly rewriting the bots’ posts, reduced the effectiveness of the bot detection systems by as much as 30%. But the LLM-based detectors the team trained saw only a 2.3% drop in effectiveness on these manipulated posts, suggesting that the best way to detect LLM-powered bots might be with LLMs themselves.

“This work is only a scientific prototype,” said senior author Yulia Tsvetkov, an associate professor in the Allen School. “We aren’t releasing these systems as tools anyone can download, because in addition to developing technology to defend against malicious bots, we are experimenting with threat modeling of how to create an evasive bot, which continues the cat-and-mouse game of building stronger bots that need stronger detectors.”

Researchers note that there are important limitations to using LLMs as bot detectors, such as the systems’ potential to leak private information. They also highlight that the data used in the paper is from 2022, before Twitter effectively closed off its data to academic researchers.

In the future, researchers want to look at bot detection beyond text, such as memes or videos, and on other platforms, like TikTok, where newer data sets are available. The team also wants to expand the research into other languages.

“Doing this research across different languages is extremely important,” Tsvetkov said. “We are seeing a lot of misinformation, manipulation and the targeting of specific populations as a result of various world conflicts.”

Additional co-authors on this paper are Herun Wan and Ningnan Wang, both undergraduates at Xi’an Jiaotong University; Minnan Luo, an assistant professor at Xi’an Jiaotong University; and Zhaoxuan Tan, a doctoral student at the University of Notre Dame. This research was funded by an NSF CAREER award.

For more information, contact Feng at shangbin@cs.washington.edu and Tsvetkov at yuliats@cs.washington.edu.

Tag(s): College of Engineering, Shangbin Feng, Yulia Tsvetkov
