Researchers exploit weaknesses of master game bots

September 30, 2020

UNIVERSITY PARK, Pa. -- If you've ever played an online video game, you've likely competed with a bot -- an AI-driven program that plays on behalf of a human.

Many of these bots are created using deep reinforcement learning, in which an algorithm learns to achieve a complex goal through a system of rewards. But, according to researchers in the College of Information Sciences and Technology at Penn State, game bots trained this way can be easily defeated by attackers who use deception.

To highlight this risk, the researchers designed an algorithm to train an adversarial bot, which was able to automatically discover and exploit weaknesses of master game bots driven by reinforcement learning algorithms. Their bot was then trained to defeat a world-class AI bot in the award-winning computer game StarCraft II.
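The general idea of training an adversary against a fixed, already-trained opponent can be illustrated with a toy sketch. This is not the researchers' algorithm and involves no deep learning or StarCraft II: it assumes a hypothetical "victim" bot that plays rock-paper-scissors with a fixed bias toward rock, and an adversary that learns from rewards alone (a simple epsilon-greedy estimate) which move exploits that bias. All names and numbers here are illustrative assumptions.

```python
import random

ACTIONS = ["rock", "paper", "scissors"]
# BEATS[a] is the move that a defeats.
BEATS = {"rock": "scissors", "paper": "rock", "scissors": "paper"}


def victim_policy(rng):
    """Hypothetical frozen victim: a fixed policy biased toward rock."""
    return rng.choices(ACTIONS, weights=[0.6, 0.2, 0.2])[0]


def reward(adversary_move, victim_move):
    """+1 if the adversary wins, -1 if it loses, 0 for a tie."""
    if adversary_move == victim_move:
        return 0.0
    return 1.0 if BEATS[adversary_move] == victim_move else -1.0


def train_adversary(rounds=5000, epsilon=0.1, seed=0):
    """Estimate each move's average reward against the fixed victim."""
    rng = random.Random(seed)
    value = {a: 0.0 for a in ACTIONS}   # running average reward per move
    count = {a: 0 for a in ACTIONS}     # times each move was played
    for _ in range(rounds):
        if rng.random() < epsilon:
            action = rng.choice(ACTIONS)            # explore
        else:
            action = max(ACTIONS, key=value.get)    # exploit best estimate
        r = reward(action, victim_policy(rng))
        count[action] += 1
        value[action] += (r - value[action]) / count[action]
    return value


if __name__ == "__main__":
    values = train_adversary()
    print("Learned move values:", values)
    print("Move the adversary prefers:", max(values, key=values.get))
```

The victim never updates its behavior, so the adversary only has to find a predictable flaw rather than play well in general. The attack described in the article targets far more complex agents, but it rests on the same asymmetry.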

"This is the first attack that demonstrates its effectiveness in real-world video games," said Wenbo Guo, a doctoral student studying information sciences and technology. "With the success of deep reinforcement learning in some popular games, like AlphaGo in the game Go and AlphaStar in StarCraft, more and more games are starting to use deep reinforcement learning to train their game bots."

He added, "Our work discloses the security threat of using deep reinforcement learning trained agents as game bots. It will make game developers be more careful about adopting deep reinforcement learning agents."

Guo and his research team presented their algorithm in August at Black Hat USA, a conference that is part of the most technical and relevant information security event series in the world. They also publicly released their code and a variety of adversarial AI bots.

"By using our code, researchers and white-hat hackers could train their own adversarial agents to master many -- if not all -- multi-party video games," said Xinyu Xing, assistant professor of information sciences and technology at Penn State.

Guo concluded, "More importantly, game developers could use it to discover the vulnerabilities of their game bots and take rapid action to patch those vulnerabilities."
In addition to Xing, Guo worked with Xian Wu, a doctoral student studying informatics at Penn State, and Jimmy Su, senior director of the JD Security Research Center, to develop the algorithm.

Penn State
