Meta researchers create an AI that masters diplomacy and outwits human players


Diplomatic screenshot provided by CICERO researcher.
Zoom in / Screenshot of the online game for Diplomacyincluding a running chat dialog, introduced by a Cicero scholar.

On Tuesday, Meta AI announce Cicero development, which is considered the first artificial intelligence to achieve human-level performance in a strategy board game Diplomacy. It’s a notable feat because the game requires deep personal negotiation skills, which means Cicero has acquired a certain command of the language needed to win the game.

Even before Deep Blue beat Garry Kasparov at chess in 1997Board games were a useful scale Achievements of artificial intelligence. In 2015, another roadblock came down when AlphaGo went under Defeated Go, Mr. Lee Sedol. Both of these games follow a relatively straightforward set of analytical rules (although Go rules are usually simplified for the computer AI).

but with Diplomacy A large part of the gameplay involves social skills. Players must show empathy, use natural language, and build relationships to win – a challenging task for a computer gamer. With this in mind, Meta asked, “Can we build more effective and agile agents who can use language to negotiate, persuade, and work with people to achieve strategic goals similar to the way humans do?”

According to Meta, the answer is yes. Cicero learned her skills by playing an online version of Diplomacy on me webDiplomacy.net. Over time, he has become adept at the game, reportedly achieving “more than twice the average score” of human players and ranking in the top 10 percent of people who have played more than one game.

To create Cicero, Meta combined AI models for strategic thinking (similar to AlphaGo) and natural language processing (similar to GPT-3) and rolled them into one proxy. During each game, Cicero looks at the state of the game board and conversation history and predicts how other players will act. She formulates a plan, which she implements through a language model that can generate human-like dialogue, allowing him to coordinate with other players.

رسم تخطيطي لـ Cicero ، روبوت اللعب <em> Diplomacy </ em> Submitted by Meta.” src=”https://cdn.arstechnica.net/wp-content/uploads/2022/11/Visual_UnderTheHood-640×907.jpg” width=”640″ height=”907″ srcset=”https: //cdn.arstechnica.net/wp-content/uploads/2022/11/Visual_UnderTheHood-1280×1813.jpg 2x”/></a><figcaption class=
Zoom in / cicero block diagram, Diplomacy– Bot Run, courtesy of Meta.

Meta AI

Meta calls Cicero’s natural language skills a “manageable dialogue form”, which is where the heart of Cicero’s character lies. Like GPT-3, Cicero pulls from a large collection of internet scripts pulled from the web. To build a manageable dialogue model, we started with a factor of 2.7 billion BartIt is like a language model that was pre-trained on text from the Internet and tuned to more than 40,000 human games on webDiplomacy.net,” Writes meta.

The resulting model mastered the intricacies of a complex game. “Cicero could infer, for example, that later in the game they will need the support of a particular player, and then formulate a strategy to win that person’s favor—and even recognize the risks and opportunities that player sees from their own point of view,” Meta says.

Research Meta Cicero Back in Science under the title “Playing on the Human Level in the Game of Diplomacy by Combining Language Paradigms with Strategic Thinking”.

For broader applications, Meta suggests that Cicero’s research could “relax barriers to communication” between humans and AI, such as maintaining a long-running conversation to teach someone a new skill. Or it could run a video game where NPCs can talk just like humans, understanding the player’s motives and adapting along the way.

At the same time, this technology can be used to manipulate humans by impersonating people and deceiving them in potentially dangerous ways, depending on the context. Along these lines, Meta hopes other researchers can build on its code “responsibly,” and says it has taken steps toward detecting and removing “toxic messages in this new field,” likely referring to dialogue Cicero learned from swallowed Internet scripts. –Always a risk For large language models.

submitted meta detailed website to explain how Cicero works and also has open source code for Cicero on github. Connected Diplomacy Fans – and maybe even the rest of us – may need to pay attention.



Source link

Related Posts

Precaliga