ABOUT COSMIC LOG

Quantum fluctuations in space, science, exploration and other cosmic fields... served up regularly by MSNBC.com science editor Alan Boyle since 2002.

Alan Boyle covers the physical sciences, anthropology, technological innovation and space science and exploration for MSNBC.com. He is a winner of the AAAS Science Journalism Award, the NASW Science-in-Society Award and other honors; a contributor to "A Field Guide for Science Writers"; and a member of the board of the Council for the Advancement of Science Writing.

Check out Boyle's biography or send a message to Cosmic Log via cosmiclog@msnbc.com.



Humans beat poker bot ... barely

Posted: Wednesday, July 25, 2007 7:38 PM by Alan Boyle

The results are in from the great "Man vs. Machine" computer poker showdown in Vancouver, with the humans coming out on top by a narrow margin. But the main result of the exercise was mutual respect, on the part of the computer programmers as well as the poker pros.

The final 500-hand playoff went until past 11 p.m. PT Tuesday, and when the takes were totaled up, high-ranked poker players Phil "The Unabomber" Laak and Ali Eslami came out $570 ahead. Those results were combined with a too-close-to-call draw and a win for the University of Alberta's Polaris computer program on Monday, plus a win for the humans earlier Tuesday. That led Rutgers computer scientist Michael Littman, the showdown's arbiter, to declare Laak and Eslami the "clear winners."


Univ. of Alberta
Poker pro Phil Laak considers his hand during the
"Man vs. Machine" playoff in Vancouver.

For Laak, however, the outcome is more ambiguous. "The subtlety to the whole thing is, we won, not by a significant amount, and the bots are closing in," he told me today. "That's the true summary."

The $50,000 two-day showdown, conducted at the annual meeting of the Association for the Advancement of Artificial Intelligence, brought the kind of media hype that attended chess champion Garry Kasparov's faceoffs against IBM's Deep Blue in 1996 and 1997. Play-by-play commentaries were provided by Weblogs offered by the University of Alberta as well as the Poker Academy.

To minimize the luck of the draw, this week's games were set up so that cards were dealt to the computer vs. human in one match (say, Laak's), and the same cards were dealt to the human vs. the computer in the other match (Eslami's). Thus, the human in one game was playing the same cards that the computer was dealt in the other game.

The profits from each 500-hand match were combined to figure out who came out on top. Thus, in the final match, Laak ended the game up $110, and Eslami was up $460. There was a bonus system in place that earned the humans an extra $12,500 (U.S.) over the course of the four-match playoff. That comes on top of the $5,000 honorarium plus expenses that each player received for showing up.

"I want to emphasize that this is peanuts," said University of Alberta computer scientist Jonathan Schaeffer, a leader of the Polaris team. "These guys usually play for a helluva lot more money."

Schaeffer said Laak (a tournament player who has a celebrity connection to actress Jennifer Tilly) and Eslami (who ranks among the world's top high-stakes cash players) fully deserved the win.

"This was an incredible win-win situation," he told me today. "It's the first time we validated that we were competitive with them. I'm not going to say that we're in their league - that would be silly."

Laak was similarly sportsmanlike toward the Polaris team. "Kudos to those guys," he said.  "They solved checkers in the last month, now they're trying to solve poker. The University of Alberta must be very proud."

Both players said they felt exhausted at the finish. "It was an emotionally draining match," Eslami told me. They also felt fortunate to leave Vancouver with the win.

"I literally felt the same feeling that you would have if you beat 500 people in a tournament and won a million dollars," Laak said.

In a way, poker is a tougher challenge than chess because competitors have to make decisions based on incomplete information about the state of the play - and often pretend they know more than they do.

The humans actually played several variants of the Polaris program in the course of the match. The first and the last of the four sets were played against a variant dubbed Mr. Pink, which Laak described as a "careful, reasonable, disciplined, thoughtful player."

Then there was Agent Orange, the humans' opponent in the second set, the one that beat them so badly.

"Agent Orange was like a crazed, cocaine-driven maniac with an ax," Laak said.


Univ. of Alberta
Ali Eslami looks at a laptop screen displaying his
cards in the "Man vs. Machine" poker playoff.

Eslami had a more scientifically couched description of the differences: Mr. Pink, he said, was programmed to come close to playing the Nash equilibrium for the game of poker - that is, the strategy most likely to settle into a draw. In contrast, Agent Orange was programmed to adapt its strategy to the play of its opponents as the game went on.

Laak said Agent Orange was also tweaked for more aggressive play, essentially by "thinking" that each pot was 7 percent bigger than it really was. "This strategy puzzled us," Laak said.

After their shellacking, Laak and Eslami got their heads together and decided to take an even more focused, deliberate approach to the third set on Tuesday afternoon. The programmers, meanwhile, went to a mixed strategy, selecting three software variants as tag teams for each of the human opponents.

In the end, it was the humans who were able to adapt to the bots. The humans won the third faceoff against the tag-team bots, and went on to beat Mr. Pink in the fourth and final round.

"The computer program is tricky," Schaeffer said. "It's hard to model. Its roots are in deep algorithms. Either consciously or subconsciously, [the humans] were able to figure out something and win."

The Polaris team as well as Laak and Eslami are all looking forward to a rematch in the next few months. By that time, Polaris may well become unbeatable - at least when it comes to Heads Up Limit Hold 'Em. "If it's not done, it's so close to done it's not even funny," Laak said. (As more than one commenter has noted below, no-limit games against multiple opponents are an entirely different proposition.)

Laak said he and Eslami gave Polaris' programmers some suggestions for making the bots better. "We actually told them the way you can beat us," he said. "If you could take Agent Orange, crank him down 50 percent, then have that guy play us randomly, so that each hand would be the new Agent Orange or Mr. Pink ... that might be the thing we can't beat."

Eslami said he encouraged the programmers to focus on the adaptive approach used by Agent Orange. "I think that's going to have application in broader society," he said.

After all, the whole point of this exercise is to create more intelligent software, not just craftier poker-playing bots. Eslami said artificial adaptive behavior could lead to "software that better understands the way humans think ... instead of being competitive and trying to oppose the person's next move, being cooperative and trying to help the person's next move."

The University of Alberta's Schaeffer echoed that view: "The challenge to us is how to get computers to reason and act intelligently in the absence of complete information. Poker is a game of what we call partial information. In this case, you don't know the opponent's cards. That doesn't sound like a big deal, but since you don't know what they have, you have to deal with probabilities."

The same challenges apply to making money in the stock market, where you have only partial information about the prospects for all the companies you could invest in ... or to buying a used car, where you have to sort through incomplete and sometimes misleading information as you negotiate a deal.

"What you're doing is, you're playing a game of poker," Schaeffer said. Next-generation software could help humans play those real-life games better - and, one can hope, more fairly.

So is Schaeffer disappointed to see Polaris lose? Not at all. "I'm personally pleased that we did not win the match," he said. If the computer had won, he said there would have been all sorts of unwarranted hand-wringing about human inadequacy. In Schaeffer's view, this really is a win-win situation.

"Even if we had won the match, nobody would have claimed victory," he told me. "Not winning the match avoided a lot of misrepresentation."

For more on the world of poker, check out MSNBC's special report, "Going All In."

MAIN PAGE

Email this EMAIL THIS

Comments

Who really cares about this?  The enjoyment of live poker comes from (yes, the money) defeating another live opponent.  Making bold plays and watching the other fold or sweat it out on draws.  Who really cares about poker bots?  What is the point of this?
Yes, I should add a bit about the scientific point of all this: The aim is to create better artificial-intelligence programs that can deal with incomplete information and adjust to changing circumstances (e.g. making bold plays and sweating out what the other actors in a situation are doing). Also, to create programs that interact more naturally with humans. If a robo-competitor can learn how to wait out a human opponent, it's not such a far leap to program a computer to give guidance for, say, boardroom negotiations. Or serve as an effective receptionist. Or ... well, you can imagine what might be possible, and some of it could be pretty scary. Feel free to speculate in your follow-ups.
what?? The point of all this is so that one day bots will be able to take over your job.
The same way that folks in the 60's made fun on "Star Trek" and now carry cellphones, I am sure that, in a couple of years, mundane, non-physical chores will be handled by computers.  This is an example of such.
The point of this is very soon anyone playing online without using a bot will be at a disadvantage. If poker bots become as good as the average human player online poker will die and along with it the ever increasing popularity of poker in general.
I don't think we need to search for a point. The point is to gain knowledge, to acquire a deeper understanding of A) the structure of poker and B) the kind of problem solving that expert players engage in when playing a complex game. Are these interesting questions? Does this exercise tell us anything new about them? Those are the only relevant justifications we should demand.
Yea but did the bot know not to count its money while sittin' at the table?
These kind of advances could provide A.I. for deep space probes and other space craft (hopefully not like HAL 2000) with the ablity to deal with problems on their own without having to rely on Earth's direction.  If we can take this further, some safety and disater situations can initially dealt with by machine giving us humans a chance to fully grasp the problem
Many people play online poker.  This is very important to them.
I wanted to make sure to link to this archived article about poker bots and their effect on online poker:

Are poker bots raking online pots?
http://www.msnbc.msn.com/id/6002298/
This country is running headlong into a bad science fiction movie. Our statesmen are simply not. They are greedy business men that will replace you with a robot as soon as they  can. Billionaires prefer subjects to citizens. Hey, lets elect them to rule!
Come on, Al.  That article is almost 3 years old.  How about doing an article on what online sites do to combat bots (and cheats in general) now.  That article seems to just want to scare people into thinking playing online will just get them cheated.  

This story about the humans versus the bot is great news.  It's a step in the direction of establishing poker as a game of skill like chess.  Hopefully soon those in power will realize this so that online poker can be regulated and taxed and not abominable
A.I. not just for space applications, S.B., but even closer to home, say, underwater exploration or perhaps failsafes for airliners where crew members are injured.  There are many applications where a computer that is able to adapt to situations without human intervention could be beneficial.  And no, it does not necessarily mean doing mundane tasks or taking over a human's job.
I wonder how the software would perform if it were playing against multiple players via a human proxy arranged so that no one knew the person was being directed by a computer?
Jason has a good point - what happens to the program if there are nine or ten players with different stack sizes and increasing blinds?   The " pro-human" doesn't have to take chips from the computer but can do very well stealing chips from the other i.e. "donkey" humans.      
This is just the normal way things will go from now on. And I do think that we need to fear the future. Just not yet.
Bottom line is this is LIMIT poker, a basically empirical, statistic driven and unimaginative form of poker that pros can play in their sleep.  Put a computer in a a six handed no-limit limit game with pros or the worst of amateurs and I guarantee you it would bust out.
The unfortunate thing about internet chess is you never know if your opponent is cheating.  A computer is far better and far faster than 99.9999% of human chess players.  Poker is the last bastion where computers are not, and it makes for interesting online play.  It will be a sad day when this program becomes more widely available.  The cheating will become rampant.      
And I for one welcome our new silicon overlords.
This was a very interesting event held.  Although it really doesn't have much relevance to high stakes poker.  That is a game where you play-the-player as much as you play the cards dealt to you.  The only future I see for the bots is playing on-line poker where no one is physically sitting across the table from you.
the math is abstract, chess and poker are abstract too, the nature is not abstract is real, so to explain the universe is not only about numbers is about something else, that we dont understand yet. the question is the brain is better than the computer, the response is yes. the compurer is a needed create for the human being. that field-technology- do not appear in the reality , look like that we use this knowlodge as a tool for interpretation -easy- our reality, that is very far from the pure reality.
Online poker isnt as vunerable as you might think, because of the inherent set patterns in any computer progam, its easy for poker websites to differentiate between a human and a bot.  Since every regulated site has rules against bots, and they all have safegaurds to detect bots, they wouldnt last long and the player would be blacklisted from all regulated sites.
Don,

Poker is not the last bastion.  There are many games where humans are far superior to computers.  Computers can't think "outside the box" and many games don't have simple solutions (like poker, but complicated by incomplete information) or can't be solved (like checkers) or semi-solved (like chess) because the computation involved is astronomical.  The canonical example is Go:

http://en.wikipedia.org/wiki/Go_%28board_game%29
I guess the old poker line of "you don't play the cards, you play the man," no longer applies.   Computers could not care less about winning or losing a hand of poker.  Men (and women) do.

What reward is there in defeating a few million lines of computer code?  Humanity shrunk a little bit as a result of this contest.
Didn't the Terminator start like this? Poker bot with AI gone wild.....
Stop bellyaching!  At least you'll never have to wait in a cafeteria line or bathroom line behind one of these computer pokers!
Cheating is already rampant - chess, scrabble (literati), boggle, etc.  I would be shocked if criminal enterprises (mafia, gangs, as well as individuals) have not already infiltrated and profited from online betting.

Security and law have not kept pace with technology and demand on the net.  These are much more complicated issues to resolve than mere technology - and in the legal arena, they can't even get the simple stuff right.

We tolerate all sorts of abuse from miscreants.  It takes a minuscule number of abusers to screw things up at first, but as more are able to profit with impunity from unethical behavior, the betas follow them.  And so we have a proliferation spammers, scammers, virus writers, cheaters - with every minor step forward given great press and little about the large steps backward or the tremendous speed with which the train of technology is leaving the station before us.

Hmm, where exactly is it that you guys are playing online poker in the US for $?
"Poker is the last bastion where computers are not, and it makes for interesting online play.  It will be a sad day when this program becomes more widely available.  The cheating will become rampant."

If bots could be made to play on internet sites it would be bad for net-poker.

Online poker sites could probably prevent users from using bots, with some sort of Turing Test that would lead to players being banned and have penalties built into the user agreements that forfeit their winnings.

But there would be little to prevent the sites themselves from using computerized "shills" to play on behalf of the house. Poker is an attractive gambling alternative for some because the house is seen as a neutral party, unlike other casino games where the house has the statistical edge, just merely takes a cut for providing the venue to play. But if it's known (or suspected) the house has bots playing the tables, that all goes away.
I understand what the programmers are trying to-do. However poker is a game for humans only. It's what makes the game so special. Draw for draw you’re trying to determine if the guy is buffing by the way he presents him/her self. Is the computer only going to play good cards all the time? Or will try to bluff the other human player? You have to have human element for this game and for any one else to say it's not isn't a poker player.
Ali Eslami here, one of the players in Vancouver.

Online poker isn't ruined. First off, these bots aren't available to the public, but even if they were, they are very limited in what they can do. They can ONLY play heads=up limit poker, and that is a very limited domain.  With more players added to the mix, bots are nowhere near competeing with good players.

I will admit though, I did have a hard time with these guys in the heads-up match. They are beatable, but tough in their domain. But remember, they're not exploiting. Even though they're tough TO BEAT, they similarly aren't doing their best to beat YOU. So basically, in order to minimize their vulnerability they aren't exploiting your play. This makes them very bad candidates for bots to rake in the money online. In the long run they'll probably get raked into oblivion except against weak opponents (and against those opponents humans would clean up faster anyway by pursuing an exploitative strategy).

-Ali Eslami
Why on Earth is the guy on the photo wearing sunglasses? is he afraid the bot is going to call his bluff if his eyes are exposed?? ;-)
By the way, in response to those of you wondering why this matters:

First off it has clear implications in poker, and that in and of itself is plenty enough to justify why the research exists. Remember, poker is a very widespread game, played by millions of people around the world. It is as widespread as many mainstream sports.

But even suppose poker was a very small niche activity. Still, poker research has many implications for other domains and fields. If the researchers were able to create learning methods that can model opponents, those same techniques can be applied to other domains. For example, bots may be developed that  can figure out likely moves of an opposing general on a battlefield, based on past tendancies.

Even more exciting than this for me personally, is the idea of cooperative programs and user interfaces. If you can figure out what a human is likely to do next, then instead of opposing that "move" you can cooperate with it by, for example, providing resources up to the person that matches what they are going to "need next".

Basically, if computer programs can become more empathetic and understand the "human element" better, than the possibilities are limitless (both positive and negative)

-Ali Eslami (one of the players in the Vancouver match)
indigoae@hotmail.com
Do these computers play strip poker?
Why on Earth is the guy on the photo wearing sunglasses? is he afraid the bot is going to call his bluff if his eyes are exposed?? ;-)

- It is his trademark attire, red pill junkie. :)



Hmm...so this is cutting edge in the AI field right now? I would have thought that getting a computer to make decisions based on incomplete information has been the goal of most AI research to date.

Does have some sinister implications though, say the life support computer runs the numbers and decides you only have a 20% chance of making it and pulls the plug on you.

Wake me up when the android butlers get to version 2.0  
thanks
First of all this was a heads up limit game which is very structured. Playing no limit against a full table would have yielded completly different results. (I suspect the "pros" would crush the computer). Also for all of the people commenting that poker bots will ruin on-line poker don't you realize that the best way to improve your skills is to play against higher competition ? How do you know you aren't already up against the likes of a Hellmuth or Bruson while playing on-line. Lastly I think most average players would love the chance to play against a top caliber professional (I surely would/I wouldn't expect to win but would love the challange). If you feel these computer "bots" are far superior and would ruin on-line poker then what excatly is the difference between playing against them or playing against a live professional on-line ?
I dont see how a computer cant beat people at poker. Our current generation of poker programs must suck if you mean to tell me a 3.0 ghz computer cant beat a player at poker. There isnt anything involved in poker other then guessing what the other player has and most of the entire game is simply statistics which computers i would imagine have a slight advantage at over any person.
As for bots online the only thing stopping bots online from destroying players is simply the overhead of web based programming. All the constant postbacks and the screen scraping kills the cpu time which then is wasted.  Minus the screen scraping for cards, and the constant postback web crap a cpu can beat a person in poker even the best of players in any poker domain. I refuse to believe otherwise any game where most of it comes down to math a computer if programmed by a good programmer will win. Online i dont think bots are anything to worry about screen scraping alone would waste most of a cpus resources. Unless someone very clever gets a bot net and with distributed computing or p2p can get groups of computers to all perform parts in parallel but that would be too complex for most people.
I don't get it, bots can't even do chip tricks.
To Joe a couple comments up - you honestly do not know what you are talking about. a) A bot is not a web based program.  b) Creating an image of a window is not CPU intensive at all.  c) Determining what the cards are from a predefined patern and color for the suit (using a four color deck) for two hole cards and five board cards takes less CPU than taking the screen shot - both combined could be down a thousand times over before you can blink once. -- On a side note, building an online bot that is very good (in limit or no-limit) really is not that hard.  This project was working on AI, building a good bot doesnt require extensive AI.  It requires instead a complete set of predetermined instructions that are controlled based on the players the bot is playing.  One could sit on an online poker site for six months just tracking all play, creating a database of every single thing done by every single player.  Then the bot would just need to reference what that it "probably" means when a specific player does a specific thing when that specific player is in that betting position, has that many chips - and after the previous things had happened involving that specific player in the previous hands.  (specific player could be substitued for by a composite if there is not enough data available on that player) Yes - a modern PC can handle that with CPU cycles left over to do anything else you might normally do on a computer.  The biggest problems any poker player (even pro's) have is bordem, letting go, and getting to cute.  A bot has none of those problems.  All that said - there is no way possible to make an unbeatable bot in no-limit poker - just like there is no unbeatable human.  A bot is going to call an all in (as would a human) a flop of AQQ with when its holding AA or QQ - either of which can be a losing call.
QFT at Zach.

the most basic ABC poker based on patterns and algorithms could be very easily coded to beat the low and medium stakes poker.

however, humans could also do this, perhaps better.. so i don't really see the fuss. if you're seriously afraid of losing to bots, i will hazard a guess that you are a losing player yourself
My husband, brother in law and father in law are OBSESSED with on-line poker, sure they win some and lose some. But when they're in a hand that only one card can beat, thats the card that comes on the flop.  They call it bad beats, I call it "the computer knew"
Personally, I think they are insane that they think they can beat the computer.  But as long as they enjoy it as a hobby I dont mind, its when he starts dipping into savings to put into his account.
Me I use WinPokerbot , it's a free and working robot playing texas Hold'em Limit Online.
I win a lot of money with this bot, it's very nice.
It works on Play Money and Real money so you can try it before playing with real money.
See here: http://www.winpokerbot.com/en/home.html
Maybe we should allow this in elementary schools or even US Congress. Reasoning & THINKING!!! a winning pair of novel ideas that just might save this country. Of course, "2 terms and OUT" would also do wonders too.
There are many new poker bots on the market which applied Artificial Intelligence in their gameplay.

An example 'Smart PB' from
http://www.pokerbot-smart.com/
i need the poker bot if there's an emergency and you still playing tournaments i think that's the advantage of having a bot well that's my opinion...


SEND A COMMENT

PLEASE READ: All comments must be approved before appearing in the thread; time and space constraints prevent all comments from appearing. We will only approve comments that are directly related to the blog, use appropriate language and are not attacking the comments of others.

Message (please, no HTML tags. Web addresses will be hyperlinked):

TRACKBACKS

Trackbacks are links to weblogs that reference this post. Like comments, trackbacks do not appear until approved by us. The trackback URL for this post is: http://cosmiclog.msnbc.msn.com/trackback.aspx?PostID=289607

Latest Tech & Science News

Syndicate This Site

Add Cosmic Log to your news reader:
live.com xml
myyahoo msn
bloglines newsgator
google