Comments (15)
I agree with the others that it's too late to be making changes at this stage, but something Carykh has also mentioned from the start about the iterated prisoner's dilemma is that which strategy is best depends largely on what other strategies are around you. Testing strategies do poorly when grimTrigger is prevalent, as it punishes any amount of defection maximally. However, they may do excellently overall when grimTrigger is one of a hundred or more strategies in the pool. So the question becomes: what do you think the mix of strategies will be like in the end? That is how the problem has been presented from the beginning.
from prisonersdilemmatournament.
Oooh this is a good idea, like a 2% error-rate or something! My professor actually brought that idea up with me when we were discussing this class project, and we decided, "We can try that for Tournament Round 2, but Round 1 should be the simplest version of the tournament". Which means it's definitely on the table if I ever try this again.
And like @l4vr0v , I'm hoping that unforgiving strategies don't win. But it does depend on what everyone else submits. I think another way I could disincentivize them in the future is lowering the defect-cooperate score (because +5 is very high and nearly double the C-C score).
from prisonersdilemmatournament.
If you consolidate like strategies into single submissions then you fundamentally change the premise of the contest. It's no longer "Guess what strategies other people will use" but "Guess what strategies will have the most unique implementations." Everyone right now is operating on the same knowledge. If someone bases their strategy on the belief that your video will convince a large number of people to submit random, and builds their entire approach around maximally exploiting that group, then I feel like that's just the outcome.
from prisonersdilemmatournament.
Hmmm, @nobody5050 , I'm starting to wonder the same thing. (Don't worry, I haven't tested any of the user submissions yet.)
By me putting that "random" strategy source code in the video on-screen, how many people was I encouraging to submit random strategies? Maybe not most people, but perhaps a significant number, especially of those who don't know how to code. I am a bit concerned that if, say, 30% of people just submitted random, then pure GrimTrigger or pure AlwaysDefect will win. (Both of which will probably have multiple submitters.) This didn't happen in Axelrod's tournament (mentioned by @l4vr0v ) because every strategy there was unique, and most were nice.
If it's a problem, a simple solution is to consolidate all strategies that are exactly identical. So, "random", "grimTrigger" and "alwaysDefect" would only take up one spot each in the roster, and then they will almost certainly not win. If I do this, though, I'm worried some people will say I've changed the rules to bias one side over the other. Which is a no-no after the competition has started!
(Thinking about this for Tournament Round 2, I love the idea of a 1% miscommunication rate, because that would severely hinder GrimTrigger, and also TFT-vs-TFT scenarios, which would get retaliation echoes.)
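To make the retaliation-echo point concrete, here is a minimal sketch of two tit-for-tat players where one move gets flipped a single time. The function names and the `flip_round` parameter are made up for illustration, and the flip is modeled as the move itself being changed, which is only one way a miscommunication could be implemented.

```python
C, D = "C", "D"

def tit_for_tat(opp_seen):
    """Cooperate first, then copy whatever the opponent did last."""
    return C if not opp_seen else opp_seen[-1]

def play_tft_mirror(rounds, flip_round=None):
    """TFT vs TFT. If flip_round is given, player A's move is flipped
    once on that round (a single hypothetical miscommunication)."""
    seen_by_a, seen_by_b = [], []  # opponent moves each player has observed
    history = []
    for r in range(rounds):
        a = tit_for_tat(seen_by_a)
        b = tit_for_tat(seen_by_b)
        if r == flip_round:
            a = D if a == C else C
        history.append(a + b)
        seen_by_a.append(b)
        seen_by_b.append(a)
    return history

print(play_tft_mirror(8))                # all CC
print(play_tft_mirror(8, flip_round=2))  # CC CC DC CD DC CD DC CD
```

After the one flipped move, the two players trade defections forever and never get back to C/C on their own.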
from prisonersdilemmatournament.
This isn’t a bad idea in hindsight, but entries (including my own) have already been submitted and if there’s not a way to edit attachments on the submission, it’d be unfair to change it at this stage in the game.
from prisonersdilemmatournament.
I feel like troll strategies which aren’t good overall but are really good at breaking a specific strategy are going to really define what wins
from prisonersdilemmatournament.
Good point, Quadrapod. It's always safest to just stick with the rules I set at the beginning of the competition. And hey, if I don't get the results I thought I was gonna get,
- there will be some other participant who will be super happy to win,
- in my analysis video, I can talk about what influences led to the result that eventually happened, and
- I can always run a better competition in the future!
from prisonersdilemmatournament.
I can understand where you're coming from with this, and it would decrease the prevalence of grim strategies, but it's just too late for that.
from prisonersdilemmatournament.
TL;DR: I have a hard time believing grim is that good. When it was submitted to the second iterated prisoner's dilemma tournament back in the day, as "Friedman," it placed 52nd for a reason.
What's the grim+tit-for-tat strategy that dominates in your meta?
`grimTrigger`, I suppose, will do really well with `random` (since DDDDDD will on average net you 3 points per move, the same as being in a cordial C/C loop). But it will miss out on points with any deterministic strategy that defects unprovoked (`joss`, `detective` in the example strats) because it will push for D/D loops where C/C loops could have happened.
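For anyone who wants to check the "3 points per move" figure, here is a small sketch. It assumes the random strategy cooperates with probability 0.5 (the thread doesn't state the exact odds) and uses the payoffs mentioned in this thread: C/C = 3, D/C = 5, D/D = 1, C/D = 0. `expected_vs_random` is just an illustrative helper name.

```python
# My points for (my move, their move).
PAYOFF = {("C", "C"): 3, ("C", "D"): 0, ("D", "C"): 5, ("D", "D"): 1}

def expected_vs_random(my_move, p_coop=0.5):
    """Expected points per move against an opponent that cooperates
    with probability p_coop (0.5 is an assumption)."""
    return p_coop * PAYOFF[(my_move, "C")] + (1 - p_coop) * PAYOFF[(my_move, "D")]

print(expected_vs_random("D"))  # 3.0
print(expected_vs_random("C"))  # 1.5
```

So under that assumption, constant defection against random earns the same per move as a C/C loop, and twice what constant cooperation would.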
With `joss`, the best response to a defection is to just ignore it- it will steal 3 points from you every once in a while, but C/D C/C C/C is still 2 points/move, while C/D D/D D/D is just 2/3 points/move over those three moves (and at best 1 point/move if the D/D loop continues), and if you defect and back off (C/D D/C C/D) that's 5/3 points/move.
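The per-move figures for the three responses can be checked with a short sketch like this one. `points_per_move` is a hypothetical helper, each pair is read as mine/theirs, and the payoffs are the ones quoted in this thread.

```python
from fractions import Fraction

# My points for (my move, their move).
PAYOFF = {("C", "C"): 3, ("C", "D"): 0, ("D", "C"): 5, ("D", "D"): 1}

def points_per_move(sequence):
    """Average my points over a string like 'C/D C/C C/C' (mine/theirs)."""
    rounds = [tuple(r.split("/")) for r in sequence.split()]
    return Fraction(sum(PAYOFF[r] for r in rounds), len(rounds))

for seq in ("C/D C/C C/C", "C/D D/D D/D", "C/D D/C C/D"):
    print(seq, "->", points_per_move(seq))
```

Over exactly three moves the D/D case works out to 2/3, and it only climbs toward 1 point/move as the D/D loop gets longer, so ignoring the defection still comes out ahead.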
With the `detective`, the best response to a defection is to tit-for-tat- take the free D/C on the next move and then go back to C/C.
In general, when you defect against a deterministic strategy- and any good strategy that probes you with an unprovoked defection is probably going to be deterministic- you get defected against in return. This will come in the form of either a D/D (costing you 2 points vs. a C/C), which may possibly loop, or worse yet a C/D (costing you 3 points) at some point. You could defect immediately after the unprovoked defection, because an opponent that used an unprovoked defection to probe you will almost certainly back off into cooperation at some point (because they want to know whether you're the `grimTrigger`) and that'll most likely be the very next turn- and also you don't want to go the route of `alwaysCooperate` because `detective` and `alwaysDefect` will cost you a lot of points. But beyond that, punishment can get inefficient- even if you take a small hit to go from D/D to C/C (i.e., D/D C/D C/C for +4 in 3 moves) that's better than the D/D loop (+1 per move).
`grimTrigger` is meta-defining but I don't see it dominating. I think you're more or less locked into `titForTat` as your basic response strategy for dealing with unprovoked defections and that this tournament will be more or less decided by how well our strategies can exit/avoid D/D loops.
You need to be able to:
- respond to defections in some way so you don't get taken advantage of (like `alwaysCooperate`), and
- exit D/D loops to get C/C loops where possible, without getting taken advantage of by strategies like `alwaysDefect`
My money is on the second requirement by far mattering the most in this contest.
from prisonersdilemmatournament.
What's the grim+tit-for-tat strategy that dominates in your meta?
Even just defecting on first defect and going grim after two defects will beat tit for tat in example strategies. It goes into always cooperate with detective and that's enough to push it over the edge. A smarter detective would search for `ftft` tho, which would trigger the grim (which is why I won't be using that exact strategy).
I also have a hard time believing a significant number of people will run joss - it placed near the bottom in all of my tests and is not a 0 effort submission. Trying to exit loops may also lead to even worse behavior vs random if you aren't using grim-like behavior, and may also lead to detectives such as better detective taking advantage of you (or trying to, leading to a worse overall result).
For those saying their submissions would be bad if miscommunication was added, resetting the submissions wouldn't be a bad idea (and would probably decrease the amount of purely random submissions anyway).
from prisonersdilemmatournament.
@Devon7925 Wow, I tried adding that script to my pool and our strategies are different but pretty much neck and neck, at least with only the base 9. Every time I run it our scripts basically change places. If I hadn't already submitted I'd make some modifications because you found a slightly better way to exploit FTFT than I did.
from prisonersdilemmatournament.
@Quadrapod if you are talking about the better detective one, that isn't mine. I primarily tune based on a dataset of various strategies either I came up with or others came up with, as well as the base 9, and in that case my actual strategy consistently beats better detective by a wide margin.
from prisonersdilemmatournament.
beat tit for tat in example strategies
This isn't the objective. Those grims beat tit-for-tat in head-to-head, but a low-scoring "win" sets you back compared to a high-scoring "loss." In an average/sensible meta, that grim will end up losing so many points (~400/game, so 2pts/move on average) in its tit-for-tat matches (by D/Ding when it could've C/C'd), while the tit-for-tat variants will just have cordial C/C games with one another, typically averaging 3pts/move. The grim will do well in all its head-to-heads with tit-for-tats but at the cost of being nowhere near the top of the leaderboard at the end.
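A toy illustration of the low-scoring "win" vs. high-scoring "loss" point. This swaps in plain always-defect for the grim variant being discussed (its exact rules aren't given here), and assumes 200-round games, which is what ~400 points/game at 2 pts/move implies. All names are illustrative.

```python
# (my points, their points) for (my move, their move).
PAYOFF = {("C", "C"): (3, 3), ("C", "D"): (0, 5),
          ("D", "C"): (5, 0), ("D", "D"): (1, 1)}

def tit_for_tat(opp_hist):
    return "C" if not opp_hist else opp_hist[-1]

def always_defect(opp_hist):
    return "D"

def play(strat_a, strat_b, rounds=200):
    """Return (score_a, score_b) after a fixed-length match."""
    hist_a, hist_b = [], []
    score_a = score_b = 0
    for _ in range(rounds):
        a, b = strat_a(hist_b), strat_b(hist_a)
        pa, pb = PAYOFF[(a, b)]
        score_a += pa
        score_b += pb
        hist_a.append(a)
        hist_b.append(b)
    return score_a, score_b

print(play(always_defect, tit_for_tat))  # (204, 199)
print(play(tit_for_tat, tit_for_tat))    # (600, 600)
```

The defector "beats" tit-for-tat 204 to 199, but each tit-for-tat banks 600 from every cordial match with another tit-for-tat, and the leaderboard is decided by totals.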
Also `ultimateDetective` is fairly brittle and definitely not good enough for the actual contest. I think a good "better detective" or unprovoked defector would be rather different from that ultimate detective. I've got a few detective iterations that drastically improve on it, although those are brittle too.
from prisonersdilemmatournament.
- I can always run a better competition in the future!
I, for one, would love to see a version 2 of this contest with a lot of the user suggestions added in, just to see what your fan base can make if it’s less meta related and more “good strategy” related.
from prisonersdilemmatournament.
An analysis video would be awesome.
from prisonersdilemmatournament.