API Documentation

Docstrings for interface members can be accessed through Julia's built-in documentation system or in the list below.

Index

CounterfactualRegret.CFRSolver
CounterfactualRegret.CSCFRSolver
CounterfactualRegret.CallbackChain
CounterfactualRegret.ESCFRSolver
CounterfactualRegret.ExpectedValueBaseline
CounterfactualRegret.ExploitabilityCallback
CounterfactualRegret.Games.Kuhn
CounterfactualRegret.Games.MatrixGame
CounterfactualRegret.OSCFRSolver
CounterfactualRegret.Throttle
CounterfactualRegret.ZeroBaseline
CounterfactualRegret.actions
CounterfactualRegret.chance_action
CounterfactualRegret.chance_actions
CounterfactualRegret.evaluate
CounterfactualRegret.exploitability
CounterfactualRegret.histtype
CounterfactualRegret.infokey
CounterfactualRegret.infokeytype
CounterfactualRegret.initialhist
CounterfactualRegret.isterminal
CounterfactualRegret.next_hist
CounterfactualRegret.observation
CounterfactualRegret.player
CounterfactualRegret.players
CounterfactualRegret.strategy
CounterfactualRegret.train!
CounterfactualRegret.utility
CounterfactualRegret.vectorized_hist
CounterfactualRegret.vectorized_info

Game Functions

CounterfactualRegret.infokeytype — Function

infokeytype(g::Game)

Returns information key type for game g

source

CounterfactualRegret.histtype — Function

histtype(g::Game)

Returns history type for game g

source

CounterfactualRegret.initialhist — Function

initialhist(game::Game)

Return initial history with which to start the game

source

CounterfactualRegret.isterminal — Function

isterminal(game::Game, h)

Returns a boolean indicating whether the current history h is terminal,

i.e. h ∈ Z, where Z is the set of terminal histories

source

CounterfactualRegret.utility — Function

utility(game::Game, i::Int, h)

Returns utility of some history h for some player i

source

CounterfactualRegret.player — Function

player(game::Game{H,K}, h::H)

Returns the integer id of the player whose turn it is at history h:

0 - Chance Player
1 - Player 1
2 - Player 2

If converting an IIE game to a matrix game, you also need to implement: player(game::Game{H,K}, k::K)

source

CounterfactualRegret.chance_action — Function

chance_action(game::Game, h)

Return randomly sampled action from chance player at a given history

source

CounterfactualRegret.chance_actions — Function

chance_actions(game::Game, h)

Return all chance actions available for chance player at history h

source

CounterfactualRegret.next_hist — Function

next_hist(game::Game, h, a)

Given a history h and an action a, returns the next history: h′ = next_hist(game, h, a)

source

CounterfactualRegret.infokey — Function

infokey(game::Game, h)

Returns unique identifier corresponding to some information set

infokey(game, h1) == infokey(game, h2) ⟺ h1 and h2 belong to the same info set

(key must be immutable as it's being stored as a key in a dictionary)

source
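The immutability note above matters because Julia's Dict stores a reference to the key rather than a copy. The following minimal sketch uses only Base Julia (no package functions) and hypothetical variable names to show how a mutable key can orphan a stored entry, while an immutable Tuple key cannot.

```julia
# Tuples are immutable, so a tuple key always hashes the same way.
strategies = Dict{Tuple{Int,Int},Vector{Float64}}()
strategies[(1, 2)] = [0.5, 0.5]
tuple_found = haskey(strategies, (1, 2))   # true

# A Vector key can be mutated after insertion, which breaks lookups.
fragile = Dict{Vector{Int},Vector{Float64}}()
key = [1, 2]
fragile[key] = [0.5, 0.5]
push!(key, 3)                              # mutates the key stored in the Dict
old_found = haskey(fragile, [1, 2])        # false: stored key no longer equals [1, 2]
```

Integers, Symbols, Tuples of such values, and immutable structs are safe choices for an information key.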

CounterfactualRegret.actions — Function

actions(game::Game, k)

Returns all actions available at some information state given by key k (See infokey)

source

CounterfactualRegret.players — Function

players(game)

Returns number of players in game (excluding chance player)

source

CounterfactualRegret.observation — Function

observation(game, h, a, h′)

Used for tree building. Returns the information given to the acting player at history h

source

CounterfactualRegret.vectorized_info — Function

vectorized_info(game::Game{H,K}, key::K) where {H,K}

For converting information state representation to vector. Default behavior returns unmodified information state.

source

CounterfactualRegret.vectorized_hist — Function

vectorized_hist(game::Game{H}, h::H) where H

For converting history representation to vector. Default behavior returns unmodified history.

source
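The game functions above can be illustrated with a small sketch of matching pennies written as a two-step extensive-form game. To stay runnable without the package installed, the block declares a local stand-in for the Game{H,K} abstract type and defines the functions locally; with the real package you would presumably import and extend its functions instead, and the exact set of functions each solver requires is not confirmed here. MatchingPennies, PennyHist and play_first_actions are hypothetical names.

```julia
# Local stand-in for the package's abstract type (assumption, see lead-in).
abstract type Game{H,K} end

# A history is the tuple of actions taken so far; an info key is the player id.
const PennyHist = Tuple{Vararg{Int}}
struct MatchingPennies <: Game{PennyHist,Int} end

histtype(::MatchingPennies)    = PennyHist
infokeytype(::MatchingPennies) = Int
initialhist(::MatchingPennies) = ()
isterminal(::MatchingPennies, h)   = length(h) == 2
players(::MatchingPennies)         = 2
player(::MatchingPennies, h)       = length(h) + 1   # no chance player (id 0) here
next_hist(::MatchingPennies, h, a) = (h..., a)
# Player 2 does not see player 1's move, so (1,) and (2,) share one key.
infokey(::MatchingPennies, h)      = length(h) + 1
actions(::MatchingPennies, k)      = 1:2             # 1 = heads, 2 = tails

function utility(::MatchingPennies, i::Int, h)
    u1 = h[1] == h[2] ? 1.0 : -1.0   # player 1 wins when the coins match
    return i == 1 ? u1 : -u1
end

# Walk the tree from the initial history, always taking the first action.
function play_first_actions(game)
    h = initialhist(game)
    while !isterminal(game, h)
        a = first(actions(game, infokey(game, h)))
        h = next_hist(game, h, a)
    end
    return h
end

game  = MatchingPennies()
h_end = play_first_actions(game)   # (1, 1)
u1    = utility(game, 1, h_end)    # 1.0
```

Using the player id as the information key works here only because each player has a single decision point; larger games need keys that encode everything the acting player has observed.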

Solvers

CounterfactualRegret.train! — Function

train!(sol::AbstractCFRSolver, n; cb=()->(), show_progress=false)

Trains a CFR solver for n iterations, with an optional callback cb and an optional progress bar enabled via show_progress

source

CounterfactualRegret.strategy — Function

strategy(solver, k)

Returns the current strategy of solver for information key k

If the solver has been sufficiently trained (see train!), this should be close to a Nash equilibrium strategy in two-player zero-sum games.

source

CounterfactualRegret.CFRSolver — Type

CFRSolver(game; method=Vanilla())

Instantiate vanilla CFR solver with some game.

source

CounterfactualRegret.CSCFRSolver — Type

CSCFRSolver(game; debug=false, method=Vanilla())

Instantiate chance sampling CFR solver with some game.

source

CounterfactualRegret.ESCFRSolver — Type

ESCFRSolver(game::Game; method::Symbol=:vanilla, alpha::Float64 = 1.0, beta::Float64 = 1.0, gamma::Float64 = 1.0, d::Int)

Instantiate external sampling CFR solver with some game.

Samples a single action for every player other than the traversing player on each tree traversal, and explores all actions of the traversing player. Time to complete a traversal is O(|𝒜ᵢ|ᵈ), where d is the depth of the game and |𝒜ᵢ| is the size of the action space for the traversing player.

source

CounterfactualRegret.OSCFRSolver — Type

OSCFRSolver(game; method=Vanilla(), baseline=ZeroBaseline(), ϵ::Float64 = 0.6)

Instantiate outcome sampling CFR solver with some game.

Samples a single action for every player on each tree traversal. Time to complete a traversal is O(d), where d is the depth of the game.

ϵ - exploration parameter

Available baselines:

ZeroBaseline - Equivalent to no baseline
ExpectedValueBaseline

source

Games

CounterfactualRegret.Games.MatrixGame — Type

Matrix game of arbitrary dimensionality

Defaults to 2-player zero-sum rock-paper-scissors

NOTE: N>2 player general-sum games have ill-defined convergence properties for counterfactual regret solvers

source

CounterfactualRegret.Games.Kuhn — Type

Kuhn Poker

"Kuhn poker is an extremely simplified form of poker developed by Harold W. Kuhn as a simple model zero-sum two-player imperfect-information game, amenable to a complete game-theoretic analysis. In Kuhn poker, the deck includes only three playing cards, for example a King, Queen, and Jack. One card is dealt to each player, which may place bets similarly to a standard poker. If both players bet or both players pass, the player with the higher card wins, otherwise, the betting player wins."

https://en.wikipedia.org/wiki/Kuhn_poker

source

Extras

CounterfactualRegret.ExploitabilityCallback — Type

ExploitabilityCallback(sol::AbstractCFRSolver, n=1; p=1)

sol : CFR solver whose exploitability is tracked
n : Frequency with which to query exploitability, e.g. n=10 checks exploitability every 10 CFR iterations
p : Player whose exploitability is being measured

Usage:

using CounterfactualRegret
const CFR = CounterfactualRegret

game = CFR.Games.Kuhn()
sol = CFRSolver(game)
train!(sol, 10_000, cb=ExploitabilityCallback(sol))

source

CounterfactualRegret.Throttle — Type

Wraps a function, causing it to trigger every n CFR iterations

test_cb = Throttle(() -> println("test"), 100)

Above example will print "test" every 100 CFR iterations

source
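The throttling behaviour described above can be sketched as a callable struct that counts its own invocations. This is a self-contained illustration, not the package's implementation: EveryN is a hypothetical stand-in, and whether the real Throttle fires on the first call or on the n-th is an assumption (this sketch fires on the n-th).

```julia
# Hypothetical stand-in for a throttling wrapper (not the package's Throttle).
mutable struct EveryN{F}
    f::F         # wrapped zero-argument function
    n::Int       # fire once every n calls
    count::Int   # number of calls seen so far
end
EveryN(f, n::Int) = EveryN(f, n, 0)

# Making the struct callable lets it be passed wherever a callback is expected.
function (t::EveryN)()
    t.count += 1
    if t.count % t.n == 0
        t.f()
    end
    return nothing
end

hits = Ref(0)
cb = EveryN(() -> hits[] += 1, 100)
for _ in 1:1_000
    cb()          # stands in for one CFR iteration invoking the callback
end
fired = hits[]    # 10
```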

CounterfactualRegret.CallbackChain — Type

Chain together multiple callbacks

Usage:

using CounterfactualRegret
const CFR = CounterfactualRegret


game = CFR.Games.Kuhn()
sol = CFRSolver(game)
exp_cb = ExploitabilityCallback(sol)
test_cb = Throttle(() -> println("test"), 100)
train!(sol, 10_000, cb=CFR.CallbackChain(exp_cb, test_cb))

source
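Chaining can be pictured as a callable that holds several zero-argument callbacks and invokes each in order. The sketch below is self-contained and uses a hypothetical Chain type; it is not the package's CallbackChain, and the invocation order is an assumption.

```julia
# Hypothetical stand-in for a callback chain (not the package's CallbackChain).
struct Chain{T<:Tuple}
    callbacks::T
end
Chain(cbs...) = Chain(cbs)   # collect the varargs into a tuple

# Calling the chain calls every stored callback, left to right.
function (c::Chain)()
    foreach(cb -> cb(), c.callbacks)
    return nothing
end

events = String[]
chain = Chain(() -> push!(events, "first"), () -> push!(events, "second"))
chain()
chain()
# events == ["first", "second", "first", "second"]
```

Storing the callbacks in a tuple rather than a Vector keeps each callback's concrete type visible to the compiler.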

CounterfactualRegret.exploitability — Function

exploitability(sol::AbstractCFRSolver, p::Int=1)

Calculates exploitability of player p given strategy specified by solver sol

source

CounterfactualRegret.evaluate — Function

evaluate(solver::AbstractCFRSolver)

Evaluate full tree traversed by CFR solver.

Returns tuple corresponding to game values for players given the strategies provided by the solver.

source

CounterfactualRegret.ExpectedValueBaseline — Type

Expected Value Baseline (Schmid 2018)

Uses aggregated counterfactual value estimates from previous runs as a baseline. The "learning rate" or exponential decay rate for learning the baseline is given by parameter α.

The stored action values for some information key k are retrieved by calling (b::ExpectedValueBaseline{K})(k, l), where l is the length of the action space at the given information state represented by k.

source
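One common way to realise an exponentially decayed estimate is an exponential moving average per action. The sketch below is an assumption about the general idea, not the package's actual update rule: EMABaseline and update! are hypothetical names, and only the call form b(k, l) mirrors the retrieval described above.

```julia
# Hypothetical baseline store keyed by information key (illustrative only).
struct EMABaseline{K}
    α::Float64
    values::Dict{K,Vector{Float64}}
end
EMABaseline{K}(α) where {K} = EMABaseline{K}(α, Dict{K,Vector{Float64}}())

# Retrieve the stored action values for key k, creating zeros(l) on first use.
(b::EMABaseline)(k, l) = get!(() -> zeros(l), b.values, k)

# Blend a newly observed value v for action a into the stored estimate.
function update!(b::EMABaseline, k, a::Int, v::Float64, l::Int)
    vals = b(k, l)
    vals[a] = (1 - b.α) * vals[a] + b.α * v
    return vals
end

b = EMABaseline{Int}(0.5)
initial = copy(b(1, 2))             # [0.0, 0.0]
update!(b, 1, 2, 4.0, 2)            # stored values become [0.0, 2.0]
after_two = update!(b, 1, 2, 4.0, 2)[2]   # 3.0
```

With α close to 1 the baseline tracks the most recent estimate; with α close to 0 it changes slowly.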

CounterfactualRegret.ZeroBaseline — Type

Default static baseline of 0 - equivalent to not using a baseline

source
