Stable Agent

class or_suite.agents.ambulance.stable.stableAgent(epLen)[source]

Agent that implements a stable heuristic algorithm for use with either ambulance environment

reset()[source]

clears data and call_locs which contain data on what has occurred so far in the environment

update_config()

(UNIMPLEMENTED)

pick_action(state, step)[source]

the action chosen is the most recently obesrved state of the environment

epLen

(int) number of time steps to run the experiment for

data

(int list list or float list list) a list of all the states of the environment observed so far

__init__(epLen)[source]
Parameters
  • epLen – number of time steps

  • data – all data observed so far

update_obs(obs, action, reward, newObs, timestep, info)[source]

Add observation to records.

Adds newObs, the most recently observed state, to data. Adds the most recent call arrival, found in info[‘arrival’] to call_locs.

update_policy(k)[source]

Update internal policy based upon records

Not used, because a greedy algorithm does not have a policy.