Stable Agent

class or_suite.agents.ambulance.stable.stableAgent(epLen)[source]

Agent that implements a stable heuristic algorithm for use with either ambulance environment

reset()[source]: clears data and call_locs which contain data on what has occurred so far in the environment

pick_action(state, step)[source]: the action chosen is the most recently obesrved state of the environment

data: (int list list or float list list) a list of all the states of the environment observed so far

__init__(epLen)[source]

Parameters

update_obs(obs, action, reward, newObs, timestep, info)[source]

Add observation to records.

Adds newObs, the most recently observed state, to data. Adds the most recent call arrival, found in info[‘arrival’] to call_locs.

update_policy(k)[source]

Update internal policy based upon records

Not used, because a greedy algorithm does not have a policy.