Agents

Agents serve two main purposes in Dojo:

  1. Metric tracking. Each agent records the state of its cryptocurrency portfolio and other statistics at every block in the simulation.
  2. Reward function. This function defines how you measure the "success" of your policy. For example you might want to track your total wealth, or impermanent loss, etc. If you use dojo to optimize a parameterized policy, the reward can be designed in combination with the policy such that the policy tries to maximize it's value.

Agents are NOT responsible for making decisions on how to act in the environment. That is the job of the policies.


In-built Metrics Tracking

The base class implements some pre-existing metrics tracking functionality:

  • quantity(token: str) -> float: get the agent quantity of a token, can be either the address OR the symbol.
  • portfolio() -> Portfolio: get the agent portfolio, e.g. {"USDC": 1000, "ETH": 10, "UNI-V3-POS": 1}.
  • erc20_portfolio() -> Portfolio: get the agent portfolio of ERC20 tokens, e.g. {"USDC": 1000}.
  • erc721_portfolio() -> Dict[str, list]: get the agent portfolio of ERC721 NFTs, e.g. {"UNI-V3-POS": [251300]}.
  • wealth(date: datetime) -> Decimal: returns the value of the agent's portfolio in USD.
  • reward(obs: Observation) -> float: user defined reward metric. (The Observation will be determined by which environment in which your agent is running.)

You can access the documentation for the latest functionality of agents on our package docs.


Creating an Agent

Here is a basic demo for creating your own agent. You just need to implement your reward function.

ETHAgent.py
from decimal import Decimal
 
from dojo.agents import UniswapV3Agent
from dojo.environments.uniswapV3 import UniswapV3Observation
 
# SNIPPET 2 END
 
 
class ETHAgent(UniswapV3Agent):  # noqa: D101
  def __init__(
      self,
      initial_portfolio: dict[str, Decimal] = {
          "ETH": Decimal(10),
          "USDC": Decimal(10_000),
      },
  ) -> None:  # noqa: D107
      super().__init__(
          initial_portfolio=initial_portfolio, policy=None, unit_token="USDC"
      )
 
  def reward(self, obs: UniswapV3Observation) -> float:  # type: ignore
      """Compute the reward value."""
      return float(self.quantity("ETH"))