Markov, find a selection strategy which maximizes probability

Click For Summary

Homework Help Overview

The problem involves an animal foraging across three patches with varying risks of predation, probabilities of finding food, and energy gains. The objective is to determine a patch selection strategy that maximizes the probability of survival over 20 foraging periods, considering energy reserves and risks associated with each patch.

Discussion Character

  • Exploratory, Conceptual clarification, Problem interpretation

Approaches and Questions Raised

  • Participants discuss setting up the problem as a Markov Decision Problem, focusing on maximizing expected rewards based on energy levels at the end of the foraging periods. Questions arise regarding the definition of reward thresholds and the reasoning behind them.

Discussion Status

There is ongoing exploration of the problem setup, with participants providing insights into the formulation of the decision-making framework. Some guidance has been offered regarding the structure of the problem, but multiple interpretations and clarifications are still being discussed.

Contextual Notes

Participants are questioning the assumptions related to reward thresholds and the availability of reference materials, indicating potential constraints in accessing sample problems or relevant textbooks.

skaterboy1
Messages
13
Reaction score
0

Homework Statement


There are three patchs in which an animal can forage.
Patch 1: Risk of predation is 0. Probability of finding food is 0. Energy value is 0.
Patch 2: Risk of predation is 0.004. Probability of finding food is 0.4 and energy gain is 3.
patch 3: risk of predation is 0.02. Probability of finding food is 0.6 and energy gain is 5.

Foraging in any patch uses one unit of energy reserves. Energy reserves below 4 indicate death. maximum energy capacity for animal is ten units.

Solve this problem for 20 foraging periods to find a patch selection strategy which maximizes the animals probability of survival over this period.

Homework Equations



?

The Attempt at a Solution



Three patchs; N=3
States: risk, food findings and energy gain.
action: Choosing patchs 1, 2 or 3.

How do I solve this problem?
 
Last edited:
Physics news on Phys.org
You could try to set it up as a standard Markov Decision Problem, where you want to maximize the expected value of a reward function. In this case, the reward is r = 1 if energy at t = 20 is >= 5 and is 0 otherwise. You need to identify states, decisions and decision-dependent transition probabilities.

RGV
 
Ray Vickson said:
You could try to set it up as a standard Markov Decision Problem, where you want to maximize the expected value of a reward function. In this case, the reward is r = 1 if energy at t = 20 is >= 5 and is 0 otherwise. You need to identify states, decisions and decision-dependent transition probabilities.

RGV

Why is the reward r=1 if energy is >= 5 (not 4?) and why 0 otherwice?
 
Do you have any sample problems to show me?
 
skaterboy1 said:
Why is the reward r=1 if energy is >= 5 (not 4?) and why 0 otherwice?

OK, it should be r = 1 if energy >= 4 and 0 otherwise.

RGV
 
skaterboy1 said:
Do you have any sample problems to show me?

Do you not have a textbook? Are there no lecture notes? Anyway, Google is your friend: try "Markov decision problem".

RGV
 

Similar threads

  • · Replies 18 ·
Replies
18
Views
3K
  • · Replies 6 ·
Replies
6
Views
2K
Replies
29
Views
3K
  • · Replies 4 ·
Replies
4
Views
2K
Replies
1
Views
2K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 23 ·
Replies
23
Views
7K
Replies
9
Views
2K
  • · Replies 3 ·
Replies
3
Views
3K