Doubt about belief update #465
Replies: 1 comment 1 reply
-
Hey @Manavvora , sorry this slipped through the cracks and I forgot an answer.
Yes! I would encourage you to check this for yourself, by running something like this:
If you know that an action is never in the optimal policy it will help the solver to use a belief-dependent action space as you have described because the solver will not have to waste computation trying that action. But, I don't think this will have a huge affect on performance - there are many other factors that have a much more significant effect. Are you encountering poor solver performance? |
Beta Was this translation helpful? Give feedback.
-
Hi,
I am using the BasicPOMCP algorithm to solve my POMDP which I have created using the QuickPOMDP interface. I had a doubt regarding the belief update.
I have defined my transition and the observation actions in the following manner:
So, if I take actions a1 or a2, I get the observation equal to the exact value of the next state and if I take action a3, I get some error 'e' which is not a part of the state space. For example if my state space is {0,1,2,3} then e could be 4.
My question is: For a1 and a2, will the belief also be deterministic(sp)? I am asking this because theoretically, the optimal policy should not perform action a1 consecutively since it already knows the state it is in but the optimal policy I get from POMCP still does it. For now I have created a selective_actions function and added an if condition to prevent the same. But is there a better work-around for this?
Beta Was this translation helpful? Give feedback.
All reactions