Calculate Total Lane Changes:
I have written reward expressions and properties to measure the total number of lane changes in the model. However, upon verification, the results consistently yield infinity, which appears to be incorrect. I would appreciate guidance on how to structure a proper reward expression to accurately capture the cumulative count of lane changes.
Calculate Total Acceleration Cost with Negative Rewards:
I am also looking to calculate the total acceleration cost by assigning negative rewards during states or transitions where acceleration occurs. I seek confirmation on whether this approach is appropriate or if there are alternative strategies to model and calculate acceleration costs effectively.
Calculate Time Steps Before Entering a Critical State:
Additionally, I aim to compute the number of time steps before the ego vehicle transitions into a critical state (e.g., from s=0 to s=7). However, when I use the PRISM simulator to observe the rewards, it only displays values at s=0 and s=7. I need guidance on how to properly define and capture rewards or properties to account for the intermediate states and the total time steps elapsed during this transition.
I have attached the properties and reward expressions I have implemented so far for your review. Any suggestions or advice on how to address these challenges would be greatly appreciated.
Thank you for your time