Print Email Facebook Twitter Reinforcement Learning based Algorithm with Safety Handling and Risk Perception Title Reinforcement Learning based Algorithm with Safety Handling and Risk Perception Author Shyamsundar, S. Mannucci, T. (TU Delft Control & Simulation) van Kampen, E. (TU Delft Control & Simulation) Contributor Jin, Y (editor) Kollias, S. (editor) Date 2016 Abstract Navigation in an unknown or uncertain environment is a challenging task for an autonomous agent. The agent is expected to behave independently and to learn the suitable action to take for a given situation. Reinforcement Learning could be used to help the agent adapt to an unknown environment and learn the right actions to take. This paper presents the setup and the results of a reinforcement learning problem utilizing Q-learning and a Safety Handling Exploration with Risk Perception Algorithm (SHERPA) for safe exploration in an unknown environment. The agent has to explore its environment safely and must learn the optimal action for a given situation from the feedback received from the environment. The results show that the agent can learn a value function converged to within 10% of the optimal values after 5000 iterations. The simulation results show that the proposed approach ensures that the agent explores an unknown environment safely and learns the desirable actions for a given situation. To reference this document use: http://resolver.tudelft.nl/uuid:365b004d-74e8-47d5-9696-ee86f4cb37e4 DOI https://doi.org/10.1109/SSCI.2016.7849367 Publisher IEEE Embargo date 2018-01-01 Source 2016 IEEE Symposium Series on Computational Intelligence: Athens, Greece Event 2016 IEEE Symposium Series on Computational Intelligence, 2016-10-06 → 2016-10-09, Athens, Greece Part of collection Institutional Repository Document type conference paper Rights © 2016 S. Shyamsundar, T. Mannucci, E. van Kampen Files PDF Shyamsundar_Reinforcement ... evised.pdf 608.68 KB Close viewer /islandora/object/uuid:365b004d-74e8-47d5-9696-ee86f4cb37e4/datastream/OBJ/view