Reinforcement learning on Lunar Lander Continuous v2 using Soft actor-critic Soft actor-critic algorithm using Lunar Lander Continuous v2