Autonomous underwater vehicle control using reinforcement learning policy search methods