A behavior-based scheme using reinforcement learning for autonomous underwater vehicles