Reinforcement Learning in Block Markov Chains