Using Dueling Double Q-learning for Voltage Regulation in PV-Rich Distribution Networks