Q-Learning Based Two-Timescale Power Allocation for Multi-Homing Hybrid RF/VLC Networks | IEEE Journals & Magazine | IEEE Xplore