Integrating DVFS and Task Scheduling to Improve Energy Efficiency for Heterogeneous Edge Devices: A Reinforcement Learning Approach