Dynamic Pricing Scheme for Edge Computing Services: A Two-layer Reinforcement Learning Approach