DEVELOPMENT OF THE DIGITAL-TWIN FOR BUILDING FACILITIES (PART 3): A COMPARISON OF METAHEURISTICS AND REINFORCEMENT LEARNING FOR OPTIMAL CONTROLS