Expediting Distributed DNN Training With Device Topology-Aware Graph Deployment