Modoru: Clos nanosecond optical switching for distributed deep training [Invited]