Tailoring domain decomposition methods for efficient parallel coarse grid solution and for systems w