The millimeter-wave (mmWave) massive multi-user multiple-input multiple-output (MU-MIMO) can significantly improve the throughput and spectral efficiency by equipping multi-antenna at both the base station (BS) and the users, which is greatly promising technologies for wireless communication systems. However, the large hardware requirement and high energy consumption are great hindrances. To effectively reduce the hardware and computational complexity and achieve a better trade-off between system complexity and performance, we propose an energy-efficient split sub-connected architecture based multi-user hybrid block diagonalization (BD) precoding and combining (Sub MU-HBD) scheme. The proposed Sub MU-HBD scheme employs the split sub-connected architecture to reduce the hardware complexity and improve spectral efficiency. Additionally, to significantly reduce the inter-user interference and further lessen the computational complexity, a low-dimensional digital precoding and combining scheme based on the BD is proposed. Furthermore, we propose a proportional water-filling power allocation algorithm to ensure optimal power distribution to users and improve energy efficiency. The simulation results demonstrate that the proposed Sub MU-HBD scheme can achieve near-optimal, and at times even better performance than some existing schemes.