High-order Deep Neural Networks for Learning Multi-Modal Representations