HUD (head up display) device combining voice and video recognition

The invention provides an HUD (head up display) device combining voice and video recognition. The HUD device combining voice and video recognition comprises an HUD module, a camera-shooting module, a voice module and a circuit module. The circuit module integrates a GPU (graphics processing unit), an ARM (advanced RISC machine) processor, an acceleration sensor, a GPS (global positioning system) sensor, WIFI (wireless fidelity) and a Bluetooth module, wherein the GPU acquires image signals transmitted by the camera-shooting module to match the image signals with preset image signals, or the ARM processor acquires voice signals transmitted by the voice module to match the voice signals with preset voice signals so as to obtain a matched action command; the ARM processor receives the action command, executes relative in-vehicle actions automatically and outputs the actions to the HUD module to display finally. The HUD device combining voice and video recognition has the advantages that HUD display content is enriched greatly by the aid of voice and data such as acceleration sensor data; a driver can always concentrate on the roadways by watching the HUD display content and can obtain vehicle and peripheral information safely and timely, and accordingly, man-machine interaction, rapidity and convenience are achieved.