Speech recognition apparatus and method using visual information

PURPOSE: A voice recognition device using video information and a method thereof are provided to set a voice recognition variable by identifying a speaker or an age and sex of the speaker from the video information. CONSTITUTION: A variable setting unit(150) sets a voice recognition variable based on information about a video where a speaker is photographed. A voice recognition unit(140) recognizes voice information inputted from the speaker by using the set voice recognition variable. A video identifying unit(130) identifies the speaker corresponding to features extracted from the video information. The variable setting unit sets up the voice recognition variable corresponding to the identified speaker. If the speaker is unidentified from the video information, the video identifying unit identifies an age or sex of the speaker based on the features extracted from the video information.