Improving user verification in human-robot interaction from audio or image inputs through sample quality assessment