Silent Speech Interface With Vocal Speaker Assistance Based on Convolution-Augmented Transformer