End-to-End Automatic Speech Recognition Integrated with CTC-Based Voice Activity Detection