Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning