Vision Transformer for NeRF-Based View Synthesis from a Single Input Image