A Methodology for Automatically Tuned Parallel Tridiagonalization on Distributed Memory Vector-parallel Machines