Robust analysis of 5′-transcript ends: a high-throughput protocol for characterization of sequence diversity of transcription start sites