PhyloCSF: a comparative genomics method to distinguish protein-coding and non-coding regions