An Empirical Comparative Assessment of Inter-Rater Agreement of Binary Outcomes and Multiple Raters