A survey of historical document image datasets