An analysis of design process and performance in distributed data science teams