Benchmarking AlphaFold for protein complex modeling reveals accuracy determinants