Investigation of operationally more powerful duo-trio test protocols: Effects of different reference schemes