Evaluating Features for Machine Learning Detection of Order- and Non-Order-Dependent Flaky Tests