Scaling-Up Distributed Processing of Data Streams for Machine Learning