BenchmarkDP - Benchmark dataset for text extraction from general documents