Document structure extraction for interactive document retrieval systems