Automatic text structuring and retrieval in large natural-language text files