Web text-based method for acquiring and screening seismic macroscopic anomaly information

The invention belongs to the field of text data mining and provides a web text-based method for acquiring and screening seismic macroscopic anomaly information, used to collect and screen textual reports of seismic macroscopic anomalies from the internet. The method is built on the Heritrix framework and uses a group of seismic macroscopic anomaly subject descriptors. For each of three information sources, namely general web pages, forums (post bars) and social networks, it customizes a crawling strategy that runs from subject relevance judgment and link ordering through to information extraction. The subject-related pages that are crawled are then screened in three main respects: subjective sentence identification, text subjectivity judgment, and seismic macroscopic anomaly matching. The method provides a scientific, efficient and accurate technical means for the online collection of seismic macroscopic anomaly information and greatly improves the efficiency of information acquisition.
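The crawl-side ordering and the three screening signals can be illustrated with a minimal Python sketch. It is not the Heritrix-based implementation described above, and everything specific in it is an assumption made for illustration: the descriptor terms and weights in `DESCRIPTORS`, the cue phrases in `SUBJECTIVE_CUES`, and the function names `relevance`, `order_links` and `screen` are all hypothetical. The sketch only computes the signals; how they are thresholded or combined is not stated in the abstract and is left open.

```python
import heapq
import re

# Hypothetical descriptor group with weights; the abstract does not list the real terms.
DESCRIPTORS = {"earthquake": 1.0, "well water": 2.0, "animals": 1.5, "ground light": 2.0}

# Hypothetical first-person cue phrases used as a crude subjectivity signal.
SUBJECTIVE_CUES = ("i saw", "i heard", "we noticed", "i think", "i feel")


def relevance(text):
    """Weighted share of descriptor terms that occur in the text, in [0, 1]."""
    lowered = text.lower()
    hit = sum(weight for term, weight in DESCRIPTORS.items() if term in lowered)
    return hit / sum(DESCRIPTORS.values())


def order_links(links):
    """Order (url, anchor_text) pairs so the most subject-relevant URL comes first."""
    heap = [(-relevance(anchor), url) for url, anchor in links]
    heapq.heapify(heap)
    return [heapq.heappop(heap)[1] for _ in range(len(heap))]


def split_sentences(text):
    """Naive sentence splitter on ., ! and ? (an assumption, not the patent's method)."""
    return [s.strip() for s in re.split(r"[.!?]+", text) if s.strip()]


def screen(text):
    """Compute the three screening signals for one crawled page.

    Returns the subjective sentences, the share of subjective sentences in the
    text, and the descriptor terms matched. No keep/discard decision is made.
    """
    sentences = split_sentences(text)
    subjective = [s for s in sentences if any(c in s.lower() for c in SUBJECTIVE_CUES)]
    ratio = len(subjective) / len(sentences) if sentences else 0.0
    matched = sorted(term for term in DESCRIPTORS if term in text.lower())
    return {
        "subjective_sentences": subjective,
        "subjectivity_ratio": ratio,
        "matched_anomalies": matched,
    }


if __name__ == "__main__":
    links = [
        ("http://a.example/1", "Football results"),
        ("http://a.example/2", "Well water turned muddy before the earthquake"),
        ("http://a.example/3", "Earthquake news"),
    ]
    print(order_links(links))
    print(screen("I saw the well water bubbling. The weather was cold. Animals ran away."))
```

A real system would replace the substring matching with proper word segmentation and a trained subjectivity classifier; the priority queue here only stands in for the link-ordering step of the crawler.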