Method and device for sorting information of namesake persons on Internet
An embodiment of the present invention discloses a method and apparatus for classifying information about namesake persons on the Internet. The method comprises: retrieving, according to input person-name information, the relevant web pages that contain the person's name; extracting person attribute features and page topic features from the relevant web pages; generalizing the person attribute features and the page topic features using a hypernym-hyponym dictionary and/or a synonym dictionary; obtaining an initial relationship-graph clustering result for the relevant pages from the generalized person attribute features, and obtaining an initial clustering result for the relevant pages from the generalized page topic features; and fusing the initial relationship-graph clustering result with the initial clustering result to obtain a final classification result for the relevant pages. Embodiments of the present invention can cluster the various web pages that contain the same person's name more precisely and accurately, thereby yielding a more accurate classification of the pages by actual person.
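The steps listed in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how the relationship graph is built, how topic features are clustered, or how the two results are fused. The sketch assumes that pages sharing any generalized attribute are linked in the graph, that topic clustering is single-link over a Jaccard threshold, and that fusion places two pages together if either initial result does. All function names, dictionaries, the threshold, and the sample pages are hypothetical.

```python
from itertools import combinations


def generalize(features, synonyms, hypernyms):
    """Map each feature to its canonical synonym, then to its hypernym if listed."""
    out = set()
    for f in features:
        f = synonyms.get(f, f)
        out.add(hypernyms.get(f, f))
    return out


def components(n, linked):
    """Label n items by connected component, given a pairwise link predicate."""
    parent = list(range(n))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    for i, j in combinations(range(n), 2):
        if linked(i, j):
            parent[find(i)] = find(j)
    return [find(i) for i in range(n)]


def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    return len(a & b) / len(a | b) if a | b else 0.0


def classify(pages, synonyms, hypernyms, topic_threshold=0.5):
    """Return one cluster label per page (assumed fusion rule: union of links)."""
    attrs = [generalize(p["attrs"], synonyms, hypernyms) for p in pages]
    topics = [generalize(p["topics"], synonyms, hypernyms) for p in pages]
    n = len(pages)
    # Assumption: a graph edge exists when two pages share a generalized attribute.
    graph_labels = components(n, lambda i, j: bool(attrs[i] & attrs[j]))
    # Assumption: topic clustering is single-link over a Jaccard threshold.
    topic_labels = components(
        n, lambda i, j: jaccard(topics[i], topics[j]) >= topic_threshold
    )
    # Assumption: fuse by merging pages grouped together in either initial result.
    return components(
        n,
        lambda i, j: graph_labels[i] == graph_labels[j]
        or topic_labels[i] == topic_labels[j],
    )


# Hypothetical dictionaries and pages, all returned for the same person name.
synonyms = {"NLP": "natural language processing", "prof": "professor"}
hypernyms = {"professor": "teacher", "lecturer": "teacher", "striker": "athlete"}
pages = [
    {"attrs": {"prof"}, "topics": {"NLP", "parsing"}},
    {"attrs": {"lecturer"}, "topics": {"natural language processing", "parsing"}},
    {"attrs": {"striker"}, "topics": {"football", "league"}},
    {"attrs": set(), "topics": {"football", "league", "transfer"}},
]
labels = classify(pages, synonyms, hypernyms)
```

With this sample data, pages 0 and 1 receive one label and pages 2 and 3 another. Page 3 has no extracted attributes, so only the topic clustering links it to page 2, which is the kind of case where combining the two initial results helps.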