Since the growth of the Internet,World Wide Web has become significant infrastructure in various fields such as business, commerce, education and so on. Accordingly, a user has gathered information by using the Internet. However due to increasing Web pages, it becomes difficult for a user to collect desirable information. Advanced Web search engines may provide solution to some extent, it is still up to a user to summarize or extract meaningful information from such retrieval results. Based on this viewpoints, this paper addresses a generation method of table-style data from heterogeneous Web pages that reflects a user’s intention. To achieve it, the method utilize a user’s instantiated example in a table in addition to column labels as the table. Based on a user’s instantiated example, meaningful information are extracted using pattern matching and N-gram method. We apply this method to 57 pages with 27 travel agencies whether the proposed method is effective or not. As the result, 88% was precision rate and 68% was recall rate.
[1]
Tadashi Dohi,et al.
A Web Page Ranking Algorithm Based on a Markov Decision Process
,
2006
.
[2]
Shigeru Fujita,et al.
Improvement of a Re-ranking Method for Web Search Based on Mutual Evaluation among Web Pages
,
2006
.
[3]
Ryoji Kataoka,et al.
Snippet Generation for Geographic Information Retrieval
,
2009
.
[4]
Hiroyuki Sakai,et al.
A Multiple-Document Summarization System Introducing User Interaction for Reflecting User's Summarization Need
,
2004,
NTCIR.
[5]
Masaaki Kikuchi,et al.
Influence of Presentation Style in Web-Search Result Recommendation
,
2009
.