Comparison of IR Models for Text Classification

As there is availability of large amount of data on the web, but due to constraints web is only used for browsing and searching. Traditional IE uses NLP techniques such as lexicons, grammars, whereas web applies machine learning and pattern mining techniques to exploit the syntactical patterns or layout structures of the template-based documents.Information Retrieval is the art of presentation, storage, organization of and access to information items.IR now–-days mainly deals with retrieving information based on user queries. The paper deals with basic understanding of IR and IR models and shows Support Vector Machines is a good technique fir classification of huge data sets. General Terms Information Retrieval (IR)

[1]  Hang Li,et al.  A Short Introduction to Learning to Rank , 2011, IEICE Trans. Inf. Syst..

[2]  R. Manmatha,et al.  An Inference Network Approach to Image Retrieval , 2004, CIVR.

[3]  M Lebl,et al.  Using Support Vector Machine Regression to Model the Retention of Peptides in Immobilized Metal-affinity Chromatography. , 2007, Sensors and actuators. B, Chemical.

[4]  Ricardo Baeza-Yates,et al.  Modern Information Retrieval - the concepts and technology behind search, Second edition , 2011 .

[5]  Tie-Yan Liu,et al.  Learning to rank for information retrieval , 2009, SIGIR.

[6]  James G. Shanahan,et al.  Improving SVM Text Classification Performance through Threshold Adjustment , 2003, ECML.

[7]  Vannevar Bush,et al.  As we may think , 1945, INTR.

[8]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[9]  Berkant Barla Cambazoglu,et al.  Review of "Search Engines: Information Retrieval in Practice" by Croft, Metzler and Strohman , 2010, Inf. Process. Manag..

[10]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[11]  W. Bruce Croft,et al.  Search Engines - Information Retrieval in Practice , 2009 .

[12]  Christopher D. Manning,et al.  Introduction to Information Retrieval , 2010, J. Assoc. Inf. Sci. Technol..

[13]  CHENGXIANG ZHAI,et al.  A study of smoothing methods for language models applied to information retrieval , 2004, TOIS.

[14]  Lidan Wang,et al.  Learning to efficiently rank , 2010, SIGIR.

[15]  Sotiris Kotsiantis,et al.  Text Classification Using Machine Learning Techniques , 2005 .

[16]  Mandar Mitra,et al.  Information Retrieval from Documents: A Survey , 2000, Information Retrieval.

[17]  Gerard Salton,et al.  Improving retrieval performance by relevance feedback , 1997, J. Am. Soc. Inf. Sci..

[18]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[19]  John Davies,et al.  Information Retrieval: Searching in the 21st Century , 2009, Information Retrieval.

[20]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[21]  Tristan Fletcher,et al.  Support Vector Machines Explained , 2008 .

[22]  Dustin Boswell,et al.  Introduction to Support Vector Machines , 2002 .

[23]  W. Bruce Croft,et al.  Combining the language model and inference network approaches to retrieval , 2004, Inf. Process. Manag..