Lucene in Action
暂无分享,去创建一个
Lucene is a rare gem in the Apache Jakarta world?a highly scalable, fast, and pure Java search engine. Its initial attractions are its performance, its simplicity, and its disarming ease-of-use. But there are many caveats, best practices, and examples that users are looking for that are not yet well documented. The lucene-user e-mail list is very active and helpful, but many users seek more guidance and examples. Lucene in Action describes what Lucene is and how it works and most importantly how it can be used in a variety of real-world use cases, such at Nutch. Nutch?an open-source project designed to index the internet very much like Google?is built upon Lucene. Lucene in Action provides readers with best practices, tried and true from the field, including: * Understanding and solving ?analysis paralysis? * Advanced searching techniques, including filtering and custom query parsing techniques. * Handling document types such as Word, PDF, HTML, XML, and others.