Science Current Events | Science News | Brightsurf.com
 
Building Search Applications: Lucene, LingPipe, and Gate
View Larger Image

Building Search Applications: Lucene, LingPipe, and Gate | Paperback

by Manu Konchady (Author)

List Price: $44.95  
Price:  $40.45
You Save:  $4.50 (10%)
Available:  Usually ships in 24 hours

Binding:  Paperback
Publisher:  Mustru Publishing
Edition:  Firstth Edition
Page Count:  448 Pages
Publication Date:  June 01, 2008
Sales Rank:  81,773st


EDITORIAL REVIEWS


Product Description
Lucene, LingPipe, and Gate are popular open source tools to build powerful search applications. Building Search Applications describes functions from Lucene that include indexing, searching, ranking, and spelling correction to build search engines. Use LingPipe and Gate to find the meaning of text to make search applications more useful. With this book you will learn to: - Extract tokens from text using custom tokenizers and analyzers from Lucene, LingPipe, and Gate. - Construct a search engine index with an optional backend database to manage large document collections. - Explore the wide range of Lucene queries to search an index, understand the ranking algorithm for a query, and suggest spelling corrections. - Find the names of people, places, and other entities in text using LingPipe and Gate. - Categorize documents by topic using classifiers and build groups of self-organized documents using clustering algorithms from LingPipe. - Create a Web crawler to scan the Web, Intranet, or desktop using Nutch. - Track the sentiment of articles published on the Web with LingPipe - Detect plagiarism of documents using a registered document collection.


CUSTOMER REVIEWS (Average Customer Rating: 4.5 based on 3 reviews)

Pragmatic intro to Web Information Retrieval by Gulli Antonino 4 Stars
August 09, 2009
Building Search Applications: Lucene, LingPipe, and Gate is a pretty good introduction to Information Retrieval with a lot of pragmatic examples. Based on Lucene, Gate and LingPipe. I recomend to add it to your library if you like Lucene and Nutch or if you need to maintain or create a medium scale search application.

Good but.. by Songkran Thongsawang 4 Stars
May 14, 2009
This is a good book to create search application. However, it is not easy enough for newbies. You need to have some experience and familiar enough with Lucene and related packages.

An excellent discussion of the topic with plenty of example source code by Fayyazuddin A. Syed (Toronto, ON Canada) 5 Stars
April 19, 2009
Unfortunately, there are not too many books written on the subject of Information Retrieval as it relates to Java programming, and thankfully, Mr.Konchady's contribution is an excellent resource. It provides a nice balance between the discussion of the theory of Information Retrieval, and providing concrete examples in Java, using Lucene, LingPipe, and Gate (API's for Information Retrieval used in Java). I personally had only heard of Lucene before coming across this book, and was very thankful to learn of the other two (LingPipe, and Gate) afterwards. The book shows the user how to use the above API's together when building an application, which is a great learning opportunity for the reader, because most tutorials available for Lucene, LingPipe, or Gate that you'll find online show you how to use that particular API only, instead of showing you how to use it in conjunction with others to truly harness the power of Information Retrieval with Natural Language Processing, together. The other very nice thing about this book is that the author also introduces the reader to other tools (such as Nutch, WordNet, etc.) that allows the user to provide advanced functionality, without "re-inventing the wheel". This book is a must read for anyone who is serious about learning to develop applications involving Information Retrieval.

SIMILAR PRODUCTS


Lucene in Action (In Action series)

Lucene in Action (In Action series)
by Otis Gospodnetic (Author), Erik Hatcher (Author)

Lucene's performance, simplicity, disarming ease-of-use, and best practices are covered in this look at the highly scalable, fast, and pure Java search engine. Newly documented solutions explain what Lucene is and how it works and how it can be used in a variety of real-world applications such as Nutch. Users will also use this guide to understand and solve "analysis paralysis," employ advanced searching techniques, such as filtering and custom query parsing, and handle a variety of document...

Introduction to Information Retrieval

Introduction to Information Retrieval
by Christopher D. Manning (Author), Prabhakar Raghavan (Author), Hinrich Schütze (Author)

Class-tested and coherent, this groundbreaking new textbook teaches web-era information retrieval, including web search and the related areas of text classification and text clustering from basic concepts. Written from a computer science perspective by three leading experts in the field, it gives an up-to-date treatment of all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of...

Solr 1.4 Enterprise Search Server

Solr 1.4 Enterprise Search Server
by David Smiley (Author), Eric Pugh (Author)

Enhance your search with faceted navigation, result highlighting, fuzzy queries, ranked scoring, and more Deploy, embed, and integrate Solr with a host of programming languagesImplement faceting in e-commerce and other sites to summarize and navigate the results of a text searchEnhance your search by highlighting search results, offering spell-corrections, auto-suggest, finding "similar" records, boosting records and fields for scoring, phonetic matchingInformative and practical approach to...

Collective Intelligence in Action

Collective Intelligence in Action
by Satnam Alag (Author), Richard MacManus (Foreword)

There's a great deal of wisdom in a crowd, but how do you listen to a thousand people talking at once? Identifying the wants, needs, and knowledge of internet users can be like listening to a mob.

In the Web 2.0 era, leveraging the collective power of user contributions, interactions, and feedback is the key to market dominance. A new category of powerful programming techniques lets you discover the patterns, inter-relationships, and individual profiles-the collective intelligence--locked...

Hadoop: The Definitive Guide

Hadoop: The Definitive Guide
by Tom White (Author), White Tom (Author)

Hadoop: The Definitive Guide helps you harness the power of your data. Ideal for processing large datasets, the Apache Hadoop framework is an open source implementation of the MapReduce algorithm on which Google built its empire. This comprehensive resource demonstrates how to use Hadoop to build reliable, scalable, distributed systems: programmers will find details for analyzing large datasets, and administrators will learn how to set up and run Hadoop clusters.

Complete with case...

© 2009 BrightSurf.com