Create bookmark
Apache Solr 3.1 Cookbook
Over 100 recipes to discover new ways to work with Apache’s Enterprise Search Server
Do you like this document?
Notes
Please login to add notes
- Cover
- Copyright
- Credits
- About the Author
- About the Reviewers
- www.PacktPub.com
- Table of Contents
- Preface
-
+
Chapter 1: Apache Solr Configuration
- Introduction
- Running Solr on Jetty
- Running Solr on Apache Tomcat
- Using the Suggester component
- Handling multiple languages in a single index
- Indexing fields in a dynamic way
- Making multilingual data searchable
- with multicore deployment
- Solr cache configuration
- How to fetch and index web pages
- Getting the most relevant results with
- early query termination
- How to set up Extracting Request Handler
-
+
Chapter 2: Indexing your Data
- Introduction
- Indexing data in CSV format
- Indexing data in XML format
- Indexing data in JSON format
- Indexing PDF files
- Indexing Microsoft Office files
- Extracting metadata from binary files
- How to properly configure Data Import
- Handler with JDBC
- Indexing data from a database using Data
- Import Handler
- How to import data using Data Import
- Handler and delta query
- How to use Data Import Handler with
- URL Data Source
- How to modify data while importing with
- Data Import Handler
-
+
Chapter 3: Analyzing your Text Data
- Introduction
- Storing additional information using
- payloads
- Eliminating XML and HTML tags from
- the text
- Copying the contents of one field to another
- Changing words to other words
- Splitting text by camel case
- Splitting text by whitespace only
- Making plural words singular, but
- without stemming
- Lowercasing the whole string
- Storing geographical points in the index
- Stemming your data
- Preparing text to do efficient trailing
- wildcard search
- Splitting text by numbers and non-white
- space characters
-
+
Chapter 4: Solr Administration
- Introduction
- Monitoring Solr via JMX
- How to check the cache status
- How to check how the data type or field
- behave
- How to check Solr query handler usage
- How to check Solr update handler usage
- How to change Solr instance logging
- configuration
- How to check the Java based replication
- status
- How to check the script based replication
- status
- Setting up a Java based index replication
- Setting up script based replication
- How to manage Java based replication
- status using HTTP commands
- How to analyze your index structure
-
+
Chapter 5: Querying Solr
- Introduction
- Asking for a particular field value
- Sorting results by a field value
- Choosing a different query parser
- How to search for a phrase, not a single
- word
- Boosting phrases over words
- Positioning some documents over others
- on a query
- Positioning documents with words closer
- to each other first
- Sorting results by a distance from a point
- Getting documents with only a partial match
- Affecting scoring with function
- Nesting queries
-
+
Chapter 6: Using Faceting Mechanism
- Introduction
- Getting the number of documents with
- the same field value
- Getting the number of documents with
- the same date range
- Getting the number of documents with
- the same value range
- Getting the number of documents matching Getting the number of documents matching
- the query and sub query
- How to remove filters from faceting results
- How to name different faceting results
- How to sort faceting results in an
- alphabetical order
- How to implement the autosuggest
- feature using faceting
- How to get the number of documents
- that don't have a value in the field
- How to get all the faceting results,
- not just the first hundred ones
- How to have two different facet limits for
- two different fields in the same query
-
+
Chapter 7: Improving Solr Performance
- Introduction
- Paging your results quickly
- Configuring the document cache
- Configuring the query result cache
- Configuring the filter cache
- Improving Solr performance right after
- the startup or commit operation
- Setting up a sharded deployment
- Caching whole result pages
- Improving faceting performance
- What to do when Solr slows down during
- indexing when using Data Import Handler
- Getting the first top documents fast when
- having millions of them
- + Chapter 8: Creating Applications that use Solr and Developing your Own Solr Modules
-
+
Chapter 9: Using Additional Solr Functionalities
- Introduction
- Getting more documents similar to those
- returned in the results list
- Presenting search results in a fast
- and easy way
- Highlighting matched words
- How to highlight long text fields and get
- good performance
- Sorting results by a function value
- Searching words by how they sound
- Ignoring defined words
- Computing statistics for the search results
- Checking user's spelling mistakes
- Using "group by" like functionalities in Solr
-
+
Chapter 10: Dealing with Problems
- Introduction
- How to deal with a corrupted index
- How to reduce the number of files the index
- is made of
- How to deal with a locked index
- How to deal with too many opened files
- How to deal with out of memory problems
- How to sort non-English languages properly
- How to deal with the infinite loop exception
- when using shards
- How to deal with garbage collection running
- too long
- How to update only one field in all
- documents without the need of full
- indexation
- How to make your index smaller
- Index
This book is part of Packt's Cookbook series; each chapter looks at a different aspect of working with Apache Solr. The recipes deal with common problems of working with Solr by using easy-to-understand, real-life examples. The book is not in any way a complete Apache Solr reference and you should see it as a helping hand when things get rough on your journey with Apache Solr. Developers who are working with Apache Solr and would like to know how to combat common problems will find this book of great use. Knowledge of Apache Lucene would be a bonus but is not required.
Your free to read time expires in minutes. After that you have to pause for an hour.
Test the closed alpha on paperc.com
Book Details
Authors
Categories
Publishers
Publication year : 2011
License: All rights reserved ©
Times read: 676

