2007-06-06 23:17:18 -04:00
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
2006-02-24 21:51:19 -05:00
< html >
< head >
< META http-equiv = "Content-Type" content = "text/html; charset=UTF-8" >
< meta content = "Apache Forrest" name = "Generator" >
2007-06-06 23:17:18 -04:00
< meta name = "Forrest-version" content = "0.8" >
2006-02-24 21:51:19 -05:00
< meta name = "Forrest-skin-name" content = "pelt" >
2006-03-01 14:50:03 -05:00
< title > Introduction to The Solr Enterprise Search Server< / title >
2006-02-24 21:51:19 -05:00
< link type = "text/css" href = "skin/basic.css" rel = "stylesheet" >
< link media = "screen" type = "text/css" href = "skin/screen.css" rel = "stylesheet" >
< link media = "print" type = "text/css" href = "skin/print.css" rel = "stylesheet" >
< link type = "text/css" href = "skin/profile.css" rel = "stylesheet" >
< script src = "skin/getBlank.js" language = "javascript" type = "text/javascript" > < / script > < script src = "skin/getMenu.js" language = "javascript" type = "text/javascript" > < / script > < script src = "skin/fontsize.js" language = "javascript" type = "text/javascript" > < / script >
< link rel = "shortcut icon" href = "images/favicon.ico" >
< / head >
< body onload = "init()" >
< script type = "text/javascript" > ndeSetTextSize ( ) ; < / script >
< div id = "top" >
2007-06-06 23:17:18 -04:00
<!-- +
|breadtrail
+-->
2006-02-24 21:51:19 -05:00
< div class = "breadtrail" >
2007-01-18 11:27:03 -05:00
< a href = "http://www.apache.org/" > apache< / a > > < a href = "http://lucene.apache.org/" > lucene< / a > < script src = "skin/breadcrumbs.js" language = "JavaScript" type = "text/javascript" > < / script >
2006-02-24 21:51:19 -05:00
< / div >
2007-06-06 23:17:18 -04:00
<!-- +
|header
+-->
2006-02-24 21:51:19 -05:00
< div class = "header" >
2007-06-06 23:17:18 -04:00
<!-- +
|start group logo
+-->
2006-02-24 21:51:19 -05:00
< div class = "grouplogo" >
2007-01-18 00:10:35 -05:00
< a href = "http://lucene.apache.org/" > < img class = "logoImage" alt = "Lucene" src = "images/lucene_green_150.gif" title = "Apache Lucene" > < / a >
2006-02-24 21:51:19 -05:00
< / div >
2007-06-06 23:17:18 -04:00
<!-- +
|end group logo
+-->
<!-- +
|start Project Logo
+-->
2006-02-24 21:51:19 -05:00
< div class = "projectlogo" >
2007-01-18 00:10:35 -05:00
< a href = "http://lucene.apache.org/solr/" > < img class = "logoImage" alt = "Solr" src = "images/solr.png" title = "Solr Description" > < / a >
2006-02-24 21:51:19 -05:00
< / div >
2007-06-06 23:17:18 -04:00
<!-- +
|end Project Logo
+-->
<!-- +
|start Search
+-->
2006-02-24 21:51:19 -05:00
< div class = "searchbox" >
< form action = "http://www.google.com/search" method = "get" class = "roundtopsmall" >
2007-01-18 00:10:35 -05:00
< input value = "lucene.apache.org" name = "sitesearch" type = "hidden" > < input onFocus = "getBlank (this, 'Search the site with google');" size = "25" name = "q" id = "query" type = "text" value = "Search the site with google" >
2007-06-06 23:17:18 -04:00
< input name = "Search" value = "Search" type = "submit" >
2006-02-24 21:51:19 -05:00
< / form >
< / div >
2007-06-06 23:17:18 -04:00
<!-- +
|end search
+-->
<!-- +
|start Tabs
+-->
2006-02-24 21:51:19 -05:00
< ul id = "tabs" >
< li class = "current" >
2007-06-06 23:17:18 -04:00
< a class = "selected" href = "index.html" > Main< / a >
2006-02-24 21:51:19 -05:00
< / li >
< li >
2007-06-06 23:17:18 -04:00
< a class = "unselected" href = "http://wiki.apache.org/solr" > Wiki< / a >
2006-02-24 21:51:19 -05:00
< / li >
< / ul >
2007-06-06 23:17:18 -04:00
<!-- +
|end Tabs
+-->
2006-02-24 21:51:19 -05:00
< / div >
< / div >
< div id = "main" >
< div id = "publishedStrip" >
2007-06-06 23:17:18 -04:00
<!-- +
|start Subtabs
+-->
2006-02-24 21:51:19 -05:00
< div id = "level2tabs" > < / div >
2007-06-06 23:17:18 -04:00
<!-- +
|end Endtabs
+-->
2006-02-24 21:51:19 -05:00
< script type = "text/javascript" > < ! - -
2007-06-06 23:17:18 -04:00
document.write("Last Published: " + document.lastModified);
2006-02-24 21:51:19 -05:00
// -->< / script >
< / div >
2007-06-06 23:17:18 -04:00
<!-- +
|breadtrail
+-->
2006-02-24 21:51:19 -05:00
< div class = "breadtrail" >
2007-06-06 23:17:18 -04:00
2006-02-24 21:51:19 -05:00
< / div >
2007-06-06 23:17:18 -04:00
<!-- +
|start Menu, mainarea
+-->
<!-- +
|start Menu
+-->
2006-02-24 21:51:19 -05:00
< div id = "menu" >
< div onclick = "SwitchMenu('menu_1.1', 'skin/')" id = "menu_1.1Title" class = "menutitle" > About< / div >
< div id = "menu_1.1" class = "menuitemgroup" >
< div class = "menuitem" >
< a href = "index.html" title = "Welcome to Solr" > Welcome< / a >
< / div >
< div class = "menuitem" >
< a href = "who.html" title = "Solr Committers" > Who We Are< / a >
< / div >
< / div >
< div onclick = "SwitchMenu('menu_selected_1.2', 'skin/')" id = "menu_selected_1.2Title" class = "menutitle" style = "background-image: url('skin/images/chapter_open.gif');" > Documentation< / div >
< div id = "menu_selected_1.2" class = "selectedmenuitemgroup" style = "display: block;" >
< div class = "menupage" >
< div class = "menupagetitle" > Features< / div >
< / div >
< div class = "menuitem" >
2007-01-18 00:10:35 -05:00
< a href = "tutorial.html" > Tutorial< / a >
2006-02-24 21:51:19 -05:00
< / div >
< div class = "menuitem" >
2007-01-18 00:10:35 -05:00
< a href = "http://wiki.apache.org/solr/" > Docs (Wiki)< / a >
2006-02-24 21:51:19 -05:00
< / div >
< div class = "menuitem" >
2007-01-18 00:10:35 -05:00
< a href = "http://wiki.apache.org/solr/FAQ" > FAQ< / a >
2006-02-24 21:51:19 -05:00
< / div >
< div class = "menuitem" >
2007-01-18 00:10:35 -05:00
< a href = "api/index.html" > javadoc< / a >
2006-02-24 21:51:19 -05:00
< / div >
< / div >
< div onclick = "SwitchMenu('menu_1.3', 'skin/')" id = "menu_1.3Title" class = "menutitle" > Resources< / div >
< div id = "menu_1.3" class = "menuitemgroup" >
< div class = "menuitem" >
2007-01-18 00:10:35 -05:00
< a href = "http://www.apache.org/dyn/closer.cgi/lucene/solr/" > Download< / a >
2006-02-24 21:51:19 -05:00
< / div >
< div class = "menuitem" >
< a href = "mailing_lists.html" > Mailing Lists< / a >
< / div >
< div class = "menuitem" >
< a href = "issue_tracking.html" > Issue Tracking< / a >
< / div >
< div class = "menuitem" >
< a href = "version_control.html" > Version Control< / a >
< / div >
< / div >
< div onclick = "SwitchMenu('menu_1.4', 'skin/')" id = "menu_1.4Title" class = "menutitle" > Related Projects< / div >
< div id = "menu_1.4" class = "menuitemgroup" >
< div class = "menuitem" >
< a href = "http://lucene.apache.org/java/" > Lucene Java< / a >
< / div >
< div class = "menuitem" >
< a href = "http://lucene.apache.org/nutch/" > Nutch< / a >
< / div >
< / div >
< div id = "credit" > < / div >
< div id = "roundbottom" >
< img style = "display: none" class = "corner" height = "15" width = "15" alt = "" src = "skin/images/rc-b-l-15-1body-2menu-3menu.png" > < / div >
2007-06-06 23:17:18 -04:00
<!-- +
|alternative credits
+-->
2006-02-24 21:51:19 -05:00
< div id = "credit2" > < / div >
< / div >
2007-06-06 23:17:18 -04:00
<!-- +
|end Menu
+-->
<!-- +
|start content
+-->
2006-02-24 21:51:19 -05:00
< div id = "content" >
< div title = "Portable Document Format" class = "pdflink" >
< a class = "dida" href = "features.pdf" > < img alt = "PDF -icon" src = "skin/images/pdfdoc.gif" class = "skin" > < br >
PDF< / a >
< / div >
2006-03-01 14:50:03 -05:00
< h1 > Introduction to The Solr Enterprise Search Server< / h1 >
2006-02-24 21:51:19 -05:00
< div id = "minitoc-area" >
< ul class = "minitoc" >
< li >
< a href = "#Solr+in+a+Nutshell" > Solr in a Nutshell< / a >
< / li >
< li >
2006-03-01 14:50:03 -05:00
< a href = "#Solr+Uses+the+Lucene+Search+Library+and+Extends+it%21" > Solr Uses the Lucene Search Library and Extends it!< / a >
2006-02-24 21:51:19 -05:00
< / li >
< li >
< a href = "#Detailed+Features" > Detailed Features< / a >
< ul class = "minitoc" >
< li >
< a href = "#Schema" > Schema< / a >
< / li >
< li >
< a href = "#Query" > Query< / a >
< / li >
< li >
< a href = "#Core" > Core< / a >
< / li >
< li >
< a href = "#Caching" > Caching< / a >
< / li >
< li >
< a href = "#Replication" > Replication< / a >
< / li >
< li >
< a href = "#Admin+Interface" > Admin Interface< / a >
< / li >
< / ul >
< / li >
< / ul >
< / div >
2006-11-10 19:47:14 -05:00
< a name = "N1000D" > < / a > < a name = "Solr+in+a+Nutshell" > < / a >
2006-02-24 21:51:19 -05:00
< h2 class = "boxed" > Solr in a Nutshell< / h2 >
< div class = "section" >
< p >
2006-03-01 14:50:03 -05:00
Solr is a standalone enterprise search server with a web-services like API. You put documents in it (called "indexing") via XML over HTTP. You query it via HTTP GET and receive XML results.
2006-02-24 21:51:19 -05:00
< / p >
< ul >
< li > Advanced Full-Text Search Capabilities < / li >
< li > Optimized for High Volume Web Traffic < / li >
< li > Standards Based Open Interfaces - XML and HTTP < / li >
< li > Comprehensive HTML Administration Interfaces < / li >
2006-03-01 14:50:03 -05:00
< li > Scalability - Efficient Replication to other Solr Search Servers < / li >
2006-02-24 21:51:19 -05:00
< li > Flexible and Adaptable with XML configuration < / li >
< li > Extensible Plugin Architecture < / li >
< / ul >
< / div >
2006-11-10 19:47:14 -05:00
< a name = "N1002F" > < / a > < a name = "Solr+Uses+the+Lucene+Search+Library+and+Extends+it%21" > < / a >
2006-03-01 14:50:03 -05:00
< h2 class = "boxed" > Solr Uses the Lucene Search Library and Extends it!< / h2 >
2006-02-24 21:51:19 -05:00
< div class = "section" >
< ul >
2007-06-06 23:17:18 -04:00
< li > A Real Data Schema, with Numeric Types, Dynamic Fields, Unique Keys < / li >
2006-02-24 21:51:19 -05:00
< li > Powerful Extensions to the Lucene Query Language < / li >
2007-05-23 15:39:58 -04:00
< li > Support for Dynamic Faceted Browsing and Filtering < / li >
2006-02-24 21:51:19 -05:00
< li > Advanced, Configurable Text Analysis < / li >
< li > Highly Configurable and User Extensible Caching < / li >
< li > Performance Optimizations < / li >
< li > External Configuration via XML < / li >
< li > An Administration Interface < / li >
< li > Monitorable Logging < / li >
< li > Fast Incremental Updates and Snapshot Distribution < / li >
2007-06-06 23:17:18 -04:00
< li > XML and CSV/delimited-text update formats < / li >
2006-02-24 21:51:19 -05:00
< / ul >
< / div >
2007-06-06 23:17:18 -04:00
< a name = "N1005A" > < / a > < a name = "Detailed+Features" > < / a >
2006-02-24 21:51:19 -05:00
< h2 class = "boxed" > Detailed Features< / h2 >
< div class = "section" >
2007-06-06 23:17:18 -04:00
< a name = "N10060" > < / a > < a name = "Schema" > < / a >
2006-02-24 21:51:19 -05:00
< h3 class = "boxed" > Schema< / h3 >
< ul >
< li > Defines the field types and fields of documents< / li >
< li > Can drive more intelligent processing< / li >
< li > Declarative Lucene Analyzer specification< / li >
2007-06-06 23:17:18 -04:00
< li > Dynamic Fields enables on-the-fly addition of new fields< / li >
2006-02-24 21:51:19 -05:00
< li > CopyField functionality allows indexing a single field multiple ways, or combining multiple fields into a single searchable field< / li >
< li > Explicit types eliminates the need for guessing types of fields< / li >
< li > External file-based configuration of stopword lists, synonym lists, and protected word lists< / li >
2007-06-06 23:17:18 -04:00
< li > Many additional text analysis components including word splitting, regex and sounds-like filters< / li >
2006-02-24 21:51:19 -05:00
< / ul >
2007-06-06 23:17:18 -04:00
< a name = "N10081" > < / a > < a name = "Query" > < / a >
2006-02-24 21:51:19 -05:00
< h3 class = "boxed" > Query< / h3 >
< ul >
2006-11-06 15:16:04 -05:00
< li > HTTP interface with configurable response formats (XML/XSLT, JSON, Python, Ruby)< / li >
2006-07-31 11:33:23 -04:00
2007-06-06 23:17:18 -04:00
< li > Sort by any number of fields< / li >
2006-07-31 11:33:23 -04:00
2007-06-06 23:17:18 -04:00
< li > Advanced DisMax query parser for high relevancy results from user-entered queries< / li >
2006-11-06 15:16:04 -05:00
2007-06-06 23:17:18 -04:00
< li > Highlighted context snippets< / li >
< li > Faceted Searching based on unique field values and explicit queries< / li >
2006-02-24 21:51:19 -05:00
< li > Constant scoring range and prefix queries - no idf, coord, or lengthNorm factors, and no restriction on the number of terms the query matches.< / li >
< li > Function Query - influence the score by a function of a field's numeric value or ordinal< / li >
2007-06-06 23:17:18 -04:00
< li > Date Math - specify dates relative to "NOW" in queries and updates< / li >
2006-02-24 21:51:19 -05:00
< li > Performance Optimizations< / li >
< / ul >
2007-06-06 23:17:18 -04:00
< a name = "N100A5" > < / a > < a name = "Core" > < / a >
2006-02-24 21:51:19 -05:00
< h3 class = "boxed" > Core< / h3 >
< ul >
< li > Pluggable query handlers and extensible XML data format< / li >
< li > Document uniqueness enforcement based on unique key field< / li >
< li > Batches updates and deletes for high performance< / li >
< li > User configurable commands triggered on index changes< / li >
< li > Searcher concurrency control< / li >
< li > Correct handling of numeric types for both sorting and range queries< / li >
< li > Ability to control where docs with the sort field missing will be placed< / li >
2007-06-06 23:17:18 -04:00
< li > "Luke" request handler for corpus information< / li >
2006-02-24 21:51:19 -05:00
< / ul >
2007-06-06 23:17:18 -04:00
< a name = "N100C6" > < / a > < a name = "Caching" > < / a >
2006-02-24 21:51:19 -05:00
< h3 class = "boxed" > Caching< / h3 >
< ul >
< li > Configurable Query Result, Filter, and Document cache instances< / li >
< li > Pluggable Cache implementations< / li >
< li > Cache warming in background
< ul >
< li > When a new searcher is opened, configurable searches are run against
it in order to warm it up to avoid
slow first hits. During warming, the current searcher handles live requests.
< / li >
< / ul >
< / li >
< li > Autowarming in background
< ul >
< li > The most recently accessed items in the caches of the current
searcher are re-populated in the new searcher, enabing high cache hit
rates across index/searcher changes.< / li >
< / ul >
< / li >
< li > Fast/small filter implementation< / li >
< li > User level caching with autowarming support< / li >
< / ul >
2007-06-06 23:17:18 -04:00
< a name = "N100EB" > < / a > < a name = "Replication" > < / a >
2006-02-24 21:51:19 -05:00
< h3 class = "boxed" > Replication< / h3 >
< ul >
< li > Efficient distribution of index parts that have changed via rsync transport< / li >
< li > Pull strategy allows for easy addition of searchers< / li >
< li > Configurable distribution interval allows tradeoff between timeliness and cache utilization< / li >
< / ul >
2007-06-06 23:17:18 -04:00
< a name = "N100FD" > < / a > < a name = "Admin+Interface" > < / a >
2006-02-24 21:51:19 -05:00
< h3 class = "boxed" > Admin Interface< / h3 >
< ul >
< li > Comprehensive statistics on cache utilization, updates, and queries< / li >
< li > Text analysis debugger, showing result of every stage in an analyzer< / li >
< li > Web Query Interface w/ debugging output
< ul >
< li > parsed query output< / li >
< li > Lucene explain() document score detailing< / li >
< li > explain score for documents outside of the requested range to debug why a given document wasn't ranked higher.< / li >
< / ul >
< / li >
< / ul >
< / div >
< / div >
2007-06-06 23:17:18 -04:00
<!-- +
|end content
+-->
2006-02-24 21:51:19 -05:00
< div class = "clearboth" > < / div >
< / div >
< div id = "footer" >
2007-06-06 23:17:18 -04:00
<!-- +
|start bottomstrip
+-->
2006-02-24 21:51:19 -05:00
< div class = "lastmodified" >
< script type = "text/javascript" > < ! - -
2007-06-06 23:17:18 -04:00
document.write("Last Published: " + document.lastModified);
2006-02-24 21:51:19 -05:00
// -->< / script >
< / div >
< div class = "copyright" >
Copyright ©
2007-01-18 00:10:35 -05:00
2007 < a href = "http://www.apache.org/licenses/" > The Apache Software Foundation.< / a >
2006-02-24 21:51:19 -05:00
< / div >
2007-06-06 23:17:18 -04:00
<!-- +
|end bottomstrip
+-->
2006-02-24 21:51:19 -05:00
< / div >
< / body >
< / html >