lucene/docs/demo2.html

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">

<!--
Copyright 1999-2004 The Apache Software Foundation
Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
-->


<!-- Content Stylesheet for Site -->

        
<!-- start the processing -->
    <!-- ====================================================================== -->
    <!-- GENERATED FILE, DO NOT EDIT, EDIT THE XML FILE IN xdocs INSTEAD! -->
    <!-- Main Page Section -->
    <!-- ====================================================================== -->
    <html>
        <head>
            <meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"/>

                                                    <meta name="author" value="Andrew C. Oliver">
            <meta name="email" value="acoliver@apache.org">
            
           
            <title>Apache Lucene - Apache Lucene - Basic Demo Sources Walkthrough</title>
        </head>

        <body bgcolor="#ffffff" text="#000000" link="#525D76">        
            <table border="0" width="100%" cellspacing="0">
                <!-- TOP IMAGE -->
                <tr>
                    <td align="left">
<a href="http://www.apache.org"><img src="http://lucene.apache.org/java/docs/images/asf-logo.gif" width="387" height="100" border="0"/></a>
</td>
<td align="right">
<a href="http://lucene.apache.org/"><img src="./images/lucene_green_300.gif" alt="Apache Lucene" border="0"/></a>
</td>
                </tr>
            </table>
            <table border="0" width="100%" cellspacing="4">
                <tr><td colspan="2">
                    <hr noshade="" size="1"/>
                </td></tr>
                
                <tr>
                    <!-- LEFT SIDE NAVIGATION -->
                    <td width="20%" valign="top" nowrap="true">
                    
    <!-- ============================================================ -->

                <p><strong>About</strong></p>
        <ul>
                    <li>    <a href="./index.html">Overview</a>
</li>
                    <li>    <a href="./features.html">Features</a>
</li>
                    <li>    <a href="http://wiki.apache.org/jakarta-lucene/PoweredBy">Powered by Lucene</a>
</li>
                    <li>    <a href="./whoweare.html">Who We Are</a>
</li>
                    <li>    <a href="./mailinglists.html">Mailing Lists</a>
</li>
                </ul>
            <p><strong>Resources</strong></p>
        <ul>
                    <li>    <a href="http://wiki.apache.org/jakarta-lucene">Wiki</a>
</li>
                    <li>    <a href="http://wiki.apache.org/jakarta-lucene/LuceneFAQ">FAQ</a>
</li>
                    <li>    <a href="./gettingstarted.html">Getting Started</a>
</li>
                    <li>    <a href="./queryparsersyntax.html">Query Syntax</a>
</li>
                    <li>    <a href="./fileformats.html">File Formats</a>
</li>
                    <li>    <a href="./api/index.html">Javadoc</a>
</li>
                    <li>    <a href="./contributions.html">Contributions</a>
</li>
                    <li>    <a href="./benchmarks.html">Benchmarks</a>
</li>
                    <li>    <a href="http://issues.apache.org/jira/browse/LUCENE">Issue Tracker</a>
</li>
                    <li>    <a href="./lucene-sandbox/">Lucene Sandbox</a>
</li>
                </ul>
            <p><strong>Download</strong></p>
        <ul>
                    <li>    <a href="http://www.apache.org/dyn/closer.cgi/jakarta/lucene/binaries/">Binaries</a>
</li>
                    <li>    <a href="http://www.apache.org/dyn/closer.cgi/jakarta/lucene/source/">Source Code</a>
</li>
                    <li>    <a href="http://svn.apache.org/viewcvs.cgi/lucene/">Source Repository</a>
</li>
                </ul>
                        </td>
                    <td width="80%" align="left" valign="top">
                                                                    <table border="0" cellspacing="0" cellpadding="2" width="100%">
      <tr><td bgcolor="#525D76">
        <font color="#ffffff" face="arial,helvetica,sanserif">
          <a name="About the Code"><strong>About the Code</strong></a>
        </font>
      </td></tr>
      <tr><td>
        <blockquote>
                                    <p>
In this section we walk through the sources behind the basic Lucene demo: where to 
find them, what their parts are, and what each part does.  This section is intended for Java developers
wishing to understand how to use Apache Lucene in their applications.
</p>
                            </blockquote>
        </p>
      </td></tr>
      <tr><td><br/></td></tr>
    </table>
                                                <table border="0" cellspacing="0" cellpadding="2" width="100%">
      <tr><td bgcolor="#525D76">
        <font color="#ffffff" face="arial,helvetica,sanserif">
          <a name="Location of the source"><strong>Location of the source</strong></a>
        </font>
      </td></tr>
      <tr><td>
        <blockquote>
                                    <p>
Relative to the directory created when you extracted Lucene or retrieved it from Subversion, you
should see a directory called "src", which in turn contains a directory called "demo".
This is the root for all of the Lucene demos.  Under this directory is org/apache/lucene/demo,
which is where all the Java sources live.  
</p>
                                                <p>
Within this directory you should see the IndexFiles class we executed earlier.  Bring that
up in vi or your preferred text editor and let's take a look at it.
</p>
                            </blockquote>
        </p>
      </td></tr>
      <tr><td><br/></td></tr>
    </table>
                                                <table border="0" cellspacing="0" cellpadding="2" width="100%">
      <tr><td bgcolor="#525D76">
        <font color="#ffffff" face="arial,helvetica,sanserif">
          <a name="IndexFiles"><strong>IndexFiles</strong></a>
        </font>
      </td></tr>
      <tr><td>
        <blockquote>
                                    <p>
As we discussed in the previous walkthrough, the IndexFiles class creates a Lucene Index.
Let's take a look at how it does this.  
</p>
                                                <p>
The first substantial thing the main function does is create an instance
of IndexWriter.  It passes the string "index" and a new instance of a class called
"StandardAnalyzer".  The "index" string is the name of the directory in which all index information
should be stored.  Because we're not passing any path information, this directory should
be created as a subdirectory of the current directory (if it does not already exist). On
some platforms it may actually be created in another directory (such as 
the user's home directory). 
</p>
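<p>
To see where a relative name such as "index" would land on your platform, you can ask Java
to resolve it.  The following is a minimal sketch that uses only java.io.File; the class name
IndexPathSketch and its resolve() helper are made up for illustration and are not part of the
Lucene demo.
</p>

```java
import java.io.File;

// Illustration only: shows how a relative directory name is resolved.
class IndexPathSketch {
    // Returns the absolute path that a relative name resolves to.
    static String resolve(String name) {
        return new File(name).getAbsolutePath();
    }

    public static void main(String[] args) {
        // "user.dir" is the JVM's current working directory.
        System.out.println("working directory: " + System.getProperty("user.dir"));
        System.out.println("\"index\" resolves to: " + resolve("index"));
    }
}
```

<p>
Running this from the directory where you start the demo shows the location you should expect
the index directory to appear in.
</p>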
                                                <p>
The <b>IndexWriter</b> is the main class responsible for creating indices. To use it you
must instantiate it with a path that it can write the index into.  If this path does not 
exist it will create it; otherwise it will refresh the index living at that path.  You 
must also pass an instance of <b>org.apache.lucene.analysis.Analyzer</b>. 
</p>
                                                <p>
The <b>Analyzer</b>, in this case the <b>StandardAnalyzer</b>, is little more than a standard Java
Tokenizer, converting all strings to lowercase and filtering out useless words and characters from the index.
By useless words and characters I mean common language words such as articles (a, an, the, etc.) and other 
strings that would be useless for searching (e.g. <b>'s</b>).  It should be noted that there are different 
rules for every language, and you should use the proper analyzer for each.  Lucene currently 
provides Analyzers for English and German; more can be found in the Lucene Sandbox.
</p>
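<p>
The kind of processing described above can be sketched in plain Java.  This is a toy stand-in
and not how StandardAnalyzer is actually implemented: the class ToyAnalyzer, its analyze() method
and the three-word stop list are assumptions made only for this example.
</p>

```java
import java.util.Locale;
import java.util.StringJoiner;

// Toy stand-in for an analyzer. This is NOT how StandardAnalyzer is implemented.
class ToyAnalyzer {
    // Hypothetical stop-word list, chosen only for this example.
    private static final String[] STOP_WORDS = {"a", "an", "the"};

    private static boolean isStopWord(String token) {
        for (String stop : STOP_WORDS) {
            if (stop.equals(token)) {
                return true;
            }
        }
        return false;
    }

    // Lowercases the text, strips a trailing 's, drops stop words,
    // and returns the surviving terms separated by single spaces.
    static String analyze(String text) {
        StringJoiner terms = new StringJoiner(" ");
        // Split on runs of characters that are not letters, digits or apostrophes.
        for (String raw : text.split("[^\\p{L}\\p{N}']+")) {
            String token = raw.toLowerCase(Locale.ROOT);
            if (token.endsWith("'s")) {
                token = token.substring(0, token.length() - 2);
            }
            if (!token.isEmpty() && !isStopWord(token)) {
                terms.add(token);
            }
        }
        return terms.toString();
    }

    public static void main(String[] args) {
        System.out.println(analyze("The Quick brown fox's den"));
    }
}
```

<p>
For the sample sentence in main() the sketch prints "quick brown fox den": the article is dropped,
everything is lowercased, and the possessive ending is removed.
</p>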
                                                <p>
Looking down further in the file, you should see the indexDocs() code.  This recursive function 
simply crawls the directories and uses FileDocument to create Document objects.  The Document
is simply a data object to represent the content in the file as well as its creation time and 
location.  These instances are added to the IndexWriter.  Take a look inside FileDocument.  It's
not particularly complicated; it just adds fields to the Document.
</p>
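<p>
The recursive crawl can be sketched without any Lucene classes.  In the sketch below the class
name CrawlSketch is hypothetical, and the println call is only a placeholder for the step where the
demo would build a Document and add it to the writer.
</p>

```java
import java.io.File;

// Illustration only: a recursive directory walk in the spirit of indexDocs().
class CrawlSketch {
    // Visits every readable file under 'file' and returns how many were visited.
    static int indexDocs(File file) {
        if (!file.canRead()) {
            return 0;
        }
        if (file.isDirectory()) {
            File[] children = file.listFiles();
            if (children == null) {
                return 0; // listing failed, e.g. because of an I/O error
            }
            int count = 0;
            for (File child : children) {
                count += indexDocs(child); // recurse into files and subdirectories
            }
            return count;
        }
        // Placeholder for "create a Document and add it to the writer".
        // Note that java.io.File exposes the last-modified time, not a creation time.
        System.out.println("adding " + file.getPath()
                + " (last modified " + file.lastModified() + ")");
        return 1;
    }

    public static void main(String[] args) {
        File root = new File(args.length == 0 ? "." : args[0]);
        System.out.println(indexDocs(root) + " file(s) visited");
    }
}
```

<p>
Pass a directory as the first argument to walk it, or run the sketch without arguments to walk the
current directory.
</p>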
                                                <p>
As you can see there isn't much to creating an index.  The devil is in the details.  You may also
wish to examine the other samples in this directory, particularly the IndexHTML class.  It is 
a bit more complex but builds upon this example.
</p>
                            </blockquote>
        </p>
      </td></tr>
      <tr><td><br/></td></tr>
    </table>
                                                <table border="0" cellspacing="0" cellpadding="2" width="100%">
      <tr><td bgcolor="#525D76">
        <font color="#ffffff" face="arial,helvetica,sanserif">
          <a name="Searching Files"><strong>Searching Files</strong></a>
        </font>
      </td></tr>
      <tr><td>
        <blockquote>
                                    <p>
The SearchFiles class is quite simple.  It primarily collaborates with an IndexSearcher, a StandardAnalyzer
(which is used in the IndexFiles class as well) and a QueryParser.  The query parser is constructed
with an analyzer used to interpret your query in the same way the index was interpreted: finding 
the end of words and removing useless words like 'a', 'an' and 'the'.  The Query object contains the 
results from the QueryParser and is passed to the searcher.  The searcher's results are returned in 
a collection of Documents called "Hits", which is then iterated through and displayed to the user.
</p>
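<p>
Why the query has to be analyzed the same way as the indexed text can be shown with a small
plain-Java sketch.  None of the names below come from Lucene: SearchSketch, terms() and matches()
are hypothetical, and requiring every query term to be present is a simplification chosen for this
sketch, not a description of how QueryParser treats multi-word queries.
</p>

```java
import java.util.Arrays;
import java.util.Locale;

// Illustration only: none of these names come from Lucene.
class SearchSketch {
    // Hypothetical stop-word list shared by indexing and searching.
    private static final String[] STOP_WORDS = {"a", "an", "the"};

    // Lowercases the text, splits it into words, and drops stop words.
    static String[] terms(String text) {
        return Arrays.stream(text.toLowerCase(Locale.ROOT).split("\\W+"))
                .filter(word -> !word.isEmpty())
                .filter(word -> !Arrays.asList(STOP_WORDS).contains(word))
                .toArray(String[]::new);
    }

    // True when every analyzed query term occurs among the analyzed document terms.
    static boolean matches(String documentText, String query) {
        return Arrays.asList(terms(documentText))
                .containsAll(Arrays.asList(terms(query)));
    }

    public static void main(String[] args) {
        String document = "The Apache Lucene Demo";
        // The query goes through the same analysis as the document, so it matches.
        System.out.println(matches(document, "the LUCENE demo"));
        // A raw, unanalyzed term is not among the stored lowercase terms.
        System.out.println(Arrays.asList(terms(document)).contains("Lucene"));
    }
}
```

<p>
The first line printed is true and the second is false: once the indexed text has been lowercased
and stripped of stop words, only a query that has been through the same steps will line up with it.
</p>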
                            </blockquote>
        </p>
      </td></tr>
      <tr><td><br/></td></tr>
    </table>
                                                <table border="0" cellspacing="0" cellpadding="2" width="100%">
      <tr><td bgcolor="#525D76">
        <font color="#ffffff" face="arial,helvetica,sanserif">
          <a name="The Web example..."><strong>The Web example...</strong></a>
        </font>
      </td></tr>
      <tr><td>
        <blockquote>
                                    <p>
<a href="demo3.html">read on&gt;&gt;&gt;</a>
</p>
                            </blockquote>
        </p>
      </td></tr>
      <tr><td><br/></td></tr>
    </table>
                                        </td>
                </tr>

                <!-- FOOTER -->
                <tr><td colspan="2">
                    <hr noshade="" size="1"/>
                </td></tr>
                <tr><td colspan="2">
                    <div align="center"><font color="#525D76" size="-1"><em>
                    Copyright &#169; 1999-2005, The Apache Software Foundation
                    </em></font></div>
                </td></tr>
            </table>
        </body>
    </html>
<!-- end the processing -->