Commit Graph

24 Commits

Author SHA1 Message Date
cmarschner 2bd4df4bcb added with -kb option
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150800 13f79535-47bb-0310-9956-ffa450edef68
2002-06-30 18:05:15 +00:00
cmarschner 407fadc3bb was empty
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150799 13f79535-47bb-0310-9956-ffa450edef68
2002-06-30 15:12:54 +00:00
cmarschner c7e103618b .doc doesn't seem to work. let's try rtf
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150798 13f79535-47bb-0310-9956-ffa450edef68
2002-06-30 15:12:04 +00:00
Otis Gospodnetic 1a0a676354 - Added oro.jar property.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150797 13f79535-47bb-0310-9956-ffa450edef68
2002-06-30 14:58:49 +00:00
Otis Gospodnetic 05d752d877 - Fixed Usage text.
- Added oro.jar property.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150796 13f79535-47bb-0310-9956-ffa450edef68
2002-06-30 14:58:27 +00:00
Otis Gospodnetic 6af8f0a0c2 - Initial checkin.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150793 13f79535-47bb-0310-9956-ffa450edef68
2002-06-18 22:30:17 +00:00
Otis Gospodnetic a2ec870f44 - Renamed the assert(boolean) method to affirm(boolean) to avoid warnings about assert being a reserved word (JDK 1.4).
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150792 13f79535-47bb-0310-9956-ffa450edef68
2002-06-18 21:57:00 +00:00
Otis Gospodnetic e6aeb1c4d8 - This is not needed any more.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150791 13f79535-47bb-0310-9956-ffa450edef68
2002-06-18 21:55:49 +00:00
cmarschner 138b3d3b1f not much changed
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150790 13f79535-47bb-0310-9956-ffa450edef68
2002-06-18 11:42:54 +00:00
cmarschner bea35900d5 see file
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150789 13f79535-47bb-0310-9956-ffa450edef68
2002-06-18 11:39:51 +00:00
cmarschner 91b3058b38 added LuceneStorage
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150788 13f79535-47bb-0310-9956-ffa450edef68
2002-06-18 00:49:57 +00:00
cmarschner be12c931f4 lucene.jar is now necessary for building lucene storage
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150787 13f79535-47bb-0310-9956-ffa450edef68
2002-06-18 00:47:39 +00:00
cmarschner 42c33097b3 changed web doc. to field/value pairs
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150786 13f79535-47bb-0310-9956-ffa450edef68
2002-06-18 00:46:35 +00:00
cmarschner 8790e328db added experimental version of LuceneStorage
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150785 13f79535-47bb-0310-9956-ffa450edef68
2002-06-18 00:45:10 +00:00
cmarschner bdd627d35c added experimental version of a LuceneStorage
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150784 13f79535-47bb-0310-9956-ffa450edef68
2002-06-18 00:44:22 +00:00
cmarschner 8ed5fa47fd added URLNormalizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150783 13f79535-47bb-0310-9956-ffa450edef68
2002-06-17 14:16:12 +00:00
cmarschner 8e18fa1cb0 moved HostInfo/HostManager to larm.net package; added URLNormalizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150782 13f79535-47bb-0310-9956-ffa450edef68
2002-06-17 14:00:13 +00:00
cmarschner 5b90c10cb5 added URLNormalizer. Changed filters to use normalized URLs if possible
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150781 13f79535-47bb-0310-9956-ffa450edef68
2002-06-17 13:59:28 +00:00
cmarschner 14fdfb458f removed bug: doc is saved under new URL if 301/302 error occurred
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150780 13f79535-47bb-0310-9956-ffa450edef68
2002-06-17 13:58:33 +00:00
cmarschner 6fd283db33 added storage pipeline; some fixes on Tokenizer
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150778 13f79535-47bb-0310-9956-ffa450edef68
2002-06-01 18:55:16 +00:00
cmarschner 7a3b5acf37 added license info; added anchor text extraction
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150775 13f79535-47bb-0310-9956-ffa450edef68
2002-05-22 23:09:22 +00:00
cmarschner e16a9e73df added documentation
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150772 13f79535-47bb-0310-9956-ffa450edef68
2002-05-13 21:26:09 +00:00
Otis Gospodnetic f4e2c2bbbb - A README for LARM webcrawler contribution.
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150752 13f79535-47bb-0310-9956-ffa450edef68
2002-05-04 14:32:24 +00:00
Otis Gospodnetic cf2fa142c8 Initial revision
git-svn-id: https://svn.apache.org/repos/asf/lucene/java/trunk@150751 13f79535-47bb-0310-9956-ffa450edef68
2002-05-04 13:58:45 +00:00