2002-05-22 19:09:22 -04:00
|
|
|
$id: $
|
|
|
|
|
2002-06-01 14:55:16 -04:00
|
|
|
2002-06-01 (cmarschner)
|
|
|
|
* divided Storage into LinkStorage and DocumentStorage
|
|
|
|
* introduced StoragePipeline, made MessageHandler a LinkStorage. Fetcher now stores everything in storages
|
|
|
|
* removed a couple of unused classes
|
|
|
|
now everything's prepared for a LuceneStorage
|
|
|
|
* added build.xml by Mehran Mehr
|
|
|
|
|
2002-05-22 19:09:22 -04:00
|
|
|
2002-05-23 (cmarschner)
|
|
|
|
* removed 0x0d0d from the source files (Otis?)
|
|
|
|
* included Apache License into all of the source files in de.lanlab.larm.* directories
|
|
|
|
* added anchor text deparsing to the Tokenizer
|
|
|
|
* split store.log in two files:
|
|
|
|
- store.log contains the page file index: <referer> <URL> <ResultCode> <MimeType> <Size> <Title> <PageFileNo> <PageFileOffset>
|
|
|
|
- links.log contains link information: <referer> <URL> <isFrame> <AnchorText>
|
|
|
|
* changed lib to libs in the startup scripts
|
|
|
|
* added .bat files for Windows
|