2008-08-15 04:49:35 -04:00
|
|
|
Apache Solr - DataImportHandler Version 1.3-dev
|
|
|
|
Release Notes
|
|
|
|
|
|
|
|
Introduction
|
|
|
|
------------
|
|
|
|
DataImportHandler is a data import tool for Solr which makes importing data from Databases, XML files and
|
|
|
|
HTTP data sources quick and easy.
|
|
|
|
|
|
|
|
|
|
|
|
$Id$
|
|
|
|
|
2008-09-20 10:48:54 -04:00
|
|
|
================== Release 1.4-dev ==================
|
|
|
|
Upgrading from Solr 1.3
|
|
|
|
-----------------------
|
|
|
|
|
|
|
|
Detailed Change List
|
|
|
|
----------------------
|
|
|
|
|
|
|
|
New Features
|
|
|
|
----------------------
|
|
|
|
1. SOLR-768: Set last_index_time variable in full-import command.
|
|
|
|
(Wojtek Piaseczny, Noble Paul via shalin)
|
|
|
|
|
2008-10-21 07:57:56 -04:00
|
|
|
2. SOLR-811: Allow a "deltaImportQuery" attribute in SqlEntityProcessor which is used for delta imports
|
|
|
|
instead of DataImportHandler manipulating the SQL itself.
|
|
|
|
(Noble Paul via shalin)
|
|
|
|
|
2008-11-12 04:51:12 -05:00
|
|
|
3. SOLR-842: Better error handling in DataImportHandler with options to abort, skip and continue imports.
|
|
|
|
(Noble Paul, shalin)
|
|
|
|
|
2008-11-12 05:29:49 -05:00
|
|
|
4. SOLR-833: A DataSource to read data from a field as a reader. This can be used, for example, to read XMLs
|
|
|
|
residing as CLOBs or BLOBs in databases.
|
|
|
|
(Noble Paul via shalin)
|
|
|
|
|
2008-12-04 14:50:43 -05:00
|
|
|
5. SOLR-887: A Transformer to strip HTML tags.
|
|
|
|
(Ahmed Hammad via shalin)
|
|
|
|
|
2008-12-11 04:05:39 -05:00
|
|
|
6. SOLR-886: DataImportHandler should rollback when an import fails or it is aborted
|
|
|
|
(shalin)
|
2008-12-11 03:31:26 -05:00
|
|
|
|
2008-12-12 02:02:09 -05:00
|
|
|
7. SOLR-891: A Transformer to read strings from Clob type.
|
|
|
|
(Noble Paul via shalin)
|
2008-12-13 12:38:00 -05:00
|
|
|
|
|
|
|
8. SOLR-812: Configurable JDBC settings in JdbcDataSource including optimized defaults for read only mode.
|
|
|
|
(David Smiley, Glen Newton, shalin)
|
2008-12-12 02:02:09 -05:00
|
|
|
|
2008-12-14 13:30:38 -05:00
|
|
|
9. SOLR-910: Add a few utility commands to the DIH admin page such as full import, delta import, status, reload config.
|
2009-01-08 07:52:16 -05:00
|
|
|
(Ahmed Hammad via shalin)
|
|
|
|
|
|
|
|
10.SOLR-938: Add event listener API for import start and end.
|
|
|
|
(Kay Kay, Noble Paul via shalin)
|
2008-12-14 13:30:38 -05:00
|
|
|
|
2009-01-25 13:05:41 -05:00
|
|
|
11.SOLR-801: Add support for configurable pre-import and post-import delete query per root-entity.
|
|
|
|
(Noble Paul via shalin)
|
|
|
|
|
2009-01-27 02:21:52 -05:00
|
|
|
12.SOLR-988: Add a new scope for session data stored in Context to store objects across imports.
|
|
|
|
(Noble Paul via shalin)
|
|
|
|
|
2009-01-28 03:30:02 -05:00
|
|
|
13.SOLR-980: A PlainTextEntityProcessor which can read from any DataSource<Reader> and output a String.
|
|
|
|
(Nathan Adams, Noble Paul via shalin)
|
|
|
|
|
2008-09-20 10:48:54 -04:00
|
|
|
Optimizations
|
|
|
|
----------------------
|
2008-12-11 04:05:39 -05:00
|
|
|
1. SOLR-846: Reduce memory consumption during delta import by removing keys when used
|
|
|
|
(Ricky Leung, Noble Paul via shalin)
|
2008-09-20 10:48:54 -04:00
|
|
|
|
2009-01-27 02:49:22 -05:00
|
|
|
2. SOLR-974: DataImportHandler skips commit if no data has been updated.
|
|
|
|
(Wojtek Piaseczny, shalin)
|
|
|
|
|
2008-09-20 10:48:54 -04:00
|
|
|
Bug Fixes
|
|
|
|
----------------------
|
2008-10-14 03:51:43 -04:00
|
|
|
1. SOLR-800: Deep copy collections to avoid ConcurrentModificationException in XPathEntityprocessor while streaming
|
|
|
|
(Kyle Morrison, Noble Paul via shalin)
|
2008-09-20 10:48:54 -04:00
|
|
|
|
2008-10-23 02:16:45 -04:00
|
|
|
2. SOLR-823: Request parameter variables ${dataimporter.request.xxx} are not resolved
|
|
|
|
(Mck SembWever, Noble Paul, shalin)
|
|
|
|
|
2008-10-23 02:54:56 -04:00
|
|
|
3. SOLR-728: Add synchronization to avoid race condition of multiple imports working concurrently
|
|
|
|
(Walter Ferrara, shalin)
|
|
|
|
|
2008-10-29 14:05:26 -04:00
|
|
|
4. SOLR-742: Add ability to create dynamic fields with custom DataImportHandler transformers
|
|
|
|
(Wojtek Piaseczny, Noble Paul, shalin)
|
|
|
|
|
2008-10-31 01:32:03 -04:00
|
|
|
5. SOLR-832: Rows parameter is not honored in non-debug mode and can abort a running import in debug mode.
|
|
|
|
(Akshay Ukey, shalin)
|
|
|
|
|
2008-11-07 00:51:41 -05:00
|
|
|
6. SOLR-838: The VariableResolver obtained from a DataSource's context does not have current data.
|
|
|
|
(Noble Paul via shalin)
|
|
|
|
|
2008-11-17 03:26:29 -05:00
|
|
|
7. SOLR-864: DataImportHandler does not catch and log Errors (shalin)
|
|
|
|
|
2008-11-20 02:19:41 -05:00
|
|
|
8. SOLR-873: Fix case-sensitive field names and columns (Jon Baer, shalin)
|
|
|
|
|
2008-12-05 14:14:11 -05:00
|
|
|
9. SOLR-893: Unable to delete documents via SQL and deletedPkQuery with deltaimport
|
|
|
|
(Dan Rosher via shalin)
|
|
|
|
|
2008-12-11 03:34:08 -05:00
|
|
|
10. SOLR-888: DateFormatTransformer cannot convert non-string type
|
|
|
|
(Amit Nithian via shalin)
|
|
|
|
|
2008-12-11 04:39:31 -05:00
|
|
|
11. SOLR-841: DataImportHandler should throw exception if a field does not have column attribute
|
|
|
|
(Michael Henson, shalin)
|
|
|
|
|
2008-12-11 08:43:23 -05:00
|
|
|
12. SOLR-884: CachedSqlEntityProcessor should check if the cache key is present in the query results
|
|
|
|
(Noble Paul via shalin)
|
|
|
|
|
2009-01-26 13:00:06 -05:00
|
|
|
13. SOLR-985: Fix thread-safety issue with TemplateString for concurrent imports with multiple cores.
|
|
|
|
(Ryuuichi Kumai via shalin)
|
|
|
|
|
2009-02-02 06:30:18 -05:00
|
|
|
14. SOLR-999: XPathRecordReader fails on XMLs with nodes mixed with CDATA content.
|
|
|
|
(Fergus McMenemie, Noble Paul via shalin)
|
|
|
|
|
2008-09-20 10:48:54 -04:00
|
|
|
Documentation
|
|
|
|
----------------------
|
|
|
|
|
2008-10-21 03:05:39 -04:00
|
|
|
Other
|
|
|
|
----------------------
|
2008-12-30 01:42:48 -05:00
|
|
|
1. SOLR-782: Refactored SolrWriter to make it a concrete class and removed wrappers over SolrInputDocument.
|
|
|
|
Refactored to load Evaluators lazily. Removed multiple document nodes in the configuration xml.
|
|
|
|
Removed support for 'default' variables, they are automatically available as request parameters.
|
2008-10-21 03:05:39 -04:00
|
|
|
(Noble Paul via shalin)
|
2008-09-20 10:48:54 -04:00
|
|
|
|
2009-01-22 07:12:24 -05:00
|
|
|
2. SOLR-964: XPathEntityProcessor now ignores DTD validations
|
|
|
|
(Fergus McMenemie, Noble Paul via shalin)
|
|
|
|
|
2008-09-20 10:48:54 -04:00
|
|
|
================== Release 1.3.0 20080915 ==================
|
2008-08-15 04:49:35 -04:00
|
|
|
|
|
|
|
Status
|
|
|
|
------
|
|
|
|
This is the first release since DataImportHandler was added to the contrib solr distribution.
|
|
|
|
The following changes list changes since the code was introduced, not since
|
|
|
|
the first official release.
|
|
|
|
|
|
|
|
|
|
|
|
Detailed Change List
|
|
|
|
--------------------
|
|
|
|
|
|
|
|
New Features
|
|
|
|
1. SOLR-700: Allow configurable locales through a locale attribute in fields for NumberFormatTransformer.
|
|
|
|
(Stefan Oestreicher, shalin)
|
|
|
|
|
|
|
|
Changes in runtime behavior
|
|
|
|
|
|
|
|
Bug Fixes
|
2008-08-15 07:53:34 -04:00
|
|
|
1. SOLR-704: NumberFormatTransformer can silently ignore part of the string while parsing. Now it tries to
|
|
|
|
use the complete string for parsing. Failure to do so will result in an exception.
|
|
|
|
(Stefan Oestreicher via shalin)
|
2008-08-15 04:49:35 -04:00
|
|
|
|
2008-08-28 12:04:45 -04:00
|
|
|
2. SOLR-729: Context.getDataSource(String) gives current entity's DataSource instance regardless of argument.
|
|
|
|
(Noble Paul, shalin)
|
|
|
|
|
2008-08-29 03:00:48 -04:00
|
|
|
3. SOLR-726: Jdbc Drivers and DataSources fail to load if placed in multicore sharedLib or core's lib directory.
|
2008-08-29 03:03:13 -04:00
|
|
|
(Walter Ferrara, Noble Paul, shalin)
|
2008-08-29 03:00:48 -04:00
|
|
|
|
2008-08-15 04:49:35 -04:00
|
|
|
Other Changes
|
|
|
|
|
|
|
|
|