lucene/solr/contrib/map-reduce
Mark Robert Miller 9441f0d4ce SOLR-5734: We should use System.nanoTime rather than System.currentTimeMillis when calculating elapsed time.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1573425 13f79535-47bb-0310-9956-ffa450edef68
2014-03-03 02:08:45 +00:00
src SOLR-5734: We should use System.nanoTime rather than System.currentTimeMillis when calculating elapsed time. 2014-03-03 02:08:45 +00:00
README.txt SOLR-1301: Add some readme files. 2014-02-14 14:39:00 +00:00
build.xml SOLR-5764: Fix recently added tests to not use absolute paths to load test-files, use SolrTestCaseJ4.getFile() and getResource() instead; fix morphlines/map-reduce to not duplicate test resources and fix dependencies among them. 2014-02-22 20:02:13 +00:00
ivy.xml SOLR-1301: Update to Kite 0.10 from CDK 0.9 2013-12-19 17:51:39 +00:00

README.txt

Apache Solr MapReduce

*Experimental* - This contrib is subject to change in ways that may break
backward compatibility.

The Solr MapReduce contrib provides a MapReduce job that allows you to build
Solr indexes and optionally merge them into a live Solr cluster.

Example:

# Build an index with map-reduce and deploy it to SolrCloud

source $solr_distrib/example/scripts/map-reduce/set-map-reduce-classpath.sh

$hadoop_distrib/bin/hadoop --config $hadoop_conf_dir jar \
$solr_distrib/dist/solr-map-reduce-*.jar -D 'mapred.child.java.opts=-Xmx500m' \
-libjars "$HADOOP_LIBJAR" --morphline-file readAvroContainer.conf \
--zk-host 127.0.0.1:9983 --output-dir hdfs://127.0.0.1:8020/outdir \
--collection $collection --log4j log4j.properties --go-live \
--verbose "hdfs://127.0.0.1:8020/indir"
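
The command above expands four shell variables that this README does not
define: $solr_distrib, $hadoop_distrib, $hadoop_conf_dir and $collection.
($HADOOP_LIBJAR is presumably exported by the sourced classpath script; that is
an assumption, not something stated here.) A minimal sketch of a pre-flight
check that could run before the job is submitted follows. The helper name
missing_vars and all of the values assigned below are hypothetical and should
be replaced with the paths and collection name of your own installation.

```shell
# Print the name of every variable the example command needs that is unset
# or empty. Returns 0 only when all of them are set.
missing_vars() {
  rc=0
  for name in solr_distrib hadoop_distrib hadoop_conf_dir collection; do
    # Look up the variable whose name is stored in $name (POSIX-compatible).
    eval "value=\${$name:-}"
    if [ -z "$value" ]; then
      echo "$name"
      rc=1
    fi
  done
  return $rc
}

# Hypothetical values, for illustration only.
solr_distrib=/opt/solr
hadoop_distrib=/opt/hadoop
hadoop_conf_dir=/etc/hadoop/conf
collection=collection1

if missing_vars; then
  echo "all variables set"
fi
```

If any of the names is printed, set that variable before sourcing the classpath
script and running the hadoop command.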