lucene/solr/contrib/map-reduce
Mark Robert Miller 84adc8604d SOLR-7289: Tests should not ignore all leaking threads and instead just ignore the known leaking threads.
git-svn-id: https://svn.apache.org/repos/asf/lucene/dev/trunk@1668771 13f79535-47bb-0310-9956-ffa450edef68
2015-03-24 02:16:13 +00:00
..
src SOLR-7289: Tests should not ignore all leaking threads and instead just ignore the known leaking threads. 2015-03-24 02:16:13 +00:00
README.txt SOLR-1301: Add some readme files. 2014-02-14 14:39:00 +00:00
build.xml LUCENE-5593: speed up javadocs generation in build 2014-07-28 15:36:27 +00:00
ivy.xml LUCENE-6007: Regularize ivy.xml files to use configurations that map to remote master configurations, so that Ivy won't try to download extraneous crap 2014-10-16 20:13:48 +00:00

README.txt

Apache Solr MapReduce

*Experimental* - This contrib is currently subject to change in ways that may 
break back compatibility.

The Solr MapReduce contrib provides an a mapreduce job that allows you to build
Solr indexes and optionally merge them into a live Solr cluster.

Example:

# Build an index with map-reduce and deploy it to SolrCloud

source $solr_distrib/example/scripts/map-reduce/set-map-reduce-classpath.sh

$hadoop_distrib/bin/hadoop --config $hadoop_conf_dir jar \
$solr_distrib/dist/solr-map-reduce-*.jar -D 'mapred.child.java.opts=-Xmx500m' \
-libjars "$HADOOP_LIBJAR" --morphline-file readAvroContainer.conf \
--zk-host 127.0.0.1:9983 --output-dir hdfs://127.0.0.1:8020/outdir \
--collection $collection --log4j log4j.properties --go-live \
--verbose "hdfs://127.0.0.1:8020/indir"