From Michael Coffey <>
Subject Re: indexing to Solr
Date Sat, 17 Dec 2016 21:18:22 GMT
Here is an issue with the official Nutch tutorial.
In the "setup Solr for Search" section, it gives the following instructions:
* download binary file from here
* unzip to $HOME/apache-solr, we will now refer to this as ${APACHE_SOLR_HOME}
* cd ${APACHE_SOLR_HOME}/example
* java -jar start.jar

Unfortunately, there is no start.jar in the example directory. When I instead try to use
the start.jar in the server directory, Java says "WARNING: Nothing to start, exiting ..."
You need something like the following to start Solr:
$APACHE_SOLR_HOME/bin/solr start -e cloud
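
A minimal sketch of the check described above, assuming a Solr 5.x layout where the launcher script lives at bin/solr and an older layout where example/start.jar exists. The helper name solr_start_cmd is hypothetical. It only prints the command it would suggest, so it can be tried against any directory without starting anything.

```shell
# Hypothetical helper: print a plausible start command for a Solr install.
# Assumes bin/solr in 5.x-style layouts and example/start.jar in older ones.
solr_start_cmd() {
  solr_home="$1"
  if [ -x "$solr_home/bin/solr" ]; then
    # Launcher script present: suggest the command from this message.
    printf '%s\n' "$solr_home/bin/solr start -e cloud"
  elif [ -f "$solr_home/example/start.jar" ]; then
    # Older layout: the tutorial's instructions would apply.
    printf '%s\n' "cd $solr_home/example && java -jar start.jar"
  else
    printf '%s\n' "no known Solr launcher under $solr_home" >&2
    return 1
  fi
}
```

Running `solr_start_cmd "$APACHE_SOLR_HOME"` shows which of the two forms applies to the unpacked download.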

In this case, I am using Solr 5.4.1.
Also, as mentioned previously, the tutorial says nothing about which version of Solr to use.

      From: lewis john mcgibbney <>
 To: "" <> 
 Sent: Monday, November 21, 2016 10:34 AM
 Subject: Re: indexing to Solr
Hi Michael,

On Sat, Nov 19, 2016 at 8:09 AM, <> wrote:

> From: Michael Coffey <>
> To: "" <>
> Cc:
> Date: Fri, 18 Nov 2016 21:15:14 +0000 (UTC)
> Subject: indexing to Solr
> Where can I find up-to-date information on indexing to Solr?
in particular
If you find any issues with this tutorial then please let us know. Thank

> When I search the web, I find tutorials that use the deprecated solrindex
> command. I also find questions where people want to know why it doesn't
> work.

That is because the only official documentation resides at

> I have a good nutch 1.12 installation on a working hadoop cluster and a
> Solr 6.3.0 installation which works for their gettingstarted example.

You should use the version of Solr specified for the Nutch release. This is
Solr 5.4.1, as defined in the indexer-solr plugin's ivy.xml.
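
One rough way to read that version out of a source checkout is sketched here. It assumes the dependency is declared under the artifact name solr-solrj with name and rev on the same line, and that the file sits at src/plugin/indexer-solr/ivy.xml. Both are assumptions to verify against your copy, and the helper name solrj_rev is hypothetical.

```shell
# Hypothetical helper: print the rev attribute of the solr-solrj dependency
# in the given ivy.xml (assumes name= and rev= appear on the same line).
solrj_rev() {
  sed -n 's/.*name="solr-solrj"[^>]*rev="\([^"]*\)".*/\1/p' "$1"
}

# Example call (path is an assumption about a Nutch 1.x source tree):
# solrj_rev src/plugin/indexer-solr/ivy.xml
```

If the function prints nothing, the dependency is probably declared differently in that release and the file is worth opening by hand.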

> I have questions like:
> Do I need to create a core and a collection in Solr?

Yes I would. This is explained at

> Do I need http or cloud type server?
> Do I need solr.zookeeper.url?

This is not a Nutch question. This is your preferred Solr configuration. If
you are just starting out then I would say it is not a big deal...
experiment and go with what works best for your requirements and resources.

> What else needs to be set in nutch-site.xml?

Not much. For reference though, here are the Solr configuration options.
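
As a rough sketch of supplying the Solr URL at index time instead of in nutch-site.xml, assuming the solr.server.url property and the bin/nutch index command of Nutch 1.x. The helper name nutch_index_cmd, the core name in the example URL, and the crawl directory layout are hypothetical. The function only prints the command so it can be reviewed before running.

```shell
# Hypothetical helper: print (not run) a Nutch 1.x index command.
# $1 = Solr URL, $2 = crawl directory holding crawldb, linkdb and segments.
nutch_index_cmd() {
  printf '%s\n' "bin/nutch index -Dsolr.server.url=$1 $2/crawldb -linkdb $2/linkdb -dir $2/segments"
}

# Example call (URL and directory are placeholders):
# nutch_index_cmd http://localhost:8983/solr/nutch crawl
```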

> What about schema?

This is covered in

> Thanks for all the help so far!
No problems. Any more issues, ping us here and we will help.
