Mailing list archives

From Michael Coffey <>
Subject Re: indexing to Solr
Date Sat, 17 Dec 2016 21:43:34 GMT
Here is another issue with the official Nutch tutorial.
In the section "Integrate Solr with Nutch" it says to back up the original Solr schema.xml
and replace it with the one from Nutch. It says that the original schema.xml is in the directory
example/solr/collection1/conf, but there is no such directory. When I search for schema.xml,
I get the following.

It's not obvious that any one of these is the right one to use.

From: lewis john mcgibbney <>
To: "" <>
Sent: Monday, November 21, 2016 10:34 AM
 Subject: Re: indexing to Solr
Hi Michael,

On Sat, Nov 19, 2016 at 8:09 AM, <> wrote:

> From: Michael Coffey <>
> To: "" <>
> Cc:
> Date: Fri, 18 Nov 2016 21:15:14 +0000 (UTC)
> Subject: indexing to Solr
> Where can I find up-to-date information on indexing to Solr?
in particular
If you find any issues with this tutorial then please let us know. Thank

> When I search the web, I find tutorials that use the deprecated solrindex
> command. I also find questions where people want to know why it doesn't
> work.

That is because the only official documentation resides at

> I have a good nutch 1.12 installation on a working hadoop cluster and a
> Solr 6.3.0 installation which works for their gettingstarted example.

You should use the specified version of Solr for the Nutch release. This is
Solr 5.4.1 as defined in the indexer-solr plugin ivy.xml

> I have questions like: Do I need to create a core and a collection in solr?

Yes I would. This is explained at

> Do I need http or cloud type server? Do I need solr.zookeeper.url?

This is not a Nutch question; it depends on your preferred Solr configuration.
If you are just starting out then I would say it is not a big deal. Experiment
and go with what works best for your requirements and resources.

> What else needs to be set in nutch-site.xml?

Not much. For reference though, here are the Solr configuration options.

> What about schema?

This is covered in

> Thanks for all the help so far!
No problems. Any more issues, ping us here and we will help.
