Mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Markus Jelsma <markus.jel...@openindex.io>
Subject RE: Solr not showing metadata of a url
Date Wed, 04 Jan 2017 20:47:30 GMT
Hello - parsechecker only shows data emitted by the parser, not what is being sent to the index.
Use indexchecker instead, and probably index-metadata to add metadata fields, if that is available
in 2.x.

Markus

 
 
-----Original message-----
> From:Ruchika Jain <webassistant7@outlook.com>
> Sent: Wednesday 28th December 2016 10:53
> To: user@nutch.apache.org
> Subject: Solr not showing metadata of a url
> 
> 
> Hi All,
> 
> 
> 
> I've downloaded latest code from  nutch 2.3.1 branch and try to index meta data to Solr
but Solr query results are not showing meta data.
> 
> But , parsechecker working fine . Do I need to do any additional configurations to get
meta data on solr query results.
> 
> $ ./bin/nutch parsechecker http://nutch.apache.org/
> fetching: http://nutch.apache.org/
> parsing: http://nutch.apache.org/
> contentType: text/html
> signature: b2bb805dcd51f12784190d58d619f0bc
> ---------
> Url
> ---------------
> http://nutch.apache.org/
> 
> [http://nutch.apache.org/assets/img/examples/all_systems_go_ahart.jpg]<http://nutch.apache.org/>
> 
> Apache Nutch(tm)<http://nutch.apache.org/>
> nutch.apache.org
> The Apache Nutch PMC are pleased to announce the immediate release of Apache Nutch v1.12,
we advise all current users and developers of the 1.X series to upgrade to ...
> 
> 
> ---------
> 
> 
> Command I'm using to crawl and Index is ,
> bin/crawl urls/seed.txt TestCrawl3.1 http://localhost:8983/solr/ 2
> 
> I've not done much configuration changes, I've configure nutch-sites.xml and gora.properties
to use hbase & gora
> 
> Appreciate if anyone can help me to identify the missing configurations.
> 
> 
> 
> 
> Thanks & Regards
> Ruchika Jain
> 
> 

Mime
View raw message