Mailing list archives

From KRIS MUSSHORN <mussho...@comcast.net>
Subject Re: nutch/Solr/tika
Date Mon, 19 Dec 2016 18:58:03 GMT

Any thoughts on how to get this right?

Is processing the docs in Solr after indexing with Nutch the right option?

----- Original Message -----

From: "KRIS MUSSHORN" <musshorns@comcast.net> 
To: user@nutch.apache.org 
Sent: Tuesday, December 13, 2016 10:31:42 AM 
Subject: nutch/Solr/tika 

All, 

Using Nutch 1.12 to index into Solr 5.4.1, with the metatag plugin to get metadata into Solr.

Some of the content is PDF. 

metatag.date is coming in as multi-valued, with the same value repeated, i.e.:

"metatag.date": ["2012-06-28T18:28:27Z", "2012-06-28T18:28:27Z"]
where doc properties are: 


How do I get a single metatag.date value? 
TIA 
Kris 
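
For illustration, the result asked for above can be sketched as a small post-processing step. This is only a sketch under assumptions: it treats the indexed document as a plain Python dict, and the helper name collapse_repeated_values is hypothetical (it is not part of Nutch, Solr, or Tika). It shows the intended transformation, not where in the pipeline it should run.

```python
def collapse_repeated_values(doc, field):
    """Return a copy of doc in which `field` is a single value
    when all of its values are identical.

    Hypothetical helper for illustration; not a Nutch or Solr API.
    """
    result = dict(doc)
    values = doc.get(field)
    if not isinstance(values, list):
        # Field is missing or already single-valued: nothing to do.
        return result
    # dict.fromkeys removes duplicates while keeping first-seen order.
    unique = list(dict.fromkeys(values))
    # Collapse to a scalar only when exactly one distinct value remains.
    result[field] = unique[0] if len(unique) == 1 else unique
    return result


doc = {"metatag.date": ["2012-06-28T18:28:27Z", "2012-06-28T18:28:27Z"]}
print(collapse_repeated_values(doc, "metatag.date"))
```

If the values genuinely differ, the sketch leaves the field multi-valued rather than guessing which one to keep.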

