Mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Stephen Sprague <>
Subject Tez GC issues perhaps? not sure.
Date Wed, 14 Dec 2016 00:43:28 GMT
hey guys,
gotta slightly weird issue here.   Tez runs great. :)  client completes in
a short amount time (5 minutes) but - and here's the gotcha -  the tez
server side process takes upwards of an hour to clear out of the RM.

This is a problem for us since the queue it's in has maxRunning set to 15
and these jobs are just squatting holding slots.

The thing is... why?  i'm wondering if it isn't some kind for GC going on
but sure how to diagnose.  i can logon to a DN and cat stderr but its not
particularly useful to me but i can pass it along if desired.

Here's a screenshot of the "squatters":

[image: Inline image 1]

all have one container that the histogram shows 100%. And the client has
completed an hour ago! that's the part i don't get.

Any other output and/or configs to pass along?  Tez v0.8.4, hive v2.1.0.

Much appreciated,

  • Unnamed multipart/related (inline, None, 0 bytes)
View raw message