Mailing list archives

Site index · List index
Message view « Date » · « Thread »
Top « Date » · « Thread »
From Grandl Robert <rgra...@yahoo.com>
Subject Re: get input size for each task
Date Wed, 09 Jul 2014 03:29:55 GMT
Hitesh,

With respect to the below comment: So a vertex will have a number of tasks, which is decided
strictly based on the input data the vertex has to process ? Also, it is guaranteed that every
task will have same input size ? (all except the last one probably).

Thanks,
Robert


Correct. The hierarchy is dag -> vertex -> task -> task attempt ( each relationship
being a 1:N ).
Vertex
 defines a stage of common processing logic applied on a parallel data 
set. A task represents processing of a subset of the data set.



On Monday, July 7, 2014 10:37 AM, Hitesh Shah <hitesh@apache.org> wrote:
 


Correct. The hierarchy is dag -> vertex -> task -> task attempt ( each relationship
being a 1:N ).
Vertex defines a stage of common processing logic applied on a parallel data set. A task represents
processing of a subset of the data set.

thanks
— Hitesh


On Jul 7, 2014, at 9:40 AM, Grandl Robert <rgrandl@yahoo.com> wrote:

> Another dumb question: A vertex can have multiple tasks(not task attempts), for different
input blocks, right ? So a vertex entity is kind of a stage abstraction, not a task abstraction,
right ?
> 
Mime
View raw message