Re: What's the best practice to determine whether a job has finished or not?

classic Classic list List threaded Threaded
3 messages Options
Reply | Threaded
Open this post in threaded view
|

Re: What's the best practice to determine whether a job has finished or not?

Kurt Young
+dev <[hidden email]>

Best,
Kurt


On Fri, May 8, 2020 at 3:35 PM Caizhi Weng <[hidden email]> wrote:

> Hi Jeff,
>
> Thanks for the response. However I'm using executeAsync so that I can run
> the job asynchronously and get a JobClient to monitor the job. JobListener
> only works for synchronous execute method. Is there other way to achieve
> this?
>
> Jeff Zhang <[hidden email]> 于2020年5月8日周五 下午3:29写道:
>
>> I use JobListener#onJobExecuted to be notified that the flink job is
>> done.
>> It is pretty reliable for me, the only exception is the client process is
>> down.
>>
>> BTW, the reason you see ApplicationNotFound exception is that yarn app
>> is terminated which means the flink cluster is shutdown. While for
>> standalone mode, the flink cluster is always up.
>>
>>
>> Caizhi Weng <[hidden email]> 于2020年5月8日周五 下午2:47写道:
>>
>>> Hi dear Flink community,
>>>
>>> I would like to determine whether a job has finished (no matter
>>> successfully or exceptionally) in my code.
>>>
>>> I used to think that JobClient#getJobStatus is a good idea, but I found
>>> that it behaves quite differently under different executing environments.
>>> For example, under a standalone session cluster it will return the FINISHED
>>> status for a finished job, while under a yarn per job cluster it will throw
>>> a ApplicationNotFound exception. I'm afraid that there might be other
>>> behaviors for other environments.
>>>
>>> So what's the best practice to determine whether a job has finished or
>>> not? Note that I'm not waiting for the job to finish. If the job hasn't
>>> finished I would like to know it and do something else.
>>>
>>
>>
>> --
>> Best Regards
>>
>> Jeff Zhang
>>
>
Reply | Threaded
Open this post in threaded view
|

Re: What's the best practice to determine whether a job has finished or not?

Aljoscha Krettek-2
Hi,

The problem is that the JobClient is talking to the wrong system. In YARN per-job mode the cluster will only run as long as the job runs so there will be no-one there to respond with the job status after the job is finished.

I think the solution is that the JobClient should talk to the right system, in the YARN example it should talk directly to YARN for figuring out the job status (and maybe for other things as well).

Best,
Aljoscha

On Fri, May 8, 2020, at 15:05, Kurt Young wrote:

> +dev <[hidden email]>
>
> Best,
> Kurt
>
>
> On Fri, May 8, 2020 at 3:35 PM Caizhi Weng <[hidden email]> wrote:
>
> > Hi Jeff,
> >
> > Thanks for the response. However I'm using executeAsync so that I can run
> > the job asynchronously and get a JobClient to monitor the job. JobListener
> > only works for synchronous execute method. Is there other way to achieve
> > this?
> >
> > Jeff Zhang <[hidden email]> 于2020年5月8日周五 下午3:29写道:
> >
> >> I use JobListener#onJobExecuted to be notified that the flink job is
> >> done.
> >> It is pretty reliable for me, the only exception is the client process is
> >> down.
> >>
> >> BTW, the reason you see ApplicationNotFound exception is that yarn app
> >> is terminated which means the flink cluster is shutdown. While for
> >> standalone mode, the flink cluster is always up.
> >>
> >>
> >> Caizhi Weng <[hidden email]> 于2020年5月8日周五 下午2:47写道:
> >>
> >>> Hi dear Flink community,
> >>>
> >>> I would like to determine whether a job has finished (no matter
> >>> successfully or exceptionally) in my code.
> >>>
> >>> I used to think that JobClient#getJobStatus is a good idea, but I found
> >>> that it behaves quite differently under different executing environments.
> >>> For example, under a standalone session cluster it will return the FINISHED
> >>> status for a finished job, while under a yarn per job cluster it will throw
> >>> a ApplicationNotFound exception. I'm afraid that there might be other
> >>> behaviors for other environments.
> >>>
> >>> So what's the best practice to determine whether a job has finished or
> >>> not? Note that I'm not waiting for the job to finish. If the job hasn't
> >>> finished I would like to know it and do something else.
> >>>
> >>
> >>
> >> --
> >> Best Regards
> >>
> >> Jeff Zhang
> >>
> >
>
Reply | Threaded
Open this post in threaded view
|

Re: What's the best practice to determine whether a job has finished or not?

Till Rohrmann
For that attached YARN per-job mode, the cluster should wait until the
result of the job has been fetched. Can't this be used to report the final
status of the job?

Cheers,
Till

On Tue, May 12, 2020 at 9:52 AM Aljoscha Krettek <[hidden email]>
wrote:

> Hi,
>
> The problem is that the JobClient is talking to the wrong system. In YARN
> per-job mode the cluster will only run as long as the job runs so there
> will be no-one there to respond with the job status after the job is
> finished.
>
> I think the solution is that the JobClient should talk to the right
> system, in the YARN example it should talk directly to YARN for figuring
> out the job status (and maybe for other things as well).
>
> Best,
> Aljoscha
>
> On Fri, May 8, 2020, at 15:05, Kurt Young wrote:
> > +dev <[hidden email]>
> >
> > Best,
> > Kurt
> >
> >
> > On Fri, May 8, 2020 at 3:35 PM Caizhi Weng <[hidden email]> wrote:
> >
> > > Hi Jeff,
> > >
> > > Thanks for the response. However I'm using executeAsync so that I can
> run
> > > the job asynchronously and get a JobClient to monitor the job.
> JobListener
> > > only works for synchronous execute method. Is there other way to
> achieve
> > > this?
> > >
> > > Jeff Zhang <[hidden email]> 于2020年5月8日周五 下午3:29写道:
> > >
> > >> I use JobListener#onJobExecuted to be notified that the flink job is
> > >> done.
> > >> It is pretty reliable for me, the only exception is the client
> process is
> > >> down.
> > >>
> > >> BTW, the reason you see ApplicationNotFound exception is that yarn app
> > >> is terminated which means the flink cluster is shutdown. While for
> > >> standalone mode, the flink cluster is always up.
> > >>
> > >>
> > >> Caizhi Weng <[hidden email]> 于2020年5月8日周五 下午2:47写道:
> > >>
> > >>> Hi dear Flink community,
> > >>>
> > >>> I would like to determine whether a job has finished (no matter
> > >>> successfully or exceptionally) in my code.
> > >>>
> > >>> I used to think that JobClient#getJobStatus is a good idea, but I
> found
> > >>> that it behaves quite differently under different executing
> environments.
> > >>> For example, under a standalone session cluster it will return the
> FINISHED
> > >>> status for a finished job, while under a yarn per job cluster it
> will throw
> > >>> a ApplicationNotFound exception. I'm afraid that there might be other
> > >>> behaviors for other environments.
> > >>>
> > >>> So what's the best practice to determine whether a job has finished
> or
> > >>> not? Note that I'm not waiting for the job to finish. If the job
> hasn't
> > >>> finished I would like to know it and do something else.
> > >>>
> > >>
> > >>
> > >> --
> > >> Best Regards
> > >>
> > >> Jeff Zhang
> > >>
> > >
> >
>