[jira] [Created] (FLINK-12472) Support setting attemptFailuresValidityInterval of jobs on Yarn

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-12472) Support setting attemptFailuresValidityInterval of jobs on Yarn

Shang Yuanchun (Jira)
Victor Wong created FLINK-12472:
-----------------------------------

             Summary: Support setting attemptFailuresValidityInterval of jobs on Yarn
                 Key: FLINK-12472
                 URL: https://issues.apache.org/jira/browse/FLINK-12472
             Project: Flink
          Issue Type: Improvement
          Components: Deployment / YARN
            Reporter: Victor Wong
            Assignee: Victor Wong


According to the documentation of [Yarn|http://hadoop.apache.org/docs/r2.6.0/api/org/apache/hadoop/yarn/api/records/ApplicationSubmissionContext.html], a yarn application can set a _attemptFailuresValidityInterval_  to reset application attempts.

 

"attemptFailuresValidityInterval. _The default value is -1. when attemptFailuresValidityInterval in milliseconds is set to > 0, the failure number will no take failures which happen out of the validityInterval into failure count. If failure count reaches to maxAppAttempts, the application will be failed."_

 

We can make use of this feature to make Flink jobs on Yarn to be more long-running.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)