[jira] [Created] (FLINK-2430) Potential race condition when restart all is called for a Twill runnable

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-2430) Potential race condition when restart all is called for a Twill runnable

Shang Yuanchun (Jira)
Henry Saputra created FLINK-2430:
------------------------------------

             Summary: Potential race condition when restart all is called for a Twill runnable
                 Key: FLINK-2430
                 URL: https://issues.apache.org/jira/browse/FLINK-2430
             Project: Flink
          Issue Type: Bug
    Affects Versions: 0.6-incubating
            Reporter: Henry Saputra


When sending restart instance to all for a particular TwillRunnable, it could have race condition where the heartbeat thread run right after all containers have been released which make the check:

     // Looks for containers requests.
      if (provisioning.isEmpty() && runnableContainerRequests.isEmpty() && runningContainers.isEmpty()) {
        LOG.info("All containers completed. Shutting down application master.");
        break;
      }

This could happen when all running containers are empty and new runnableContainerRequests has not been added.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)