[jira] [Created] (FLINK-10465) Jepsen: runit supervised sshd is stopped on tear down

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-10465) Jepsen: runit supervised sshd is stopped on tear down

Shang Yuanchun (Jira)
Gary Yao created FLINK-10465:
--------------------------------

             Summary: Jepsen: runit supervised sshd is stopped on tear down
                 Key: FLINK-10465
                 URL: https://issues.apache.org/jira/browse/FLINK-10465
             Project: Flink
          Issue Type: Bug
          Components: Tests
    Affects Versions: 1.7.0, 1.6.2
            Reporter: Gary Yao
            Assignee: Gary Yao
             Fix For: 1.7.0, 1.6.2


When tearing down the _DB_, we tear down all services supervised by runit. However when running the tests in Docker, sshd is under supervision by runit. When sshd is stopped, the tests cannot be continued because the control node cannot interact with the DB nodes anymore.

*How to reproduce*
Run command below in control-node container:
{code}
./docker/run-tests.sh 1 [...]/flink/flink-1.6.1/flink-1.6.1-bin-hadoop28-scala_2.11.tgz
{code}

*Expected behavior*
sshd should never be stopped



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)