(DEPRECATED) Apache Flink Mailing List archive.

Have trouble on running flink

Russell Bie

Have trouble on running flink

Hi Flink team,

I am trying to submit a Flink job (version 1.8.2) with the RocksDB state backend to my own YARN cluster (Hadoop version 2.6.0-cdh5.7.3). The job always fails after running for a few hours, when the connection to some TaskManagers is lost. The details are in this Stack Overflow question: <https://stackoverflow.com/questions/58046847/ioexception-when-taskmanager-restored-from-rocksdb-state-in-hdfs>. Could you provide some advice on this issue?

Thanks,
Russell

Biao Liu

Re: Have trouble on running flink

Hi Russell,

I don't think `BackendBuildingException` is the root cause. In your case, this
exception appears while the task is being cancelled.

Have you checked the log of the YARN NodeManager? There should be an exit
code for the container. More likely, the container was killed by the YARN
NodeManager.
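
As a rough illustration of that check (not part of the original reply), the sketch below scans NodeManager log lines for container exit codes and memory-limit kills. The two regular expressions are assumptions based on messages commonly seen in Hadoop 2.x NodeManager logs; the exact wording can differ between versions and distributions, so adjust them to what your log actually contains. The function name `scan_nodemanager_log` is hypothetical.

```python
import re

# Assumed log patterns; verify them against your own NodeManager log.
EXIT_CODE = re.compile(r"Exit code from container (\S+) is : (-?\d+)")
MEMORY_KILL = re.compile(
    r"containerID=(container_\w+)\].*running beyond (physical|virtual) memory limits"
)


def scan_nodemanager_log(lines):
    """Return (exit_codes, memory_kills) found in an iterable of log lines.

    exit_codes maps container ID -> last reported exit code (int).
    memory_kills maps container ID -> "physical" or "virtual".
    """
    exit_codes = {}
    memory_kills = {}
    for line in lines:
        match = EXIT_CODE.search(line)
        if match:
            exit_codes[match.group(1)] = int(match.group(2))
        match = MEMORY_KILL.search(line)
        if match:
            memory_kills[match.group(1)] = match.group(2)
    return exit_codes, memory_kills
```

An open log file can be passed directly as `lines`. An exit code of 143 or 137 usually means the process received SIGTERM or SIGKILL, which would be consistent with the container being killed from outside rather than failing on its own.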

BTW, I think we should discuss this on the flink-user mailing list, not the dev
mailing list. I will forward this mail there.

Thanks,
Biao /'bɪ.aʊ/

On Tue, 24 Sep 2019 at 19:19, Russell Bie <[hidden email]> wrote:

> Hi Flink team,
>
> I am trying to submit flink job (version 1.8.2) with RocksDB backend to my
> own yarn cluster (hadoop version 2.6.0-cdh5.7.3), the job always failed
> after running for a few hours with the connection loss of some
> taskmanagers. Here<
> https://stackoverflow.com/questions/58046847/ioexception-when-taskmanager-restored-from-rocksdb-state-in-hdfs>
> is the question details on the stackoverflow. I am just wondering if you
> could provide some advice on this issue?
>
> Thanks,
> Russell