[jira] [Created] (FLINK-21279) flink on yarn sink nothing

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-21279) flink on yarn sink nothing

Shang Yuanchun (Jira)
Spongebob created FLINK-21279:
---------------------------------

             Summary: flink on yarn sink nothing
                 Key: FLINK-21279
                 URL: https://issues.apache.org/jira/browse/FLINK-21279
             Project: Flink
          Issue Type: Bug
          Components: Table SQL / API
    Affects Versions: 1.12.1
         Environment: flink: 1.12.1

hive: 3.1.2
            Reporter: Spongebob
         Attachments: yarn.log

Here's the data chain of the flink application:
 # read from HDFS file via ExecutionEnvironment, get Dataset
 # collect the Dataset into Seq object.
 # transform Seq object into multi ArrayBuffer[Expression] objects.
 # create TableObjects from ArrayBuffers using `fromValues`
 # create catalog views from TableObjects
 # sink into hive table from catalog views.

It is all normal until step 6. And it runs inconsistently on local IDE and yarn cluster. When I run the application on local IDE it cost all network buffer memory then turn into failed( actually the HDFS file size is less then 2MB, and I had set the parrallelism of tableEnv to 1. If I run the one sink of them only it can be run normally). And to the yarn cluster, there throws the exception `Job was submitted in detached mode. Results of job execution, such as accumulators, runtime, etc. are not available. Please make sure your program doesn't call an eager execution function [collect, print, printToErr, count]` but the application can run successfully however can not sink anything to hive, I find it does not request any slot while running.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)