[jira] [Created] (FLINK-17863) flink streaming sql read hive with lots small files need to control parallelism

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-17863) flink streaming sql read hive with lots small files need to control parallelism

Shang Yuanchun (Jira)
richt richt created FLINK-17863:
-----------------------------------

             Summary: flink streaming sql  read hive with lots small files  need to control  parallelism
                 Key: FLINK-17863
                 URL: https://issues.apache.org/jira/browse/FLINK-17863
             Project: Flink
          Issue Type: Improvement
          Components: Connectors / Hive
    Affects Versions: 1.10.1
            Reporter: richt richt


the table wy.cartest  has 19 rows with 19 files  

so when i query the table use *streaming* mode it will require 19 slots , my cluster cannot allocate so much resource to the task.
----
Caused by: org.apache.flink.runtime.JobException: Vertex Source: HiveTableSource(carid, time, num, var) TablePath: wy.cartest, Par
titionPruned: false, PartitionNums: null -> SinkConversionToTuple2's parallelism (19) is higher than the max parallelism (2). Plea
se lower the parallelism or increase the max parallelism.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)