[jira] [Created] (FLINK-17044) FLIP-108: Add GPU support in Flink

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

[jira] [Created] (FLINK-17044) FLIP-108: Add GPU support in Flink

Shang Yuanchun (Jira)
Yangze Guo created FLINK-17044:
----------------------------------

             Summary: FLIP-108: Add GPU support in Flink
                 Key: FLINK-17044
                 URL: https://issues.apache.org/jira/browse/FLINK-17044
             Project: Flink
          Issue Type: New Feature
          Components: Runtime / Coordination
            Reporter: Yangze Guo
             Fix For: 1.11.0


With widespread advances in machine learning (or deep learning), more and more enterprises are beginning to incorporate ML models across a number of products. Supporting the ML scenarios is one of Flink’s roadmap targets. GPU is widely used as the accelerator by people from the ML community. It is necessary to add GPU support. 

Currently, Flink only supports to request GPU resource in Mesos integration while most users and enterprises deploying Flink on Yarn/Kubernetes or Standalone mode. Thus, we propose to add GPU support in Flink. As a first step, we propose to:

- Enable user to configure the GPU cores per task executor and forward such requirements to the external resource managers (for Kubernetes/Yarn/Mesos setups).
- Provide information of available GPU resources to operators.




--
This message was sent by Atlassian Jira
(v8.3.4#803005)