[jira] [Created] (FLINK-5072) MetricFetcher Ask Timeout


Shang Yuanchun (Jira)
Ufuk Celebi created FLINK-5072:
----------------------------------

             Summary: MetricFetcher Ask Timeout
                 Key: FLINK-5072
                 URL: https://issues.apache.org/jira/browse/FLINK-5072
             Project: Flink
          Issue Type: Improvement
            Reporter: Ufuk Celebi


While running a large-scale test with 1.2-SNAPSHOT under heavy load on the TMs, I encountered many ask timeouts from the metric fetcher:

{code}
akka.pattern.AskTimeoutException: Ask timed out on [Actor[akka.tcp://flink@10.240.0.52:34471/user/MetricQueryService_container_1479207428252_0014_01_000026]] after [10000 ms]
        at akka.pattern.PromiseActorRef$$anonfun$1.apply$mcV$sp(AskSupport.scala:333)
        at akka.actor.Scheduler$$anon$7.run(Scheduler.scala:117)
        at scala.concurrent.Future$InternalCallbackExecutor$.scala$concurrent$Future$InternalCallbackExecutor$$unbatchedExecute(Future.scala:694)
        at scala.concurrent.Future$InternalCallbackExecutor$.execute(Future.scala:691)
        at akka.actor.LightArrayRevolverScheduler$TaskHolder.executeTask(Scheduler.scala:467)
        at akka.actor.LightArrayRevolverScheduler$$anon$8.executeBucket$1(Scheduler.scala:419)
        at akka.actor.LightArrayRevolverScheduler$$anon$8.nextTick(Scheduler.scala:423)
        at akka.actor.LightArrayRevolverScheduler$$anon$8.run(Scheduler.scala:375)
        at java.lang.Thread.run(Thread.java:745)
{code}

[~zentol] Does it make sense to investigate this further?


