As we know, with acker storm can process data at least once. If we want it to process data exactly once, the status of stream must be saved.Trident(storm's DAG abstract) leaves this problem to users.I would like to know where flink save the status of stream
: ) Thanks a lot |
And How to make exactly once processing : P
------------------ Original ------------------ From: "cY";<[hidden email]>; Date: Sun, Oct 12, 2014 00:30 AM To: "dev"<[hidden email]>; Subject: a question about Exactly once processing As we know, with acker storm can process data at least once. If we want it to process data exactly once, the status of stream must be saved.Trident(storm's DAG abstract) leaves this problem to users.I would like to know where flink save the status of stream : ) Thanks a lot |
Hey,
There is currently an ongoing effort to enable stateful exactly once processing guarantees in Flink Streaming, but there is no currently available version that supports that. We expect this to be available in the next release after 0.7 Regards, Gyula On Sat, Oct 11, 2014 at 6:31 PM, cY <[hidden email]> wrote: > And How to make exactly once processing : P > > > ------------------ Original ------------------ > From: "cY";<[hidden email]>; > Date: Sun, Oct 12, 2014 00:30 AM > To: "dev"<[hidden email]>; > > Subject: a question about Exactly once processing > > > > As we know, with acker storm can process data at least once. If we want it > to process data exactly once, the status of stream must be > saved.Trident(storm's DAG abstract) leaves this problem to users.I would > like to know where flink save the status of stream > : ) Thanks a lot > |
Thanks for reply
Is there any idea about it?This is such a challenge~ I care this problem a lot. : ) ------------------ 原始邮件 ------------------ 发件人: "Gyula Fóra";<[hidden email]>; 发送时间: 2014年10月12日(星期天) 凌晨0:38 收件人: "dev"<[hidden email]>; 主题: Re: a question about Exactly once processing Hey, There is currently an ongoing effort to enable stateful exactly once processing guarantees in Flink Streaming, but there is no currently available version that supports that. We expect this to be available in the next release after 0.7 Regards, Gyula On Sat, Oct 11, 2014 at 6:31 PM, cY <[hidden email]> wrote: > And How to make exactly once processing : P > > > ------------------ Original ------------------ > From: "cY";<[hidden email]>; > Date: Sun, Oct 12, 2014 00:30 AM > To: "dev"<[hidden email]>; > > Subject: a question about Exactly once processing > > > > As we know, with acker storm can process data at least once. If we want it > to process data exactly once, the status of stream must be > saved.Trident(storm's DAG abstract) leaves this problem to users.I would > like to know where flink save the status of stream > : ) Thanks a lot > |
Hello,
Generally speaking the designed solution will most probably combine upstream backup with asynchronous state checkpointing. We are looking into ways to minimise the communication and recovery costs by exploiting properties of the stream (eg. recomputable segments-windows). Are you also working on fault tolerance? Paris On 11 Oct 2014, at 18:43, cY <[hidden email]> wrote: > Thanks for reply > Is there any idea about it?This is such a challenge~ > I care this problem a lot. : ) > > > > > ------------------ 原始邮件 ------------------ > 发件人: "Gyula Fóra";<[hidden email]>; > 发送时间: 2014年10月12日(星期天) 凌晨0:38 > 收件人: "dev"<[hidden email]>; > > 主题: Re: a question about Exactly once processing > > > > Hey, > > There is currently an ongoing effort to enable stateful exactly once > processing guarantees in Flink Streaming, but there is no currently > available version that supports that. We expect this to be available in the > next release after 0.7 > > Regards, > Gyula > > On Sat, Oct 11, 2014 at 6:31 PM, cY <[hidden email]> wrote: > >> And How to make exactly once processing : P >> >> >> ------------------ Original ------------------ >> From: "cY";<[hidden email]>; >> Date: Sun, Oct 12, 2014 00:30 AM >> To: "dev"<[hidden email]>; >> >> Subject: a question about Exactly once processing >> >> >> >> As we know, with acker storm can process data at least once. If we want it >> to process data exactly once, the status of stream must be >> saved.Trident(storm's DAG abstract) leaves this problem to users.I would >> like to know where flink save the status of stream >> : ) Thanks a lot |
Hello
I'm the core contributor of JStorm (https://github.com/alibaba/jstorm) which is the java version of Storm with a lot of optimization. I want to build a DAG abstract on storm like RDD. However I found the flink and its exciting DAG abstract.So I have an idea to make a repository which use the Storm to replace the flink's runtime .It will combine flink and storm so that flink can focus on DAG (DAG optimization, DAG view, etc) and storm can focus on the engine (failover, serialization, etc). Of course it's just an idea and i'm still investigate the feasibility of this scheme.Besides flink is a wonderful repository and i'm very glad to contribute to it. Thanks a lot : P cY ------------------ 原始邮件 ------------------ 发件人: "Paris Carbone";<[hidden email]>; 发送时间: 2014年10月12日(星期天) 下午3:56 收件人: "[hidden email]"<[hidden email]>; 主题: Re: 回复: a question about Exactly once processing Hello, Generally speaking the designed solution will most probably combine upstream backup with asynchronous state checkpointing. We are looking into ways to minimise the communication and recovery costs by exploiting properties of the stream (eg. recomputable segments-windows). Are you also working on fault tolerance? Paris On 11 Oct 2014, at 18:43, cY <[hidden email]> wrote: > Thanks for reply > Is there any idea about it?This is such a challenge~ > I care this problem a lot. : ) > > > > > ------------------ 原始邮件 ------------------ > 发件人: "Gyula Fóra";<[hidden email]>; > 发送时间: 2014年10月12日(星期天) 凌晨0:38 > 收件人: "dev"<[hidden email]>; > > 主题: Re: a question about Exactly once processing > > > > Hey, > > There is currently an ongoing effort to enable stateful exactly once > processing guarantees in Flink Streaming, but there is no currently > available version that supports that. We expect this to be available in the > next release after 0.7 > > Regards, > Gyula > > On Sat, Oct 11, 2014 at 6:31 PM, cY <[hidden email]> wrote: > >> And How to make exactly once processing : P >> >> >> ------------------ Original ------------------ >> From: "cY";<[hidden email]>; >> Date: Sun, Oct 12, 2014 00:30 AM >> To: "dev"<[hidden email]>; >> >> Subject: a question about Exactly once processing >> >> >> >> As we know, with acker storm can process data at least once. If we want it >> to process data exactly once, the status of stream must be >> saved.Trident(storm's DAG abstract) leaves this problem to users.I would >> like to know where flink save the status of stream >> : ) Thanks a lot |
Hi!
That is an exciting project! Let us know if you stumble over any questions! Stephan On Sun, Oct 12, 2014 at 2:26 PM, cY <[hidden email]> wrote: > Hello > I'm the core contributor of JStorm ( > https://github.com/alibaba/jstorm) which is the java version of Storm > with a lot of optimization. I want to build a DAG abstract on storm like > RDD. However I found the flink and its exciting DAG abstract.So I have an > idea to make a repository which use the Storm to replace the flink's > runtime .It will combine flink and storm so that flink can focus on DAG > (DAG optimization, DAG view, etc) and storm can focus on the engine > (failover, serialization, etc). Of course it's just an idea and i'm still > investigate the feasibility of this scheme.Besides flink is a wonderful > repository and i'm very glad to contribute to it. > Thanks a lot : P > cY > > > > > > ------------------ 原始邮件 ------------------ > 发件人: "Paris Carbone";<[hidden email]>; > 发送时间: 2014年10月12日(星期天) 下午3:56 > 收件人: "[hidden email]"<[hidden email]>; > > 主题: Re: 回复: a question about Exactly once processing > > > > Hello, > Generally speaking the designed solution will most probably combine > upstream backup with asynchronous state checkpointing. We are looking into > ways to minimise the communication and recovery costs by exploiting > properties of the stream (eg. recomputable segments-windows). Are you also > working on fault tolerance? > > Paris > > On 11 Oct 2014, at 18:43, cY <[hidden email]> wrote: > > > Thanks for reply > > Is there any idea about it?This is such a challenge~ > > I care this problem a lot. : ) > > > > > > > > > > ------------------ 原始邮件 ------------------ > > 发件人: "Gyula Fóra";<[hidden email]>; > > 发送时间: 2014年10月12日(星期天) 凌晨0:38 > > 收件人: "dev"<[hidden email]>; > > > > 主题: Re: a question about Exactly once processing > > > > > > > > Hey, > > > > There is currently an ongoing effort to enable stateful exactly once > > processing guarantees in Flink Streaming, but there is no currently > > available version that supports that. We expect this to be available in > the > > next release after 0.7 > > > > Regards, > > Gyula > > > > On Sat, Oct 11, 2014 at 6:31 PM, cY <[hidden email]> wrote: > > > >> And How to make exactly once processing : P > >> > >> > >> ------------------ Original ------------------ > >> From: "cY";<[hidden email]>; > >> Date: Sun, Oct 12, 2014 00:30 AM > >> To: "dev"<[hidden email]>; > >> > >> Subject: a question about Exactly once processing > >> > >> > >> > >> As we know, with acker storm can process data at least once. If we want > it > >> to process data exactly once, the status of stream must be > >> saved.Trident(storm's DAG abstract) leaves this problem to users.I would > >> like to know where flink save the status of stream > >> : ) Thanks a lot > |
Free forum by Nabble | Edit this page |