Posted by
Vasiliki Kalavri on
Apr 26, 2016; 4:57pm
URL: http://deprecated-apache-flink-mailing-list-archive.368.s1.nabble.com/Updated-Gelly-Roadmap-tp11431.html
Hi all,
as promised, I have updated the Gelly roadmap [1].
Below, I am describing and reasoning about the changes I made. Please, let
me know whether you agree and if you have any other ideas for further
improvements and feature additions.
*1. Operators for highly skewed graphs*:
I have removed this item completely. It referred to Andra's master thesis
which has been completed. We had a discussion back then (see [2]), but no
activity after that.
*2. Scala API*:
I removed this item because it's done 🎆
*3. Graph Streaming*:
I removed this item also. We have built an experimental API for graph
streaming with Paris and KTH students. The code is available in [3]. If you
think it would be a valuable addition as a library to Flink, we can start a
separate discussion thread about it.
*4. Library Methods*:
- Affinity Propagation: this is WIP in FLINK-1707
- HITS + Adsorption: Removed. They were started by TUB students and not
finished. I propose to only revisit these only if someone asks for them.
- Strongly CC + DIA: Removed. Nobody has worked on them AFAIK.
*5. Graph partitioning*:
This is still relevant in my opinion and thus I kept it in the updated
roadmap.
*6. Partition-centric iterations*:
We have created a POC implementation with KTH students [4]. In my opinion,
it would be nice to add this to Gelly and most of the work has already been
done, so I kept it.
*7. Generic Iterations*:
This requires caching intermediate results. Anyone has a status update on
that?
*8. Performance evaluation*:
I'm currently working on this, integrating Gelly with the Graphalytics
benchmark. My WIP is in [5] in case you want to get involved :)
*9. Bipartite support*:
This is still relevant in my opinion. Someone had started working on it,
but has been inactive for a while. I pinged the JIRA (FLINK-2254).
From the wishlist:
- *Neo4j* input/output formats have been implemented as an external project
by Martin Junghanns. Shall we go ahead and them to project-flink? We should
definitely link to this from the third-party packages.
- *TinkerPop*: Discussion started from the TinkerPop community, but there
was not much activity from our side [6]. I wrote my opinion then and I am
very much in favor. Anyone else wants to share their thoughts?
Looking forward to your input,
-Vasia.
[1]:
https://cwiki.apache.org/confluence/display/FLINK/Flink+Gelly[2]:
http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/Proposal-Addition-to-Gelly-td7436.html[3]:
https://github.com/vasia/gelly-streaming[4]:
https://github.com/vasia/gelly-partition-centric[5]:
https://github.com/vasia/graphalytics-platforms-gelly[6]:
http://apache-flink-mailing-list-archive.1008284.n3.nabble.com/Apache-Tinkerpop-amp-Geode-Integration-td9252.html