[jira] [Created] (FLINK-17971) Speed up RocksDB bulk loading with SST generation and ingestion

Shang Yuanchun (Jira)
Joey Pereira created FLINK-17971:
------------------------------------

             Summary: Speed up RocksDB bulk loading with SST generation and ingestion
                 Key: FLINK-17971
                 URL: https://issues.apache.org/jira/browse/FLINK-17971
             Project: Flink
          Issue Type: Improvement
          Components: Runtime / State Backends
            Reporter: Joey Pereira


RocksDB provides an API for creating SST files and ingesting them directly into RocksDB: [https://github.com/facebook/rocksdb/wiki/Creating-and-Ingesting-SST-files]
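
One constraint of this approach is that an SST file writer expects keys to be added in ascending order under the database comparator (bytewise by default), so a bulk loader has to sort its entries and split them into per-file batches before writing. The following is a minimal sketch of that preparation step in plain Java, not the implementation from the draft PR. The class SstBatchPlanner, its Entry type, and the maxEntriesPerFile parameter are hypothetical names introduced for illustration, and the sketch assumes keys are unique. The RocksDB calls themselves are only indicated in comments.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical helper: orders entries and splits them into per-file batches. */
final class SstBatchPlanner {

    /** One key/value pair destined for an SST file. */
    static final class Entry {
        final byte[] key;
        final byte[] value;

        Entry(byte[] key, byte[] value) {
            this.key = key;
            this.value = value;
        }
    }

    /**
     * Returns a copy of the entries sorted by unsigned lexicographic key order,
     * which is the order used by RocksDB's default bytewise comparator.
     */
    static List<Entry> sortBytewise(List<Entry> entries) {
        List<Entry> sorted = new ArrayList<>(entries);
        sorted.sort((a, b) -> Arrays.compareUnsigned(a.key, b.key));
        return sorted;
    }

    /**
     * Sorts the entries and cuts them into consecutive batches of at most
     * maxEntriesPerFile entries. Each batch would then be written to its own
     * SST file (for example with RocksJava's SstFileWriter) and the resulting
     * files handed to the database (for example with RocksDB#ingestExternalFile).
     * Assumption: keys are unique; duplicates would need to be resolved first.
     */
    static List<List<Entry>> plan(List<Entry> entries, int maxEntriesPerFile) {
        if (maxEntriesPerFile <= 0) {
            throw new IllegalArgumentException("maxEntriesPerFile must be positive");
        }
        List<Entry> sorted = sortBytewise(entries);
        List<List<Entry>> batches = new ArrayList<>();
        for (int start = 0; start < sorted.size(); start += maxEntriesPerFile) {
            int end = Math.min(start + maxEntriesPerFile, sorted.size());
            batches.add(new ArrayList<>(sorted.subList(start, end)));
        }
        return batches;
    }
}
```

Because each batch is a consecutive slice of the sorted input, the key ranges of the planned files do not overlap one another. Note the unsigned comparison: Java bytes are signed, so a plain signed comparison would order a key starting with 0x80 before one starting with 0x01.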

Using this method to bulk load data into RocksDB may significantly improve performance, particularly on insert-heavy paths such as full savepoint recovery and state migration. This is one way of optimizing bulk loads, as described in https://issues.apache.org/jira/browse/FLINK-17288

This was discussed on the user mailing list: [http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/RocksDB-savepoint-recovery-performance-improvements-td35238.html]

A draft PR is here: [https://github.com/apache/flink/pull/12345/]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)