Joey Pereira created FLINK-17971:
------------------------------------
Summary: Speed up RocksDB bulk loading with SST generation and ingestion
Key: FLINK-17971
URL: https://issues.apache.org/jira/browse/FLINK-17971
Project: Flink
Issue Type: Improvement
Components: Runtime / State Backends
Reporter: Joey Pereira
RocksDB provides an API for creating SST files and ingesting them directly into RocksDB:
https://github.com/facebook/rocksdb/wiki/Creating-and-Ingesting-SST-files
Using this method to bulk load data into RocksDB may provide a significant performance increase, particularly for insert-heavy paths such as full savepoint recovery and state migration. This is one of the bulk-load optimizations described in
https://issues.apache.org/jira/browse/FLINK-17288
This was discussed on the user mailing list:
http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/RocksDB-savepoint-recovery-performance-improvements-td35238.html
A draft PR is here:
https://github.com/apache/flink/pull/12345/
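
For context on what SST generation asks of the caller: RocksDB's SST writer expects keys in ascending order under the database comparator, which by default is unsigned bytewise order. The following is a minimal sketch of that preparation step only, not the code in the draft PR. It uses plain Java (9 or later, for Arrays.compareUnsigned), and the SortedSink interface and SstBulkLoadSketch class are hypothetical names invented for illustration. In a real bulk load the sink role would likely be played by RocksJava's SstFileWriter (open, put, finish), followed by RocksDB#ingestExternalFile.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class SstBulkLoadSketch {

    /** Hypothetical stand-in for an SST file writer; not a RocksDB type. */
    public interface SortedSink {
        void put(byte[] key, byte[] value);
    }

    /**
     * Buffers unsorted {key, value} pairs, then replays them to the sink in
     * ascending unsigned bytewise key order. A later value for the same key
     * replaces the earlier one, so each key is emitted exactly once.
     */
    public static void writeSorted(List<byte[][]> entries, SortedSink sink) {
        // Unsigned lexicographic order, so 0x80 sorts after 0x7f.
        Comparator<byte[]> order = Arrays::compareUnsigned;
        TreeMap<byte[], byte[]> sorted = new TreeMap<>(order);
        for (byte[][] kv : entries) {
            sorted.put(kv[0], kv[1]);
        }
        for (Map.Entry<byte[], byte[]> e : sorted.entrySet()) {
            sink.put(e.getKey(), e.getValue());
        }
    }

    private static byte[] utf8(String s) {
        return s.getBytes(StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        List<byte[][]> entries = new ArrayList<>();
        entries.add(new byte[][] {utf8("b"), utf8("1")});
        entries.add(new byte[][] {utf8("a"), utf8("2")});
        entries.add(new byte[][] {utf8("b"), utf8("3")}); // overwrites "b"

        // Print each pair in the order an SST writer would receive it.
        writeSorted(entries, (k, v) ->
                System.out.println(new String(k, StandardCharsets.UTF_8)
                        + " = " + new String(v, StandardCharsets.UTF_8)));
    }
}
```

Buffering everything in a TreeMap keeps the sketch short but holds the whole batch in memory. For large state, an implementation would more likely sort in bounded chunks and write one SST file per chunk.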
--
This message was sent by Atlassian Jira
(v8.3.4#803005)