HIVE-28622: Duplicate Entries in TXN_WRITE_NOTIFICATION_LOG Due to Oracle's Handling of Empty Strings

In Oracle, empty strings ('') are treated as NULL for the VARCHAR2 and CHAR data types. This behavior can be confusing, because most other databases treat an empty string as distinct from NULL. As a result, the TXN_WRITE_NOTIFICATION_LOG table receives duplicate entries for a single Hive ACID transaction involving MERGE statements.

These duplicates cause the _files and _dumpmetadata files in a Hive ACID incremental dump to fall out of alignment whenever the dump scope includes one or more MERGE statements. Consequently, the Hive ACID incremental LOAD fails at the target (DR), blocking subsequent replication executions.

Solution:
* Add an additional check for the partition being null

Testing:
* Tested on a cluster with Oracle and MySQL as the backend database
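The failure mode described above can be illustrated with a small, self-contained sketch. It is not the Hive metastore code: the table `write_log`, its columns, and the helpers `oracle_str`, `log_write`, and `count_rows_after_merge` are hypothetical names chosen for illustration. SQLite stands in for the backend, Oracle's empty-string handling is mimicked by converting '' to NULL before binding, and the sketch assumes that one transaction logs the same unpartitioned table twice.

```python
import sqlite3


def oracle_str(value):
    """Mimic Oracle: an empty string is stored and bound as NULL."""
    return None if value == "" else value


def log_write(conn, txn_id, table, partition, null_safe):
    """Insert a log row unless a matching row already exists (hypothetical logic)."""
    part = oracle_str(partition)
    if null_safe:
        # Extra null check: a NULL partition matches a stored NULL partition.
        where = ("txn_id = ? AND tbl = ? AND "
                 "(part = ? OR (part IS NULL AND ? IS NULL))")
        params = (txn_id, table, part, part)
    else:
        # Plain equality: "part = NULL" is never true, so the row is never found.
        where = "txn_id = ? AND tbl = ? AND part = ?"
        params = (txn_id, table, part)
    exists = conn.execute(
        f"SELECT 1 FROM write_log WHERE {where}", params
    ).fetchone()
    if exists is None:
        conn.execute(
            "INSERT INTO write_log VALUES (?, ?, ?)", (txn_id, table, part)
        )


def count_rows_after_merge(null_safe):
    """Log the same (transaction, table, empty partition) twice and count rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE write_log (txn_id INTEGER, tbl TEXT, part TEXT)")
    for _ in range(2):
        log_write(conn, 1, "t", "", null_safe)
    return conn.execute("SELECT COUNT(*) FROM write_log").fetchone()[0]
```

With plain equality the existence check never matches the stored NULL, so the second call inserts a duplicate row; with the added null check the existing row is found and only one row remains.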