Verifying data migration correctness
The checksum principle
Wei, B., & Chen, T. (2014). Verifying data migration correctness: The checksum principle (RTI Press Publication No. OP-0019-1403). Research Triangle Park, NC: RTI Press. DOI: 10.3768/rtipress.2014.op.0019.1403
Data migration, generally defined as the process of reading data from a source and inserting them into a target database, is an important element of extract, transform, and load (ETL) systems. Errors can occur while data are transmitted during migration, and these errors can directly affect the quality of the data in the target database. Verifying the correctness of the outcome is therefore a critical component of any data migration operation. Current methods for verifying data migration correctness have many limitations, including incompleteness and inaccuracy. This paper describes an innovative method that applies the well-proven checksum methodology to verify the correctness of the data migration outcome. The method verifies the migrated data thoroughly and accurately, mitigating most of the weaknesses associated with current verification methods. It is also easy to implement and will greatly enhance the quality of data migration operations.
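The abstract does not give the details of the method, so the following is only a minimal sketch of the general checksum idea, not the authors' implementation. It assumes that rows can be read from both databases as tuples of simple values, and it uses SHA-256 from Python's standard library as a stand-in checksum. The names row_checksum, table_checksum, and migration_matches are hypothetical and chosen for illustration.

```python
import hashlib


def row_checksum(row):
    """Digest of one row (assumption: fields have a stable repr())."""
    h = hashlib.sha256()
    for field in row:
        data = repr(field).encode("utf-8")
        # Length-prefix each field so ("ab", "c") and ("a", "bc") differ.
        h.update(len(data).to_bytes(8, "big"))
        h.update(data)
    return h.digest()


def table_checksum(rows):
    """Order-independent digest of a whole table.

    Row digests are sorted before being combined, so the result does not
    depend on the order in which the database returns the rows.
    """
    h = hashlib.sha256()
    for digest in sorted(row_checksum(r) for r in rows):
        h.update(digest)
    return h.hexdigest()


def migration_matches(source_rows, target_rows):
    """True if source and target tables produce the same checksum."""
    return table_checksum(source_rows) == table_checksum(target_rows)
```

As a usage illustration, calling migration_matches with the rows read from the source and the rows read back from the target returns True only when every field of every row agrees. A single altered value, a missing row, or a duplicated row changes the table checksum, which is the property that lets a checksum comparison cover the full data set rather than a sample.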