Migration Tool

As mentioned above, MatrixDB uses the mxshift tool to perform data migration operations.
The mxshift tool currently supports the following features:

  • Full migration: In the unit of database, implement parallel migration of data from Greenplum 5 / Greenplum 6 / YMatrix cluster to a new YMatrix cluster.
  • Reverse migration: Reverse migration from YMatrix to Greenplum 5 / Greenplum 6.
  • Incremental migration: Incremental migration of data is achieved through the WHERE condition.
  • Supported data distribution strategies: hash distribution, random distribution, replication table (REPLICATED), Master-only.

![](https://img.ymatrix.cn/ymatrix_home/mxshift full library migration (screenshot)_1693981579.png)

If the data migration process is likened by water flowing through the pipeline and the data transmission efficiency is likened by water flow rate, then the water flow rate is often limited to a low value as a whole because of the "narrowest part of the pipeline", resulting in the so-called "single-point bottleneck".
With this in mind, the mxshift tool implements data transfer from the Segment directly to the Segment, eliminating single point bottlenecks that may occur in common migration operations. When the topology of the source cluster and the target cluster are inconsistent, that is, heterogeneous, mxshift can also calculate the optimal mapping scheme based on the number of source cluster segments, the number of target cluster segments, whether the segment is on the same physical machine, etc., to realize automated data migration.

Notes!
For detailed parameters, please refer to mxshift