Follow

Migrating Kinetica Persist Data

Mirgating data from one Kinetica instance to another

Problem:

Data currently resides in Kinetica instance A and you need it in Kinetica instance B

Solution:

Use gpudb_migrate_persistence.py to copy persist data

Method:

If you have a current backup of your persist data for each node you can use that, should you not have a current backup either;

  • Take a current backup, or
  • Shutdown instance A and you can source the data from the persist directory on each node

    Once you have identified the source of your data, you will need to open up a connection to the Instance A head node and worker nodes.

After connecting to the Instance A head node you will need to put together the string for executing the migration, you can reference the help file if you have questions.

script resides: /opt/gpudb/core/bin/gpudb_migrate_persistence.py
help: /opt/gpudb/core/bin/gpudb_migrate_persistence.py --help


In both examples below the assumption is you are leveraging a backup located in /tmp/gpudb_bak, which is a copy of /opt/gpudb/persist

sample script w/o authentication:

/opt/gpudb/bin/gpudb_python /opt/gpudb/core/bin/gpudb_migrate_persistence.py -p /tmp/gpudb_bak -g 10.10.10.15:9191

sample script w/ authentication:

/opt/gpudb/bin/gpudb_python /opt/gpudb/core/bin/gpudb_migrate_persistence.py -p /tmp/gpudb_bak -g 10.10.10.15:9191 --user admin --password <your_password>


In the example below the assumption is you are migrating data directly from the default persist directory

sample script w/o authentication:

 /opt/gpudb/bin/gpudb_python /opt/gpudb/core/bin/gpudb_migrate_persistence.py -p /opt/gpudb/persist -g 10.10.10.15:9191


sample script w/ authentication:

 /opt/gpudb/bin/gpudb_python /opt/gpudb/core/bin/gpudb_migrate_persistence.py -p /opt/gpudb/persist -g 10.10.10.15:9191 --user admin --password <your_password>

 

Once you have executed the transfer from the head node, you can then execute the migration from all worker nodes using the same script.

Alternatively, if all your backups are stored in the same directory (the rank-<#> directories) you can run the script one time and source your data from that directory.

Was this article helpful?
0 out of 0 found this helpful
Have more questions? Submit a request

0 Comments

Please sign in to leave a comment.