In-Place A2HA to Automate HA
This page explains the In-Place migration of A2HA to Automate HA. This migration involves the following steps:
Prerequisites
A healthy state of the A2HA cluster to take fresh backup.
A2HA is configured to take backup on a mounted network drive (location example: /mnt/automate_backup).
Availability of 60% of space.
Taking Backup and clean up of instances
Take the latest backup of A2HA by running the following commands from any automate instance:
sudo chef-automate backup create
The above command will store the backup in a configured backup patch in a2ha.rb config file
/hab/a2_deploy_workspace/a2ha.rb
. Once the backup is completed successfully, save the backup Id. For example:20210622065515
. To use the backup created previously, run the following command on Automate node to get the backup id:chef-automate backup list
The output looks like as shown below:
Backup State Age 20180508201548 completed 8 minutes old 20180508201643 completed 8 minutes old 20180508201952 completed 4 minutes old
Create a bootstrap bundle from one of automate node using the following command:
sudo chef-automate bootstrap bundle create bootstrap.abb
The above command will create the bootstrap bundle and copy the same bundle to bastion or backup-dir.
Stop each of the frontend nodes (automate and chef-server) using the following command:
sudo chef-automate stop
Rename /hab dir to something else like /hab-old. Remove /bin/chef-automate.
Unload services from each of the Postgresql Nodes:
sudo hab svc unload chef/automate-backend-postgresql sudo hab svc unload chef/automate-backend-metricbeat sudo hab svc unload chef/automate-backend-journalbeat sudo hab svc unload chef/automate-backend-haproxy sudo hab svc unload chef/automate-backend-pgleaderchk
Check the status using the
hab svc status
command. None of the services should be running. Once checked, stop the habitat supervisor with the commandsystemctl stop hab-sup
. Rename /hab dir to something else like /hab-old.Unload services from each of the Elasticsearch Nodes
sudo hab svc unload chef/automate-backend-elasticsidecar sudo hab svc unload chef/automate-backend-elasticsearch sudo hab svc unload chef/automate-backend-journalbeat sudo hab svc unload chef/automate-backend-metricbeat sudo hab svc unload chef/automate-backend-curator sudo hab svc unload chef/automate-backend-kibana
Check the status using the
hab svc status
command. None of the services should be running. Once checked, stop the habitat supervisor with the commandsystemctl stop hab-sup
. Rename /hab dir to something else like /hab-old.In the bastion host, take a copy of your current workspace and keep it safe for a while.
Remove or Rename /hab dir in the bastion host.
Installing the Latest Automate HA
Follow Automate HA installation documentation. Click here to know more about config.toml
,
provide the same IPs and backup config in config.toml as in the a2ha.rb
file.
EFS backup configuration
In case the backup configuration was skipped in the deployment config.toml, the User needs to configure EFS backup manually in Automate HA please click here to know more.
Note
/mnt/automate_backups/opensearch/
it will be /mnt/automate_backups/elasticsearch/
Restore Backup
Once deployment is successful, proceed with restoring the backup in Automate HA. Click here to know more.
Login to one of automate nodes, and take current_config.toml file as shown below:
sudo chef-automate config show > current_config.toml
Add the following config to the current_config.toml file.
[global.v1.external.opensearch.auth.basic_auth]
username = "admin"
password = "admin"
AND
[global.v1.external.opensearch.backup.fs]
path = "/mnt/automate_backups/elasticsearch"
To restore, use the below command from same automate node, Make sure to stop all other frontend nodes using chef-automate stop
:
sudo chef-automate backup restore /mnt/automate_backups/backups/20210622065515/ --patch-config current_config.toml --airgap-bundle /var/tmp/frontend-4.x.y.aib --skip-preflight
Note
- After the restore command is successfully executed, run the
chef-automate config show
command. Both the ElasticSearch and OpenSearch configs are part of Automate Config. Keep both configs; it won’t impact the functionality. After restoring Automate HA is configured to communicate with OpenSearch.
OR
- We can remove the ElasticSearch config from the automate. To do that, redirect the applied config to the file and set the config again.
chef-automate config show > applied_config.toml
Modify applied_config.toml
, remove the elastic search config, and set the config. Set applied_config.toml
on all the frontend nodes manually. As the removal of config is not supported from the bastion. Use the below command to set the config manually.
chef-automate config set applied_config.toml
Note
In case backup_config = "file_system"
had been provided in config.toml of Automate HA deployment, then please patch the below OpenSearch config from bastion before starting the restore.
Create a
.toml
(say os_config.toml) file from Provision host and copy the following template with the path to the repo.[path] repo = "/mnt/automate_backups/elasticsearch"
Following command will add the configuration to the OpenSearch node.
chef-automate config patch --opensearch <PATH TO OS_CONFIG.TOML>
Click here to know more about the usage of S3 backup.
Copy the bootstrap.abb bundle to all the Frontend nodes of the Chef Automate HA cluster. Unpack the bundle using the below command on all the Frontend nodes:
sudo chef-automate bootstrap bundle unpack bootstrap.abb
Note
- Once Automate HA is up and running with restored data, We can remove old backed-up directories sudo
rm -rf hab-old
, freeing up acquired space. - Reset the backup configuration path to Opensearch so that new backups will be stored in Opensearch directory, please click here to know more.
Troubleshoot
While installing the new Automate HA, if PostgreSQL is having any issues in starting, and in PostgreSQL instance
hab svc status
shows a secret key mismatch error, then try the cleanup command with new Automate HA clichef-automate cleanup --onprem-deployment
and then remove /bin/chef-automate from all frontend nodes, now try the installation again.Click here to know more if you encounter an error while restoring related to the ElasticSearch snapshot.
While restoring the backup if an error related to backup directory occurs like
Error in Automate node: failed to create snapshot repository: Elasticsearch repository create request failed for repo** OR Error in Opensearch node: /mnt/automate_backups/backups/automate-elasticsearch-data/chef-automate-*-service] doesn’t match any of the locations specified by path.repo
please re-check your EFS backup configuration for the Automate and OpenSearch node, click here to know more.