What might the process be for failover cardano-node services?

Curtis_Paul · 21 September 2021 21:40

What if there were two relay nodes on a public network, with a relay and a block producer on the private network? We will call this private relay the “standby” block producer server. Let’s say the private relay is running simply to keep its DB synced.

Would it be technically possible to copy the block producer keys to the standby server and shut down the other block producer to force pool activities to move to the standby server?
Would the keys still function?
Would it continue to produce blocks for the pool?

Is there a better/more appropriate way to create redundancy with cardano-node services?

Markus-VITAL · 21 September 2021 21:47

Many pools run failover approaches. Find mine here: Block Producer - Failover Approach with a BP Standby

Maybe not the easiest in terms of the switchover mechanism. But the architecture is similar to your description.

Alexd1985 · 21 September 2021 21:49

Yes, uploading these 3 files (op/node.cert, vrf.skey and kes/hot.skey) and updating the start script will start the private relay as a producer, and yes, it will continue to produce blocks.

You can do it manually or automatically.
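As a rough illustration of what "updating the start script" can mean, here is a minimal Python sketch that builds the `cardano-node run` command line for either role. The directory layout and the file names (`kes.skey`, `vrf.skey`, `node.cert`, `topology.json`, `config.json`) are assumptions made for the example, not taken from any particular setup; the only difference between the two roles is the three key/certificate flags. As discussed above, the original block producer should be shut down before the standby is started with the keys.

```python
from pathlib import PurePosixPath


def build_node_command(base_dir, port, producer=False):
    """Return the argv list for `cardano-node run`.

    base_dir and the file names below are hypothetical; adjust to your layout.
    With producer=False the node runs as a plain relay.
    With producer=True the KES key, VRF key and operational certificate
    are added, which is what turns the node into a block producer.
    """
    base = PurePosixPath(base_dir)
    cmd = [
        "cardano-node", "run",
        "--topology", str(base / "topology.json"),
        "--database-path", str(base / "db"),
        "--socket-path", str(base / "db" / "node.socket"),
        "--host-addr", "0.0.0.0",
        "--port", str(port),
        "--config", str(base / "config.json"),
    ]
    if producer:
        keys = base / "keys"  # assumed location of the uploaded files
        cmd += [
            "--shelley-kes-key", str(keys / "kes.skey"),
            "--shelley-vrf-key", str(keys / "vrf.skey"),
            "--shelley-operational-certificate", str(keys / "node.cert"),
        ]
    return cmd


if __name__ == "__main__":
    # Print both variants so they can be compared side by side.
    print(" ".join(build_node_command("/opt/cardano", 6000)))
    print(" ".join(build_node_command("/opt/cardano", 6000, producer=True)))
```

A manual failover would then amount to stopping the relay process and restarting it with the producer variant of the command; an automated one would wrap the same switch in a health check.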

Cheers,

Curtis_Paul · 21 September 2021 21:52

Can the relay “standby” node continue hosting on the same IP it was as a relay or does the node IP on the standby need to change to the primary block producer node IP?

Alexd1985 · 21 September 2021 21:55

Nope, the IP will not need to change.

The private relay should not be registered and should not run the topologyupdater script.

Connect the producer to the private relay and to the registered relays.
Connect the private relay to the registered relays only.
If you do not connect the main producer to the registered relays as well, the private relay becomes a single point of failure.
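The connection rules above can be sketched as two topology files. The snippet assumes the legacy `Producers` topology format (`addr`, `port`, `valency`) used by cardano-node at the time of this thread; all IP addresses and ports are made-up placeholders.

```python
import json


def topology(peers):
    """Build a legacy-format topology dict from (addr, port) pairs."""
    return {
        "Producers": [
            {"addr": addr, "port": port, "valency": 1} for addr, port in peers
        ]
    }


# Hypothetical addresses, for illustration only.
REGISTERED_RELAYS = [("203.0.113.10", 6000), ("203.0.113.11", 6000)]
STANDBY_RELAY = ("10.0.0.12", 6000)  # private, unregistered relay

# Producer: connects to the private relay AND to the registered relays,
# so the private relay is not a single point of failure.
producer_topology = topology([STANDBY_RELAY] + REGISTERED_RELAYS)

# Private standby relay: connects to the registered relays only.
standby_topology = topology(REGISTERED_RELAYS)

if __name__ == "__main__":
    print(json.dumps(producer_topology, indent=2))
    print(json.dumps(standby_topology, indent=2))
```

Because the standby already points at the registered relays, its topology file would not need to change when it takes over as producer.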

Curtis_Paul · 21 September 2021 21:56

What if, for some reason, the relay is behind on synchronization versus the primary block producer at failover time? Would this be a problem? Or should the relay be 100% synced before reconfiguring it to start producing blocks?

Alexd1985 · 21 September 2021 21:58

It must be 100% synced, but if you run it as a relay then it should stay synced all the time anyway.
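An automated failover could check the sync state before switching roles. The sketch below assumes the JSON printed by `cardano-cli query tip` contains a `syncProgress` field holding a percentage as a string, as reported by recent cardano-cli versions; the sample inputs are illustrative, not real node output.

```python
import json


def is_fully_synced(tip_json):
    """Return True only if the tip JSON reports 100% sync progress.

    A missing `syncProgress` field is treated as "not synced", which is
    the safe choice before promoting a standby to block producer.
    """
    tip = json.loads(tip_json)
    progress = tip.get("syncProgress")
    if progress is None:
        return False
    return float(progress) >= 100.0


if __name__ == "__main__":
    # Illustrative samples; in practice the string would come from
    # running `cardano-cli query tip` against the standby node.
    print(is_fully_synced('{"syncProgress": "100.00"}'))
    print(is_fully_synced('{"syncProgress": "99.87"}'))
```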

Curtis_Paul · 21 September 2021 22:00

Forgive the stupid questions…
What do you mean by not registered?
What is the topologyupdater script for? Is it a requirement to run a block producer?

Curtis_Paul · 21 September 2021 22:01

What would happen if the standby wasn’t 100% synced on failover?

Alexd1985 · 21 September 2021 22:02

It will not start

Alexd1985 · 21 September 2021 22:03

Do you remember that you registered the relays when you registered the pool certificate?
The topology updater is for relays only. It should run once an hour to announce the nodes to the public network.

Curtis_Paul · 21 September 2021 22:15

I haven’t actually configured a pool yet, I’m still in the planning phase of implementation.

Alexd1985 · 21 September 2021 22:17

Ah OK, then when you register the pool, add only the public relays.

Curtis_Paul · 21 September 2021 22:22

I think I understand the reasoning.

So I guess registering it would have an effect on the public metadata that represents the pool, right?
In other words, we don’t want the public aware of that private “standby” relay.
