How is the topology actually updated?

tomdx · 10 June 2021 10:44

Many of us run the TopologyUpdater service from guild-operators, which first publishes the node’s IP and then fetches an updated topology. Up until now, I always assumed that the node somehow monitors its topology config file and reloads changes when they occur.

This does not seem to be the case and I was told that a node restart would be needed to reload the updated topology config. For obvious reasons, I don’t do an hourly restart of the node, in fact I very rarely do docker restart relay.

How do you update your topology?

PS: Up until now, I never questioned this, because the nodes were always well connected even without an explicit restart. With Alonzo, the need to explicitly update ones topology will likely go away, but still this puzzles me.

Alexd1985 · 10 June 2021 12:17

I am restarting the relays once/12 hours… but default (topologyUpdater) it will restart once/24 hours;

deploying the topology updater as systemd will have the following services:

DEPLOY THE SCRIPT

systemd service
The script can be deployed as a background service in different ways but the recommended and easiest way if prereqs.sh was used, is to utilize the deploy-as-systemd.sh script to setup and schedule the execution. This will deploy both push & fetch service files as well as timers for a scheduled 60 min node alive message and cnode restart at the user set interval when running the deploy script.

cnode-tu-push.service : pushes a node alive message to Topology Updater API
cnode-tu-push.timer : schedules the push service to execute once every hour
cnode-tu-fetch.service : fetches a fresh topology file before cnode.service file is started/restarted
cnode-tu-restart.service : handles the restart of cardano-node(cnode.sh)
cnode-tu-restart.timer : schedules the cardano-node restart service, default every 24h

tomdx · 10 June 2021 12:23

Oh dear. Is that synced with the leaderlog in any way? You wouldn’t want to miss a block only because your script happens to restart the node.

Alexd1985 · 10 June 2021 12:24

of course, that’s why I never restart the relays same time there are 6 hours differences between

tomdx · 10 June 2021 12:25

ok, +1 for redundancy.

So, although you get a topology update every hour, you only reload it once a day, right?

Alexd1985 · 10 June 2021 12:26

now, u can add peers manually from main topology, from trust pools and u will never have problems and perhaps no reload need

anyway u can monitor the number of peers in grafana… if are around 20 then it is not mandatory to reload the node

tomdx · 10 June 2021 12:30

We are currently wondering, whether we should built this script based functionality into the “official” upstream docker image. Alonzo is supposed to support config reloads triggered by signals - I guess that would be the right time to add this functionality, if even needed by then.

Alexd1985 · 10 June 2021 12:33

I don’t know what to say…

tomdx · 10 June 2021 12:40

When you press Ctrl+C on a foreground process, you sent the SIGINT signal to that process. The same can be done with all sorts of signals with the linux kill command.

Currently SIGINT causes the node to do a graceful shutdown, which SIGTERM does not do. The result of a non-graceful shutdown is that the node has to re-validate the entire block data base, which may take 15min. Therefore, never just pull the plug on your node

Alexd1985 · 10 June 2021 12:43

you are perfectly right

but my relay is starting faster

tomdx · 10 June 2021 12:44

Yes, and this is because it probably does a graceful shutdown. In the terminal you should see "Shutting down ..."

Alexd1985 · 10 June 2021 12:46

exactly

tomdx · 10 June 2021 12:55

With Alonzo the node will support config reload when you send it a specific signal (I don’t yet know which one) - a restart of the node won’t be necessary any more.

Alexd1985 · 10 June 2021 12:56

and also topologyupdater I think will be removed… P2P will replace it, if I understood well

tomdx · 10 June 2021 13:02

Yes. I therefore should probably first wait on the signal support (because requiring a restart is out of question) and then (and only if alonzo p2p gets delayed for some reason) can we build that script base topology update into the image.

Nye_Liu · 16 October 2021 03:54

It is ridiculous that you have to restart the process simply to have it reread the topology file. This means every time the updater alters it, you have to restart the process. This sets off alerts every time (since we have it automated).

Nye_Liu · 16 October 2021 03:54

Is this documented anywhere? If not, why not? This should also reread the KES keys so rotation doesn’t also require a restart

Topic		Replies	Views
How Often Should You Update Your Topology On Mainnet? Operate a Stake Pool topology	3	1009	1 April 2021
Why do we need the topology updater? Operate a Stake Pool	39	2685	26 December 2021
Questions on TopologyUpdater push and relay topology pull Operate a Stake Pool	5	640	11 October 2021
Nodes slower to restart following 1.25.1 update, anyone? Staking & Delegation	12	408	8 February 2021
Updating relay topology Setup a Stake Pool topology	2	753	22 July 2021

How is the topology actually updated?

DEPLOY THE SCRIPT

Related topics