Is running topology updater a must?

eladws · 10 January 2022 14:56

Hi,
I’m trying to figure out whether running the topology updater script (using the third-party server at https://api.clio.one/htopology) is a must,
or whether it is just a best practice and my node will run perfectly well with a static topology file (even one containing only the default relays-new.cardano-mainnet.iohk.io peer).

I can’t find a clear explanation of this topology updater mechanism in the official Cardano documentation, so I would appreciate your help.

Thanks!
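
For reference, a static topology file of the kind the question describes can be generated with a few lines of Python. This is a minimal sketch, assuming the legacy (pre-P2P) layout with a top-level `Producers` list of `addr`/`port`/`valency` entries. The port `3001` and valency `2` are assumed defaults, so check them against the topology file shipped with your node version.

```python
import json


def static_topology(addr: str, port: int = 3001, valency: int = 2) -> dict:
    """Build a legacy-format topology with a single upstream peer.

    The "Producers"/addr/port/valency layout and the default values
    are assumptions about the pre-P2P file format, not taken from the thread.
    """
    return {"Producers": [{"addr": addr, "port": port, "valency": valency}]}


# The only peer is the default relay hostname mentioned in the question.
topology = static_topology("relays-new.cardano-mainnet.iohk.io")
print(json.dumps(topology, indent=2))
```

A file like this only gives the node an outgoing connection. It does nothing to make other relays connect to yours, which is the point the replies below focus on.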

georgem1976 · 10 January 2022 15:03

You need incoming connections to your relays from other relays in order to propagate the blocks minted by your block producer.
This will change with p2p, but until then… topology updater script or connections manually set by other SPOs to your relays.

os11k · 10 January 2022 15:22

Hi!

Running topology updater is optional, and it is probably the recommended way to do this.

Nevertheless, you will be totally fine without it, and several pool operators don’t use it for various reasons.

I had a discussion on GitHub about this with @rdlrt. He explained to me that it is not mandatory:

What is mandatory is having an incoming peer who is indirectly fetching blocks from another block producer. Until P2P is live, IOG uses the relays specified in stake pool registration to extract and connect to peers (twice per day). topologyUpdater provides a method for community to discover other live peers without a specific preference - you can read more about it here.

georgem1976 · 10 January 2022 19:24

“Totally fine” is debatable. You only depend on one incoming connection to distribute your blocks as fast as possible to the whole blockchain, instead of having tens of incoming connections. I find it very risky.

eladws · 10 January 2022 19:34

got it. so it is not mandatory per se, but it helps propagate my produced blocks faster to the entire network, which is obviously important.

Thank you all!

mcrio · 10 January 2022 19:55

Here’s a long, related discussion.

I myself use the updater. In almost all examples the block producer IP is provided to the remote service as a query parameter, but that is not required if you manually add it to the topology file after pulling topology data.
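
The manual step mentioned here can be sketched as a small merge function. This is only an illustration: it assumes the fetched data uses the legacy `Producers` list format, and the block producer address `10.0.0.5`, port `6000` and the peer `relay.example.org` are hypothetical placeholders.

```python
def add_block_producer(fetched: dict, bp_addr: str, bp_port: int) -> dict:
    """Return a new topology with the block producer listed first.

    `fetched` is the topology pulled from the remote service; the
    block producer address never has to be sent to that service.
    """
    # Drop any stale entry for the block producer so it appears only once.
    peers = [p for p in fetched.get("Producers", []) if p.get("addr") != bp_addr]
    block_producer = {"addr": bp_addr, "port": bp_port, "valency": 1}
    return {"Producers": [block_producer] + peers}


# Hypothetical fetched topology with one community relay.
fetched = {"Producers": [{"addr": "relay.example.org", "port": 3001, "valency": 1}]}
merged = add_block_producer(fetched, "10.0.0.5", 6000)
```

The function builds a new dictionary instead of modifying `fetched`, so the downloaded data can still be inspected or cached unchanged.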

7.4d4 · 10 January 2022 23:50

I read that discussion link and I would summarise it as follows:

1. You don’t need to use the “topology updater” to generate your own mainnet-topology.json file. You can do this however you like.
2. You DO need to keep pinging your node information to a service like api.clio.one. (This is one thing the “topology updater” script (whichever script you use) does every 1hr.)
3. You DO need to keep your relays online even when you don’t have blocks to produce.
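
The hourly ping described above amounts to a periodic HTTP request carrying the relay's port and current tip. The sketch below only builds such a URL and makes no network call. The `/v1/` path and the parameter names (`port`, `blockNo`, `valency`, `magic`, `hostname`) are assumptions based on how community scripts are commonly written, so verify them against the script you actually run.

```python
from urllib.parse import urlencode

# Assumed endpoint under the service URL quoted in the thread.
BASE_URL = "https://api.clio.one/htopology/v1/"
PING_INTERVAL_SECONDS = 60 * 60  # the thread says the ping happens every hour
MAINNET_MAGIC = 764824073        # Cardano mainnet network magic


def build_ping_url(port, block_no, valency=1, magic=MAINNET_MAGIC, hostname=None):
    """Build the URL a relay would request to announce itself (assumed format)."""
    params = {"port": port, "blockNo": block_no, "valency": valency, "magic": magic}
    if hostname is not None:
        # Optional: announce a DNS name instead of the caller's IP address.
        params["hostname"] = hostname
    return f"{BASE_URL}?{urlencode(params)}"


# Illustrative values: relay port 3001 and a made-up block number.
url = build_ping_url(port=3001, block_no=6700000)
print(url)
```

In practice this would be run from cron or a systemd timer once per `PING_INTERVAL_SECONDS`, with the block number read from the local node.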

The reason for 2 (and 3):

If you don’t do 2. then all other node operators that get their topology files from api.clio.one won’t be given your relays as peers for them to use.
This means that you won’t have many (or any?) incoming connections to your node.
Remember that node connections are half-duplex so if you don’t have incoming connections then other relays won’t pull any blocks from your block-producer. Or if you only have a few incoming connections then your blocks will propagate more slowly.

This last point is a problem but maybe not how many people think:

Unfortunately some people who run very small pools may not be setting up their nodes well. They think it is unnecessary to remain online much since mostly they don’t produce blocks. They just keep running their leader-logs to see if they get awarded a block.

Then when they get awarded a block, and just before they need to mint it, they make sure their block-producer is running and pulling blocks from lots of other nodes. However, they now don’t have many incoming connections because they haven’t been remaining on the network much before this time and they haven’t been pinging information to api.clio.one or similar.

Consequently when they do produce their block, its propagation through the network is slow.

This results in the next block-producer not seeing their block before producing theirs. Now we have two blocks with the same block number and conflicting transactions propagating across the network. Only 1 of these blocks will be adopted and thus rewarded with staking rewards.

Unfortunately, the second block-producer is likely at a disadvantage because this stake pool is more likely to be a bigger pool. The nodes adjudicate which block to adopt according to the VRF calculation based on keys and stake distribution. This puts the small pool at a massive advantage because its VRF score is likely to be very low (since it has little stake).

Thus the second pool gets penalised for the poor running and disconnectedness of the small pool.
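
The tie-break described above can be illustrated with a simplified sketch. It is not the node's actual consensus code: it only models "same block number, lower VRF value wins" and uses the Praos leadership threshold `1 - (1 - f)^stake` to show why a small pool's winning VRF values tend to be lower. The stake fractions and pool names are made up for illustration.

```python
from dataclasses import dataclass

ACTIVE_SLOT_COEFF = 0.05  # mainnet active slot coefficient "f"


def leader_threshold(relative_stake: float, f: float = ACTIVE_SLOT_COEFF) -> float:
    """Upper bound a normalised VRF output must stay under to lead a slot."""
    return 1.0 - (1.0 - f) ** relative_stake


@dataclass(frozen=True)
class Candidate:
    pool: str
    block_no: int
    vrf: float  # leader VRF output normalised to [0, 1)


def pick_winner(a: Candidate, b: Candidate) -> Candidate:
    """Simplified chain selection between two competing tips."""
    if a.block_no != b.block_no:
        return a if a.block_no > b.block_no else b  # longer chain wins
    return a if a.vrf <= b.vrf else b  # tie on length: lower VRF wins


# A pool can only lead with a VRF value below its threshold, so a pool
# with little stake always presents a very small value when it does lead.
small = Candidate("small-pool", 100, vrf=leader_threshold(0.00001) / 2)
big = Candidate("big-pool", 100, vrf=leader_threshold(0.01) / 2)
winner = pick_winner(small, big)
print(winner.pool)
```

With these illustrative numbers the small pool's block is selected, even if it was the one that propagated late.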

For more information see this link:

github.com/input-output-hk/ouroboros-network

Consensus should favor expected slot height to ward off delay attack

opened 05:12PM - 30 Jan 21 UTC

Straightpool

**Internal/External**
*External*

**Summary**
When a pool produces or propagates a block late so the block collides with the block of the next slot leader, only the vrf value is evaluated to determine the winning block, which is on its own the correct strategy deciding randomly between competitive slots. Due to the current logic in the case of delayed blocks it does happen that the block of the next slot leader which was properly propagated and produced on-time is lost due to the misconfiguration of the prior slot leader. This can be seen as a form of attack from the viewpoint of the on-time pool. Similarly, a later slot leader could produce his block multiple seconds earlier and collide with the previous block, if his vrf value was lower he could attack the previous block leader as his early block would make it on chain, the on-time block of the prior slot leader would be lost. We do not see this type of attack yet, as this would be a conscious effort, right now this attack is most likely without malice just out of misconfiguration.

**Steps to reproduce**
Steps to reproduce the behavior:
1. Wait on a situation where there are two slots with only a few seconds "x" apart
2. Delay production of first block by "x" seconds on first slot leader
3. Produce second block on second slot leader on-time
4. Wait until the block of the first slot leader has the lower vrf value
5. Observe that the block of the first slot leader makes it on chain, the block of the second slot leader is lost (had both blocks be on-time, both blocks would have made it on chain)

**Expected behavior**
The consensus protocol should evaluate the slot of the blocks and favor the block group which is expected in the current time frame. With *expected* I refer to the exact block slot height. The algorithm can calculate precisely which slot# a block at this exact moment in time should have. If there is more than one block in that group of "on-time" blocks only then the lower vrf should decide the winner.
The block of the pool which produced the block on-time and propagated the block swiftly should not be attackable by a prior slot leader who delays his blocks accidentally or on purpose or by a following block leader who produces his block multiple seconds earlier by modifying the system time on purpose as we have seen on the ITN as a tactic to win competitive slots.

**System info (please complete the following information):**
- OS: Ubuntu
- Version 20.04 LTS
- Node version: cardano-node 1.25.1 - linux-x86_64 - ghc-8.10 git rev 9a7331cce5e8bc0ea9c6bfa1c28773f4c5a7000f

**Screenshots and attachments**
![2021-01-30 17 57 49](https://user-images.githubusercontent.com/42584250/106362733-b3b6b480-6324-11eb-809b-7f6dd1d6e45b.jpg)
See epoch 244: https://pooltool.io/pool/000006d97fd0415d2dafdbb8b782717a3d3ff32f865792b8df7ddd00/orphans

This is the propagation delay of the slot leader before my block:
![2021-01-30 17 59 21](https://user-images.githubusercontent.com/42584250/106362778-f4aec900-6324-11eb-8ad0-8f42d449b946.jpg)
See propagation delays of the pool before my block here: https://pooltool.io/pool/59d12b7a426724961607014aacea1e584f3ebc1196948f42a10893bc/blocks

This is the hash of the winning late block which made it on chain: ca40eed5fd46f76fbf64e17a98808f098363a83dfe8c100046947505baa1e406
My block made it into the orphan list on pooltool, hash: 97abb258f15995688bdacdc75a054883b22471451026f409a967028ec7b30316

This is a log excerpt from my block producer, the block which should have been the parent for my block arrived full 4 seconds late:
{"at":"2021-01-28T07:16:47.00Z","env":"1.24.2:400d1","ns":["cardano.node.ChainDB"],"data":{"kind":"TraceAddBlockEvent.AddedToCurrentChain","newtip":"97abb258f15995688bdacdc75a054883b22471451026f409a967028ec7b30316@20251916"},"app":[],"msg":"","pid":"582044","loc":null,"host":"foobar","sev":"Notice","thread":"49"}
{"at":"2021-01-28T07:16:48.04Z","env":"1.24.2:400d1","ns":["cardano.node.ChainDB"],"data":{"kind":"TraceAddBlockEvent.SwitchedToAFork","newtip":"ca40eed5fd46f76fbf64e17a98808f098363a83dfe8c100046947505baa1e406@20251913"},"app":[],"msg":"","pid":"582044","loc":null,"host":"foobar","sev":"Notice","thread":"49"}

**This is the 2nd time I have observed this, last time was on December 21st, same pattern different slot leader:**
Block producer log.
{"at":"2020-12-20T03:07:09.01Z","env":"1.24.2:400d1","ns":["cardano.node.ChainDB"],"data":{"kind":"TraceAddBlockEvent.AddedToCurrentChain","newtip":"78f0c4a29a9c2b9a628584066f05ba3285f6b7eaac3bc270e353f52a0fa94a8c@16867338"},"app":[],"msg":"","pid":"582044","loc":null,"host":"foobat","sev":"Notice","thread":"49"}
{"at":"2020-12-20T03:07:10.64Z","env":"1.24.2:400d1","ns":["cardano.node.ChainDB"],"data":{"kind":"TraceAddBlockEvent.SwitchedToAFork","newtip":"2c237fded6c534200814d991deccc3c99f0a1bae01e603e743d6d5926e8a4519@16867333"},"app":[],"msg":"","pid":"582044","loc":null,"host":"foobar","sev":"Notice","thread":"49"}
78f0c4a29a9c2b9a628584066f05ba3285f6b7eaac3bc270e353f52a0fa94a8c was my block which was orphaned
2c237fded6c534200814d991deccc3c99f0a1bae01e603e743d6d5926e8a4519 was the hash of the block before mine (5 slots before) arriving 6 seconds late.
Mike downloaded the json of one of the blocks of the pool before mine and noticed a delay of about 10 seconds back then: {"height": 5100112, "slot": 16870897, "theoretical": 1608437188000, "tiptiming": [10547, 10416, 10440, 10509, 10350, 10099, 10432, 10428, 10333, 10378, 10427, 10548, 10219, 10111, 10362, 10293, 10350, 10281, 10296, 10410, 10461, 10419, 10484, 10343, 10350, 10485, 10347, 10330, 10530, 10592, 10327, 10290, 10373, 10332, 10192, 10288, 10390, 10375, 10392, 10301, 10369, 10457, 10350, 10439, 10354, 10493, 10323, 10503, 10407, 10337, 10343, 10398, 10442, 10359, 10367, 10325, 10334, 10305, 10499, 10369, 10346, 10231, 10369, 10311, 10317, 10420, 10505, 10303, 10240, 10310, 10560, 10350, 10360, 11098, 10410, 10310, 10310, 10280, 10320, 10563, 10370, 10330, 10280, 10120, 10400, 10310, 10350, 10310, 10340, 10490, 10460, 10380, 10540, 10410, 10340, -1608437188000, 10330, 10290, 10340, 10370, 10420, 10310, 10260, 10320, 10380, 10440, 10380, 10370, 10350, 10420, 10270, 10517, 10560, 10360, 10110, 10410, 10380, 10300, 10420, 10440, 10390, 10640, 10580, 10580, 10550, 10280, 10740, 10400, 10580, 10380, 10380, 10420, 10380, 10400, 10320, 10370, 10360, 10450, 10300, 10500, 10340, 10410, 10320, 10300, 10550, 10360, 10410, 10320, 10350, 10400, 10350, 10240, 10630, 10370, 10457, 10350, 10330, 10340, 10530, 10280, 10320, 10737, 10310, 10300, 11560, 10479, 10360, 10290, 10430, 10380, 10280, 10360, 10330, 10410, 10310, 10380, 10320, 10320, 11710, 10320, 10310, 10340, 25580, 10450, 10400, 10320, 10440, 11766, 10390, 10310, 12846, 10320, 10320, 12740, 12500, 12952, 13053, 18000, 20610, 20610, 24800], "histogram": "[[0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,

P2P cannot come soon enough on mainnet!
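
The "expected behavior" proposed in the linked issue can be sketched as a two-stage rule: prefer blocks that arrived on time for their slot, and fall back to the lower VRF value only among those. This is a rough illustration of the proposal and not anything the node implements. The `max_delay_slots` tolerance, the arrival slots and the VRF values are hypothetical; only the two minting slot numbers are taken from the log excerpt in the issue.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Arrival:
    pool: str
    slot: int          # slot the block was minted for
    arrival_slot: int  # slot that was current when the block was received
    vrf: float         # normalised leader VRF output (illustrative values)


def is_on_time(block: Arrival, max_delay_slots: int = 1) -> bool:
    """A block is on time if it arrives neither early nor too late."""
    return 0 <= block.arrival_slot - block.slot <= max_delay_slots


def pick_winner_on_time_first(candidates, max_delay_slots: int = 1) -> Arrival:
    """Prefer on-time blocks; use the lower VRF only to break remaining ties."""
    on_time = [c for c in candidates if is_on_time(c, max_delay_slots)]
    pool = on_time if on_time else list(candidates)
    return min(pool, key=lambda c: c.vrf)


# Minted for slot 20251913 but received about four slots later.
late = Arrival("prior-leader", slot=20251913, arrival_slot=20251917, vrf=0.0001)
# Minted for slot 20251916 and received within the same slot.
punctual = Arrival("next-leader", slot=20251916, arrival_slot=20251916, vrf=0.0004)

winner = pick_winner_on_time_first([late, punctual])
print(winner.pool)
```

Under a VRF-only comparison the late block would win with these values, whereas the on-time-first rule selects the punctual block.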

jf3110 · 11 January 2022 12:38

While it is not mandatory, your relay needs other relays to propagate your minted blocks onward. Otherwise, you won’t receive rewards for your blocks because they won’t appear in the blockchain. That said, the only alternative is to connect your relays to relays of other pools that run topology updater, instead of running it on your own relays.

If you feel uncomfortable with the shell script, there’s an alternative written in Python. IOG also has an alternative called p2p, in which relays find other relays to connect to on their own. However, p2p is not officially released and should not be used at the moment.

I guess we all look forward to using p2p, as it also reduces relay restarts and increases transaction throughput.

os11k · 11 January 2022 12:55

@jf3110, you are a bit wrong here.

Blocks are propagated by being fetched. If you submitted your pool certificate with the correct relays, then at least the IOHK relays will be able to connect to your relays and fetch blocks.

It is not ideal, and I personally do use topologyupdater, but this doesn’t mean that you can’t mint a block without it.

7.4d4 · 11 January 2022 13:45

Sure you can mint your block as this has nothing to do with how many incoming connections you have. However, if you have few incoming connections then your block will propagate slowly. This results in the next stake pool operator minting his block without receiving yours first and thus there are now two conflicting blocks.

The one accepted by the nodes is the one with the lower VRF value which is more likely to be the smaller stake pool operator. Unfortunately some small stake pool operators switch their nodes on/off and don’t maintain good network connectivity and this means that tools like “topology updater” don’t list their relays in the topology files given out.

os11k · 11 January 2022 13:56

I do tend to agree that topology updater is the recommended way to do this. I just want to be clear and transparent so that everybody knows that running topology updater is not mandatory.

I agree that it is more desirable to have more good incoming connections than fewer, but if you have just a small number of very good connections, then your block propagation times may not be too slow. This is pure speculation and we can debate it all day long, but my point is that topology updater is optional and you should be able to mint blocks and propagate them to the network without huge problems.

P.S. I use topology updater myself and I recommend that others use it.

7.4d4 · 11 January 2022 14:08

What I would like to know is how necessary it is to ping api.clio.one every hour?

I assume that if I don’t have my relays do this, then my relay IPs won’t be provided to other operators when they download their topology files, resulting in my relays having fewer incoming connections.

georgem1976 · 11 January 2022 14:11

That is correct, if you don’t do this every hour, your relays will be removed from that list (probably not immediately) and they will not be provided to other relays downloading a list of relays, so your number of incoming connections will decrease.

jf3110 · 11 January 2022 14:19

I never said that a block cannot be fetched for propagation. It is just that there has to be some relay to propagate your block. Whether your relay connects by registering with topology updater or through some other mechanism does not matter. However, it is important that a network of relays is maintained. That’s what topology updater provides, and if other relays pull your block, they either have to use topology updater to be part of the network or have further relays that propagate blocks and use topology updater themselves.

I know this is not an ideal solution, but it’s the way it is at the moment. p2p will solve it.

os11k · 11 January 2022 14:51

This is wrong. If you registered your relays correctly, then you will get incoming connections from IOHK.

jf3110 · 11 January 2022 18:19

Well, there’s an easy way to figure it out. If you don’t have somebody connecting to your relay and you don’t register with topology updater, you won’t have any incoming peers. That’s a fact.

The only other way some relays would discover yours would be through the p2p relays currently being tested on mainnet. I, personally, would not want to rely on luck for this to happen.

And as you said, you’re using topology updater. If you believe you don’t need it, then why?

os11k · 11 January 2022 19:13

Again, you are wrong here. You will have incoming peers, at least from IOHK; they get peers from your registration.

I’m not saying I don’t believe in topology updater; I’m using it. I’m just trying to explain how things work.

7.4d4 · 11 January 2022 21:10

Well it seems that since I am a small pool operator I have little to lose. I can just stop pinging api.clio.one and see how many incoming connections I end up with.

It seems that I won’t be penalised if my blocks propagate slowly but rather other pool operators will be punished for my disconnectedness.

Great!

jf3110 · 12 January 2022 07:58

There are no incoming peers from IOHK - the only exception would be from p2p testing.

jf3110 · 12 January 2022 08:08

If you don’t have any blocks scheduled, then you can try. Other pool operators will only see the effect if they don’t update their topology and restart relays. However, from experience, there are quite a few of those. They will also be the ones still connecting to your relay.

A simpler approach would be to register a new pool and never run topology updater. This happens from time to time with new operators (you can track them here in the forum), and none of them got incoming peers from other pools.

It’s quite reasonable that IOHK ran connecting relays at the beginning of the Shelley era to keep the network connected. As of today, with about 3000 pools active, those pools can organize themselves through topology updater.

Anyways, with the activation of p2p all of that discussion will become pointless, because p2p is the better and decentralized solution.
