Relay node suddenly went down after 3 months

conniec1 · 17 July 2021 20:35

Hi,
My relay node has been running for 3 months. Today I found it seems to have gone down: 0 blocks, 0 slots, everything at 0. I restarted the node and still see the same (picture below).

Could you please share what the cause might be, if there is any clue, and suggest how I should recover from it?

Alexd1985 · 17 July 2021 20:42

journalctl -e -f -u cardano-node

or

journalctl -e -f -u cnode

conniec1 · 17 July 2021 21:49

I ran the command and it seems to have restarted the node; however, it still seems stuck at the “starting…” phase.

conniec1 · 17 July 2021 22:07

It works! Thanks for your help!

conniec1 · 18 July 2021 00:50

@Alexd1985 ,
It worked at first after the restart, but it seems to be back in the “starting…” phase again. What could cause it to fail so soon, in less than a couple of hours?
Should I restart, or are there other known issues?

It looks like it has run out of disk space? I checked df -k . and it is at 98%. How should I free up more space?

Alexd1985 · 18 July 2021 05:27

Delete the log files
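
A minimal sketch of that cleanup, assuming the node writes plain `*.log` files into a single directory. The directory path in the usage comment and the 7-day cutoff are hypothetical, so adjust them to your setup. The first function only lists the candidates so you can review them before the second one deletes anything.

```shell
#!/usr/bin/env bash

# List log files older than N days (default 7) under a directory.
list_old_logs() {
  local dir="$1" days="${2:-7}"
  find "$dir" -type f -name '*.log' -mtime +"$days" -print
}

# Delete the same set of files that list_old_logs prints.
delete_old_logs() {
  local dir="$1" days="${2:-7}"
  find "$dir" -type f -name '*.log' -mtime +"$days" -delete
}

# Usage (the path is an assumption, not taken from this thread):
#   list_old_logs "$HOME/cardano-my-node/logs" 7
#   delete_old_logs "$HOME/cardano-my-node/logs" 7
```

Keeping the most recent logs is a design choice here, so there is still something to read if the node fails again; deleting the whole log directory contents works too.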

conniec1 · 19 July 2021 03:25

When I run the topologyUpdater.sh script, the message shows my node is out of sync:
", "clientIp": "x.x.76.39", "msg": "blockNo 2606181 seems out of sync. please retry" }

I retried a couple of times, and it is still the same. What should I do?
This is my 2nd relay node. The first relay node seems okay.
This is the 2nd node's gLiveView.

Xpriens · 19 July 2021 09:22

Delete the db, restart, and let it resync. It happened to one of my relays once as well.
You might want to change the topology file and remove the extra outgoing connections because they slow down the sync process.

conniec1 · 19 July 2021 16:05

Do you mean, on the relay node, delete the db and change mainnet.topology.json?
BTW, this is my 2nd relay node.

Xpriens · 20 July 2021 08:44

Yep, exactly

conniec1 · 20 July 2021 16:13

There are 3 directories inside the db directory. Will it break anything if I delete the db?
Do you mean cardano-my-node/db?

Xpriens · 21 July 2021 08:05

Yes, this is the directory. If you delete the files in there and let the node resync, it will recreate them.
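
A hedged sketch of that reset, assuming the node runs as a systemd service named `cardano-node` (or `cnode`, as in the journalctl commands above) and that the db lives at `cardano-my-node/db`. The `reset_db` helper is hypothetical. It moves the db aside rather than deleting it, so it can be restored if something goes wrong.

```shell
#!/usr/bin/env bash

# Move <node_home>/db to a timestamped backup and print the backup path.
# The node recreates an empty db on its next start and resyncs from peers.
reset_db() {
  local node_home="$1"
  local db="$node_home/db"
  local backup
  backup="$node_home/db.bak.$(date +%Y%m%d%H%M%S)"
  if [ ! -d "$db" ]; then
    echo "no db directory at $db" >&2
    return 1
  fi
  mv "$db" "$backup" && echo "$backup"
}

# Usage (service name and path are assumptions; use what your setup has):
#   sudo systemctl stop cardano-node
#   reset_db "$HOME/cardano-my-node"
#   sudo systemctl start cardano-node
```

The backup still occupies disk space, so if the disk is nearly full, remove the backup directory once the node has resynced, or delete the db directly instead.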

cyberruss · 21 July 2021 08:26

Use df -h to check disk space. There are some easy bash options to check which directories are using the space. As Alex mentions above, it is almost certainly the log files. You can delete everything in the log directory and restart the node, and you will be fine.
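
One such option, as a rough sketch: `du` limited to one directory level, sorted by size. The `biggest_dirs` helper name is made up for illustration. Sizes are printed in KiB so that a plain numeric sort works.

```shell
#!/usr/bin/env bash

# Show the largest entries directly under a directory, biggest first.
# The first line is the total for the directory itself.
biggest_dirs() {
  local dir="${1:-.}" count="${2:-10}"
  du -k -d 1 "$dir" 2>/dev/null | sort -rn | head -n "$count"
}

# Usage: start at the node's home and drill down into whatever is largest.
#   biggest_dirs "$HOME/cardano-my-node"
```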

When you ran out of disk space you probably broke the db as well, given it is now re-synching.
