BP node dropping metrics on Grafana after upgrading

doc_krieger · 24 June 2021 21:33

Hi there,

I recently upgraded the VPS that runs my BP node because I noticed it was using a fair amount of RAM. The upgrade seemed to go without a hitch until I checked Grafana and saw gaps in the metrics coming from my BP node. The node is still running and appears to be processing transactions, but I’m concerned because it has never done this before. Any idea how to troubleshoot this?

Thanks in advance
DOC

Alexd1985 · 25 June 2021 03:21

What is the hardware configuration? Gaps like these can be caused by insufficient RAM.

The new version (1.27.0) uses ~8G of RAM.

doc_krieger · 25 June 2021 04:16

16GB ram, 6 Cores
should be more than enough

Alexd1985 · 25 June 2021 04:23

And after the upgrade do u still have the issue?

doc_krieger · 25 June 2021 04:26

Yeah, it actually started after the upgrade. I just rolled back to a previous snapshot to see if the problem persists.

doc_krieger · 25 June 2021 04:31

You don’t happen to have any other ideas about what could be causing the gaps?

Alexd1985 · 25 June 2021 04:45

Type top and show me the output for cardano-node service

doc_krieger · 25 June 2021 04:57

Just realized that, for some reason, after rolling back to my snapshot my RAM and CPU went back to what they were before the upgrade. Talking to the VPS provider now to straighten it out.

doc_krieger · 25 June 2021 04:59

Still dropping metrics on Grafana.

doc_krieger · 25 June 2021 14:33

Hi Alex, sorry to keep bugging you. The issue seems to be related to my memory usage, as the metrics drop every time the memory usage spikes on Grafana. I can’t figure out where the leak is, though. The only difference I know of between my relays and my BP is that I have TraceMempool set to true on my BP node. Going to see what happens when I set it to false. Any ideas yourself?

Alexd1985 · 25 June 2021 14:36

But u said u have 16G of ram right?

Try journalctl -e -f -u cardano-node

Do u see the killed message? Try to set the mempool to false (u will not see tx processed on grafana or glive) and monitor in grafana
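
A minimal sketch of how the journal check above could be automated, assuming the service is named cardano-node as in the command. The helper name `find_kill_lines` and the two text markers it searches for are assumptions for illustration; adjust them to whatever your own journal actually prints.

```python
import re
from typing import Iterable, List

# Assumed markers: a "killed" message from the service manager, or an
# "out of memory" message from the kernel. Matching is case-insensitive.
KILL_PATTERN = re.compile(r"killed|out of memory", re.IGNORECASE)


def find_kill_lines(lines: Iterable[str]) -> List[str]:
    """Return the journal lines that suggest the process was killed."""
    return [line.rstrip("\n") for line in lines if KILL_PATTERN.search(line)]
```

To use it, pass in the journal output as a list of lines, for example the result of running `journalctl -u cardano-node --no-pager` through `subprocess.run(..., capture_output=True, text=True)` and calling `.stdout.splitlines()`. An empty result only means that neither marker appeared, not that memory is fine.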

doc_krieger · 25 June 2021 17:48

Seems to be the mempool tracing. Since setting it to false (around 10:30) there have been no drops in Grafana. Any idea why it’s doing this, though? Now that it’s off I can’t really follow my Tx processed (which kind of sucks).
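
For anyone repeating this test, here is a rough sketch of flipping the TraceMempool setting programmatically, assuming the node config is a JSON file with a top-level "TraceMempool" key. The function name `set_trace_mempool` is hypothetical, and the config path is deliberately left to the caller since it differs per setup.

```python
import json


def set_trace_mempool(config_text: str, enabled: bool) -> str:
    """Return the config JSON text with "TraceMempool" set to `enabled`.

    All other keys are kept; the output is re-indented, so the file
    layout may change even though the settings do not.
    """
    config = json.loads(config_text)
    config["TraceMempool"] = enabled
    return json.dumps(config, indent=2) + "\n"
```

Read the config file into a string, pass it through the function, write the result back (keep a backup copy first), and restart the node so the change is picked up.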

Alexd1985 · 25 June 2021 18:08

MEM issues, what is the HW configuration of the server? Type free -g

doc_krieger · 25 June 2021 18:47

That’s with mempool back on true (waiting for the node to start up again; will send an updated pic once it’s running again).

Alexd1985 · 25 June 2021 18:52

With 16G RAM u should not have issues with TraceMempool set to true

doc_krieger · 25 June 2021 20:18

Still doing the same thing. Can’t seem to find the process that’s causing the memory leak.

Alexd1985 · 25 June 2021 20:30

Type top and check how much MEM is used by cardano-node
do u have 4G RAM and 11G swap?
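
A small sketch for double-checking how much RAM and swap the server really has, which matters here because the snapshot rollback reverted the VPS resources. It assumes a Linux host and reads the same totals that free reports, taken from /proc/meminfo. The function names are hypothetical.

```python
from typing import Dict


def parse_meminfo(text: str) -> Dict[str, int]:
    """Parse /proc/meminfo-style text into {field: integer value}.

    Most fields are in kB; a few (such as huge page counts) are plain numbers.
    """
    values = {}
    for line in text.splitlines():
        name, sep, rest = line.partition(":")
        parts = rest.split()
        if sep and parts and parts[0].isdigit():
            values[name.strip()] = int(parts[0])
    return values


def summarize_gib(meminfo: Dict[str, int]) -> Dict[str, float]:
    """Convert the RAM and swap fields from kB to GiB, one decimal place."""
    kib_per_gib = 1024 * 1024
    return {
        key: round(meminfo.get(key, 0) / kib_per_gib, 1)
        for key in ("MemTotal", "MemAvailable", "SwapTotal", "SwapFree")
    }
```

Calling `summarize_gib(parse_meminfo(open("/proc/meminfo").read()))` on the server gives a quick answer to whether the machine has the 16G it is supposed to have, or a small amount of RAM backed by a large swap.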

doc_krieger · 25 June 2021 20:37

Alexd1985 · 25 June 2021 20:43

And if u try journalctl -e -f -u cardano-node
do u see killed message?
