Hello Digital Man,
After three to four days I begin to see one or two of the sbbs processes using 100% cpu.
I am seeing this in htop and I can see the pid of the
processes but I don't know what those processes are doing. htop says there are two pids using 100% cpu but the graph shows one cpu at 100%.
I just shut down sbbs and restarted and all is well now. I notice that my ftn inbound directory fills with files but are not being tossed or sent to links so that is why I went into htop for a look and saw that.
It is likely caused by something that I have misconfigured but I am not sure where to begin looking. Is there some way I can find out what a given pid is doing or trying to do?
I'm pretty sure you mean "threads" when you say processes.
I'm not familiar with 'htop'. But if you use 'top -H', it'll show you the individual threads of multi-threaded processes along with the threads' names (sbbs thread names start with "sbbs/"), for example:
sbbs doesn't "toss files" (sbbsecho does). So... maybe double-check whatever is supposed to be running sbbsecho?
The thread names is the first clue. Then from there we can check out log entries to see the last thing that thread(s) in question were doing.
Re: 100% cpu usage
By: Digital Man to Al on Fri Jun 29 2018 12:25 am
I'm pretty sure you mean "threads" when you say processes.
Yes, I think so.
I'm not familiar with 'htop'. But if you use 'top -H', it'll show you the individual threads of multi-threaded processes along with the threads' names (sbbs thread names start with "sbbs/"), for example:
htop is similar to top but it gives you a graph of cpu(s), memory and swap usage at the top of the screen. It's pretty but I only see exec/sbbs, not the sbbs/* that I see in top.
Yes, I can see the threads described better in top. I'll use that to get a better idea of where to look the next time it happens.
sbbs doesn't "toss files" (sbbsecho does). So... maybe double-check whatever is supposed to be running sbbsecho?
It is the FIDOIN timed event that runs sbbsecho so that stops running for some reason. Either the data/fidoin.now is not being touched or acted upon although BinkIT seems to continue to run.
Or it could be that you have a timed-event that is running in an infinite loop?
This would explain both high CPU utilization and the failure to run
any more timed events.
It is the FIDOIN timed event that runs sbbsecho so that stops running
for some reason. Either the data/fidoin.now is not being touched or
acted upon although BinkIT seems to continue to run.
Or it could be that you have a timed-event that is running in an infinite loop? This would explain both high CPU utilization and the failure to run any more timed events.
I think this is what is happening. It doesn't happen all the time, I see it every four days or so. I saw it happening earlier today and I think this log entry shows what is happening.
11:1/100@wwivftn Jul 8 07:00:42 trmb synchronet: evnt JSBinkP/1.106 callout to 11:1/100@wwivftn
started
Re: 100% cpu usage
By: Digital Man to Al on Fri Jun 29 2018 12:04 pm
It is the FIDOIN timed event that runs sbbsecho so that stops running
for some reason. Either the data/fidoin.now is not being touched or
acted upon although BinkIT seems to continue to run.
Or it could be that you have a timed-event that is running in an infinite loop? This would explain both high CPU utilization and the failure to run any more timed events.
I think this is what is happening. It doesn't happen all the time, I see it every four days or so. I saw it happening earlier today and I think this log entry shows what is happening.
Re: 100% cpu usage
By: Al to Digital Man on Sun Jul 08 2018 01:02 pm
I think this is what is happening. It doesn't happen all the time, I see it every four days or so. I saw it happening earlier today and I think this log entry shows what is happening.
I should have mentioned, top says the thread is SBBS/Events.
Re: 100% cpu usage
By: Al to Digital Man on Sun Jul 08 2018 01:02 pm
11:1/100@wwivftn Jul 8 07:00:42 trmb synchronet: evnt JSBinkP/1.106 callout to 11:1/100@wwivftn
started
I had my cursor misplaced when I cut and pasted that.. there is one more line..
Jul 8 07:00:42 trmb synchronet: evnt poll_node 11:1/100@wwivftn, outbound_dir:
/sbbs/fido/outbound
Jul 8 07:00:42 trmb synchronet: evnt Locking /sbbs/fido/outbound.00b/00010064. bsy
Jul 8 07:00:42 trmb synchronet: evnt Lock successful.
Jul 8 07:00:42 trmb synchronet: evnt Attempting poll for node 11:1/100@wwivftn Jul 8 07:00:42 trmb synchronet: evnt JSBinkP/1.106 callout to 11:1/100@wwivftn
started
Jul 8 07:00:43 trmb synchronet: evnt Sent M_NUL command args: OPT CRYPT
Sorry about that.. :)
Ah, okay, that helps a lot. It's not blocking on connect then. I may even see the problem. Try the next commit of load/binkp.js out. :-)
Re: 100% cpu usage
By: Digital Man to Al on Sun Jul 08 2018 02:55 pm
Ah, okay, that helps a lot. It's not blocking on connect then. I may even see the problem. Try the next commit of load/binkp.js out. :-)
I just updated and see this error now.
Jul 8 15:54:53 trmb synchronet: srvc 0054 BINKP connection accepted from: 68.1 02.199.252 port 61737
Jul 8 15:54:53 trmb synchronet: srvc 0054 BINKP JavaScript service thread star ted
Jul 8 15:54:53 trmb synchronet: srvc 0054 BINKP !JavaScript /sbbs/exec/binkit .js line 31: InternalError: too much recursion
Re: 100% cpu usage
By: Digital Man to Al on Sun Jul 08 2018 02:55 pm
Ah, okay, that helps a lot. It's not blocking on connect then. I may even see the problem. Try the next commit of load/binkp.js out. :-)
I just updated and see this error now.
Jul 8 15:54:53 trmb synchronet: srvc 0054 BINKP connection accepted from: 68.1 02.199.252 port 61737
Jul 8 15:54:53 trmb synchronet: srvc 0054 BINKP JavaScript service thread star ted
Jul 8 15:54:53 trmb synchronet: srvc 0054 BINKP !JavaScript /sbbs/exec/binkit .js line 31: InternalError: too much recursion
That error should be fixed now.
Sysop: | sneaky |
---|---|
Location: | Ashburton,NZ |
Users: | 25 |
Nodes: | 8 (0 / 8) |
Uptime: | 116:26:49 |
Calls: | 1,903 |
Calls today: | 1 |
Files: | 11,079 |
Messages: | 934,609 |