Hey everyone, sorry for the slow response, life has been busy.
I've got new tantalum capacitors installed on the main CPU board, and that seemed to make it run more stable for a while, but now it's back to inconsistently hanging.
To answer some questions, yes the fan is working and I don't suspect the power supply area to be overheating.
I can also verify that after the computer hangs, I can put the terminal
in offline mode and I can still type on the screen and move the cursor
around, which leads me to believe the problem is with the CPU board.
I also have been checking the voltages at the CPU board power connector when it hangs, and I'm getting the expected ~+20v, GND, and ~+5v. I have also been checking the linear regulators at the top of the board, and I am getting a consistent -12v, +12v, and +12v as I would expect. I've been checking those specifically because of the capacitor problem I had at the beginning pulling up the -12v line.
While I had it running better the other night I was able to run the dynamic ram test for a while and it errored out with the message "Error @ 040036 = 000" and a different time with "Error @ 040106 = 000". I'll add that along with a bunch of other expansion cards (which I will dive into once I get it running consistently) I have the 16k ram expansion board. It has a ribbon cable that plugs into the CPU board next to all the other ram chips. Does anyone know if I can just pull that out and go without it for troubleshooting purposes, or will I need to change some jumpers to get it working without that? I'll also note that the memory test always says LWA = 040111 at the top, which seems to be 16kb written in octal, what is LWA an acronym for?
And does anyone have a list of all the functions you can do in the bootrom without loading an OS? I have been going through all the documentation I can find and haven't been able to find a comprehensive list. From what I can tell, the original Heath manual doesn't even mention the memory test. I can't figure out what I did to cause it, but the other night I got it to run the memory test in hex instead of octal (that was strange because it had "LWA = 2049" which is 8kb? and it errored out at "2047 = 00"
I appreciate the warm welcome and hope no one is bothered by my infodump here, I've just been having trouble getting my head wrapped around what troubleshooting step I should take next.
Thanks everyone,
-Bradley Marshall