Exchange 2016 problem with KB4561616 or KB4562561?

404 views
Skip to first unread message

Mayo, Bill

unread,
Jul 14, 2020, 2:11:02 PM7/14/20
to ntexc...@googlegroups.com
Running Exchange 2016 CU-16. Just applied updates, specifically KB4561616 and KB4562561 on the server using the normal process. After reboot, many of the Exchange services will not start. A key error that repeats in the Event Log is "MSExchange Common" event ID 4999:

Watson report about to be sent for process id: 13888, with parameters: E12IIS, c-RTL-AMD64, 15.01.1979.003, MSExchangeFrontendTransport, M.Exchange.Rpc, M.E.R.RpcServerBase.RegisterServer, M.E.Rpc.RpcException, a87b-dumptidset, 15.01.1979.002.
ErrorReportingEnabled: False

Googling on this does lead to a report of a known bug, but with CU7 and the fix being to update to CU10. I am preparing to install CU17 to see if that will help, but wanted to see if anyone was aware of this and if there is anything else that I should be doing.

Bill Mayo
Pitt County MIS

Michael B. Smith

unread,
Jul 14, 2020, 2:28:30 PM7/14/20
to ntexc...@googlegroups.com
Generally re-applying the last CU will correct this. Installing CU17 is a good plan.
--
You received this message because you are subscribed to the Google Groups "ntexchange" group.
To unsubscribe from this group and stop receiving emails from it, send an email to ntexchange+...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/ntexchange/94c56e4480e149aa88c412a513f45cd4%40pittcountync.gov.

Mayo, Bill

unread,
Jul 14, 2020, 3:44:30 PM7/14/20
to ntexc...@googlegroups.com
Thanks, Michael. I proceeded with the install and it went well to a point. It got to the "Mailbox Role: Client Access Front End Service" and that failed.

The output/log shows the error:
Failed to start service 'Microsoft Exchange Service Host (MSExchangeServiceHost)

Would a restart and re-attempt be called for?
To view this discussion on the web visit https://groups.google.com/d/msgid/ntexchange/174380034e52406bb4e26c6efd9d9f69%40smithcons.com.

Michael B. Smith

unread,
Jul 14, 2020, 3:45:48 PM7/14/20
to ntexc...@googlegroups.com

Mayo, Bill

unread,
Jul 14, 2020, 4:01:26 PM7/14/20
to ntexc...@googlegroups.com
Ugh. Failed with the same error again. Not sure where to go from here. Uninstall Windows updates? Try to re-apply CU16?
To view this discussion on the web visit https://groups.google.com/d/msgid/ntexchange/fc808e4b542e43988ad420005fbac7f1%40smithcons.com.

Michael B. Smith

unread,
Jul 14, 2020, 4:02:37 PM7/14/20
to ntexc...@googlegroups.com
If you have services disabled, un-disable them. Then attempt to re-apply.
To view this discussion on the web visit https://groups.google.com/d/msgid/ntexchange/8182abe359974a51953669a436ee6766%40pittcountync.gov.

Mayo, Bill

unread,
Jul 14, 2020, 4:07:18 PM7/14/20
to ntexc...@googlegroups.com
All "Microsoft Exchange *" services are either automatic or manual. I am still getting the 4999 errors, as well as:

MSExchange Store Driver Submission
Event ID 1005
Unable to start the store driver Submission
To view this discussion on the web visit https://groups.google.com/d/msgid/ntexchange/3d85836164b8457b994ad74d7c968ae9%40smithcons.com.

Michael B. Smith

unread,
Jul 14, 2020, 4:09:26 PM7/14/20
to ntexc...@googlegroups.com
I would uninstall the relevant updates, then try CU17 again. Have you looked at ExchangeSetup.log to see if it provides any more detail than "can't start"?
To view this discussion on the web visit https://groups.google.com/d/msgid/ntexchange/65e97f48d7d44cfaaf3d5bd63da4ec83%40pittcountync.gov.

Mayo, Bill

unread,
Jul 14, 2020, 4:11:45 PM7/14/20
to ntexc...@googlegroups.com
The only thing I see that is more specific than that is the following. Other line is just giving the PowerShell command that failed, which basically is saying to start the service based on an IF statement.

[ERROR-REFERENCE] Id=CafeComponent___D3D56093B30e48fa825d407b8fa4b1f0 Component=EXCHANGE14:\Current\Release\Shared\Datacenter\Setup
To view this discussion on the web visit https://groups.google.com/d/msgid/ntexchange/c1c42a8f33f7434a849cb854d226dc68%40smithcons.com.

Mike White

unread,
Jul 14, 2020, 5:05:14 PM7/14/20
to ntexc...@googlegroups.com
Have you checked the exchange setup log for the error? Sometimes you can trace the activity better there.

Mayo, Bill

unread,
Jul 14, 2020, 5:10:36 PM7/14/20
to ntexc...@googlegroups.com
So, I uninstalled KB4561616 (couldn't uninstall the other), rebooted, and tried CU17 again. Still failing at the same point.
To view this discussion on the web visit https://groups.google.com/d/msgid/ntexchange/7235238c97de4e9ebff7371242a76756%40pittcountync.gov.

Michael B. Smith

unread,
Jul 14, 2020, 5:12:27 PM7/14/20
to ntexc...@googlegroups.com
I don't have any good suggestions. I'd call support at this point. "system down".
To view this discussion on the web visit https://groups.google.com/d/msgid/ntexchange/0cbb4294c1aa44ce8d2fa994dacb3caa%40pittcountync.gov.

Mayo, Bill

unread,
Jul 16, 2020, 8:11:06 AM7/16/20
to ntexc...@googlegroups.com

As an update, I contacted Microsoft and am now in worse shape than before. A second server started exhibiting the same problems without anyone having done anything on it. I had a tech yesterday AM that escalated it. That tech got disconnected twice while working on it. Just before or after the second disconnect is when the second server started having issues. Had to wait 3 hours to get a different person on the phone and they were definitely not second level. Was doing things like trying to disable IPv6 (which I requested he not do) and disable the firewall. He finally told me to reinstall the update and let him know what happened. I have asked the manager of the 3rd tech to please escalate and have me contacted this morning, as (shockingly) reapplying the update didn't work.

 

I am hoping my last Exchange server holds out, as I am completely baffled by the second one dying. I have to assume that either it was something the 2nd tech did or there is some setting that get replicated that is the source of the issue. Although, if that is the case, I have no idea why one server is still standing. Things I saw the 2nd tech do include:

  • Adding my account to all kinds of Exchange groups
  • Running the healthchecker script
  • Installing the newest Visual C++ redistributable
  • Changing the service pipe timeout
  • Disabling the setting on one of the services to interact with the desktop
  • Enabling (then re-disabling) FIPS compliance

 

While I wait for Microsoft to get back to me, I welcome any input. Assuming that my last server holds out and I see any responses(!).

prab som

unread,
Jul 16, 2020, 9:21:26 AM7/16/20
to ntexc...@googlegroups.com
I am doing an Exchange upgrade from 2013 to 2019 next week. I am holding my breath.

Mayo, Bill

unread,
Jul 20, 2020, 3:39:41 PM7/20/20
to ntexc...@googlegroups.com

Finally got this issue resolved after many, many hours on the phone with Microsoft. We had configured the RPC dynamic port range (as documented at https://support.microsoft.com/en-us/help/154596/how-to-configure-rpc-dynamic-port-allocation-to-work-with-firewalls) across the enterprise as part of an effort to better lock down connectivity between VLANs. I had considered the wisdom in applying this to servers but had proceeded anyway. Since this had been done several weeks back and we are in the middle of several security initiatives, I did not put 2 and 2 together on this one (in other words, I had totally forgotten about it). Basically, this setting went into effect when I rebooted the Exchange Server to apply updates. And, let’s just say that Exchange doesn’t care for this particular setting. After removing these registry keys and rebooting, everything started working again. There was no problem with the Windows update or CU17, all of which got applied to the servers after the issue was resolved.

 

As always, thanks to those that offered assistance.

Mike

unread,
Jul 20, 2020, 4:15:40 PM7/20/20
to ntexc...@googlegroups.com
Glad you got it resolved.

Michael B. Smith

unread,
Jul 21, 2020, 3:32:26 PM7/21/20
to ntexc...@googlegroups.com

How did you ever figure it out?

Mayo, Bill

unread,
Jul 21, 2020, 3:58:37 PM7/21/20
to ntexc...@googlegroups.com

They had told me on Friday that they believed it was an OS issue and were submitting a request to the Windows team. They indicated they believed the core problem to be the Exchange AD topology service not starting with "Error 1061: The Service Cannot Accept Control Messages At This Time".

 

Friday night, I got an email asking “do you have any registry settings at HKEY_LOCAL_MACHINE\Software\Microsoft\Rpc\Internet?” As soon as I looked at that location, I realized what it was and what had happened.

 

I don’t know how they came to that conclusion, and they didn’t explain any further. It is my assumption that the error message or something else in the setup logs (which had been uploaded) was familiar to someone in Windows support.

 

While it was a self-inflicted wound and it is hard to be critical of support folks who had no idea that this change had been made, I have to say that it was a very painful experience. I wound up talking to 4 different people, suffered through extremely poor quality audio and having an agent get disconnected twice while working due to “technical issues”, and what seemed to be truly bizarre troubleshooting steps. When you are sitting there at 11:00 PM with 2 servers down, and the tech just keeps hitting restart on a service that won’t start and hasn’t started the last 5 times he did it (without changing anything), it is pretty frustrating. Tack on things like asking me repeatedly, “did you uninstall the other update?”, which I kept exasperatingly responding to with “Windows doesn’t let you uninstall that update. Do you see how there is no uninstall button? If you are able to uninstall it some other way, it is fine”. That particular guy left me at midnight with the instruction to re-install the update that started the whole problem and then just email him back if that didn’t happen to solve the problem.  

Michael B. Smith

unread,
Jul 21, 2020, 4:04:53 PM7/21/20
to ntexc...@googlegroups.com

Tier 1 support is appalling. Always has been; since it moved out of Charlotte, San Antonio, and Australia. During COVID it’s been even worse because of the poor communications (as you mention) from their home systems.

 

I generally start a call with a request for escalation. 😊

 

I empathize greatly.

Mayo, Bill

unread,
Jul 21, 2020, 4:18:58 PM7/21/20
to ntexc...@googlegroups.com

I actually escalated twice. The first person to look at it, to his credit, pretty quickly determined it was beyond his abilities and escalated it himself. It was the second guy I was working with, who seemed to have some understanding of what he was doing, that got disconnected twice while he was working on it.

 

The second time he was disconnected was around 4:20 PM. I got an email from his co-worker at around 4:45 that he had technical issues and would get back with me. I initially responded for him to contact me in the morning, but then a few minutes later we got alerts that a second server was down. I responded back immediately that we now had a second server down and I needed someone to contact me ASAP. I got no response for nearly an hour and I finally decided I had no choice but to call back in and try to get a live person. I didn’t know why the 2nd server went down and I was deeply concerned that the last one would to. I explained the situation, and asked for the next available agent. I basically went back in the queue and had to wait another 2 hours for someone to call me. That person was obviously tier 1 and was the worst experience by far. The first time he called he said he was going to have to call me back because “my voice was coming out of his computer speaker”. It took another 15 minutes for him to call back and begin troubleshooting, none of which I was terribly impressed with. When I left him around midnight with the “reinstall the update and email me”, I knew that I was never going to get anywhere with him. I did as he asked and when, shockingly, that didn’t work, I emailed him supervisor asking to have it escalated and to contact me in the morning. At that point, I figured it didn’t matter if the last server failed, there was nobody I was going to be talking to that night that was going to be able to help.

 

The person I got on Thursday morning worked with me through the end. While there were “why is he doing that” moments (one I didn’t mention is that 2 of the techs wanted to disable IPv6, which I know from lots of MBS posts was a no-no), he was pleasant enough to deal with and seemed to mostly follow a logical path.

Reply all
Reply to author
Forward
0 new messages