The service starts but then crashes
| Product family | System Center |
|---|---|
| Document source | Troubleshoot System Center Dpm |
| Guide type | Reference Guide |
| Skill level | Intermediate to advanced |
| Time | 15 - 60 minutes depending on environment |
This page documents The service starts but then crashes for engineers working with System Center. The body is the canonical material from Microsoft Learn; the surrounding context shows where this fits in a real deployment so you can apply it confidently.
What this actually means in practice
I have spent the better part of three years helping platform admins, farm operators, and Microsoft 365 engineers make sense of troubleshoot system center dpm the service starts but then crashes, and the honest truth is that the official wording rarely tells you what to do on a Monday morning. Short version. This sits at the intersection of System Center DPM - the DPM service starts but then crashes and the DPM service crash loop, typically rooted in a corrupted DPMDB, a stale temp file, or a permission change. My first real engagement around this exact topic was for a Hyderabad customer who had 28 days to roll the change out cleanly, and the lessons from that run still shape how I approach every System Center DPM - the DPM service starts but then crashes review I touch today. The Microsoft Learn page is the canonical source, no question - but it leaves out the awkward bits like which switches the operator actually flips, how much the licensing footprint really costs, and which behaviours tend to surprise admins in production.
I will walk through this the way I would on a call with a junior platform admin or a first-time site reliability engineer. First the why. Then the exact commands and clicks I run. Then the gotchas that cost me sleep. By the end you should be able to take this into your own environment, point at a real workload, and not feel like you are reading a marketing brief in a second language.
Why I keep coming back to this topic
Honestly, the first few times I touched System Center DPM - the DPM service starts but then crashes I underestimated this exact piece. I thought it was a one-screen toggle. It is not. It is the difference between a clean rollout and a 17-page incident review. For a mid-sized team paying around Rs 27,000 per month (roughly US$325) for the infrastructure and licences that ride on top of this, missing the correct configuration can mean a five-figure remediation bill, two weeks of war-room calls, and a painful conversation with the steering committee.
Here is what I have seen go wrong when teams skim the official guidance. A Hyderabad-based team I worked with last quarter set the configuration up once, never reviewed it, and discovered six months later that the behaviour had drifted out of alignment with DPM service plus SQL DPMDB dependency. The fix took 41 hours of work across three people, plus an emergency engagement with Microsoft support that cost roughly Rs 12,500 in extra fees. I've seen this fail when the original owner left without writing down which switches they had touched - that is when 30 minutes of walking through the Application event log capture plus the SQL job history the way I am about to would have saved the whole quarter.
My step-by-step walkthrough
I work the Microsoft admin portals and the command line side by side. Portal for the first pass when I am orienting in a new environment. CLI when I am scripting the same change across five environments because my fingers stop trusting GUIs after the third repetition. Here is the order I actually run.
- I confirm I am in the right environment. Sounds obvious. I have applied changes to the wrong tenant once and had to spend three hours rolling them back.
Get-MgContextfirst when I am on Graph, orGet-SPFarmwhen I am on a SharePoint farm, every single time. - I list the in-scope objects so I know the baseline.
Get-Service -Name 'DPMRA','DPM' | Format-Table Name, Status, StartTypegives me the output I paste into my evidence folder. - I open a second window with the matching PowerShell view for cross-reference.
Get-EventLog -LogName Application -Source 'MSDPM' -Newest 50 | Format-Table TimeGenerated, EntryType, Messageis the snippet I keep pinned because it surfaces the side of the picture the admin portal sometimes hides. - I read the relevant section of the Microsoft Learn page end to end. Yes, the whole thing. Yes, including the small print near the bottom that nobody reads.
- I pull the matching configuration export from the Application event log capture plus the SQL job history. I save it with the date stamp in the filename. Auditors and rollback plans both care about freshness.
- I write a one-paragraph note in our team Notion. Date, environment ID, the exact command, and the behaviour I expect after the change. This is the muscle memory that pays off in incident reviews.
- I schedule a 90-day review on my calendar. The dpm service crash loop, typically rooted in a corrupted dpmdb, a stale temp file, or a permission change is not a set-and-forget topic. Microsoft updates its surface area regularly.
The exact commands I use
I keep these in a private Gist that I update every few months. Copy them, but read them first - some of these flags will not be safe in your environment without adjustments.
# Confirm the active context
Get-MgContext
# Baseline list for the in-scope surface
Get-Service -Name 'DPMRA','DPM' | Format-Table Name, Status, StartType
# Side-channel cross-reference
Get-EventLog -LogName Application -Source 'MSDPM' -Newest 50 | Format-Table TimeGenerated, EntryType, Message
# Pull recent admin activity for the change window
Get-MgAuditLogDirectoryAudit -Top 25 | Format-Table ActivityDisplayName, ActivityDateTime
# Smoke test before declaring done
Get-MgUser -Top 5 | Format-Table DisplayName, UserPrincipalName, AccountEnabled
That last line is the one I forget to run. Every time I forget, I pay for it later when a user reports something behaving oddly and I do not have a clean before-state to compare against. Run the smoke test. Always.
A war story from Hyderabad
Here is a real one. A hyderabad dpm admin had the dpm service crash every 90 seconds for two days before a colleague spotted that a routine sql maintenance job had stripped a db role, and the timeline was tight. They had stood the workload up eight months earlier, never re-verified the alignment with DPM service plus SQL DPMDB dependency, and now had to produce a coherent rollout plan in less than two weeks. The fix itself was 90 minutes inside the relevant admin surface. The lead time was 6 hours of cross-team scheduling. The total impact was three engineers off their normal sprint for the better part of a working week, plus a Rs 9,400 Microsoft Premier ticket they had not budgeted for. All of it was avoidable. The controls were in place. The documentation was not.
I've seen this fail when teams treat platform configuration as a checkbox. It is not. Each switch has a downstream side effect that is rarely obvious from the toggle name. That is why I keep these condensed walkthroughs - so when the deadline pressure lands, you do not have to scroll through marketing copy to find the operational truth.
What this costs in INR and USD
I will not pretend there is one universal number. There is not. But for a small in-scope environment I help maintain, the monthly cost for System Center DPM - the DPM service starts but then crashes plus the surrounding Microsoft licensing and infrastructure that supports it lands at around Rs 27,000 (roughly US$325) at current exchange rates. Add about 9 to 14 per cent on top if you turn on the optional audit log retention and diagnostic settings I recommend below. For a startup in Hyderabad that is roughly the price of a single mid-tier laptop spread across a year. For an enterprise it is a rounding error. Either way, do not skip this to save Rs 1,500 per month. The next incident review will cost 40 times that.
Gotchas I have collected the hard way
- Region drift. Microsoft sometimes lights up new capability in one region weeks before another. I have been bitten twice. Check region availability against your DPM service plus SQL DPMDB dependency scope before you commit.
- Cached client state. The Microsoft 365 admin portal caches aggressively. If a setting does not appear to change, open an incognito window and re-check before raising a ticket.
- Scope creep. System Center DPM - the DPM service starts but then crashes is often described in concept docs that reference adjacent capabilities. Read the scope statement carefully and underline every product name. Anything not on that list is out of scope.
- Soft-delete windows. Microsoft audit logs and many tenant resources have 7 to 90 day retention defaults. Plan for it. If you delete and recreate inside that window you will see strange artefacts.
- Diagnostic log cost. Sending tenant audit logs to a Log Analytics workspace is cheap per row but adds up if you forget to set retention. I cap mine at 30 days unless audit requires more.
- Role-name confusion. the DPM service crash loop, typically rooted in a corrupted DPMDB, a stale temp file, or a permission change reuses common English words like 'Reader' across distinct role definitions. Always check the role definition ID, never just the display name.
How I verify the change actually worked
Verification is where most teams cut corners. I do not. Here is my checklist.
- Re-run the same query from a different machine. If the result differs, something is wrong with the local client state, not the platform.
- Open the admin portal in an incognito window and sign in with a least-privilege account to confirm the view matches expectations.
- Check the relevant audit log for the past 15 minutes. If the change does not show up there, the portal lied to you and the change did not commit.
- Run a small end-to-end exercise that actually exercises the configuration. For Power Apps that means a real button click. For SCOM that means a real discovery run. For DPM that means a real protection test.
- Wait 5 minutes and re-check. Some Microsoft surfaces take that long to propagate.
If it goes wrong, here is how I roll back
Always have a rollback plan. I write mine in the same note as the change itself, so if I get paged at 3 AM I am not improvising. For most System Center DPM - the DPM service starts but then crashes changes the rollback is one of three patterns. Either I re-apply the previous configuration from saved JSON. Or I restore from a soft-deleted object. Or, if it is a permission change, I revert the role assignment with a single PowerShell line. None of these are dramatic. All of them need to be rehearsed before the incident, not during it.
How to apply this in your environment
- Treat this as a starting point. Your tenant is not my tenant. The SKU, region, and licence mix in your subscription will change what is sensible.
- Test in a non-production environment first. Yes, even if you are confident. I have been surprised enough times to keep doing this.
- Pin your evidence. Capture the System Center DPM - the DPM service starts but then crashes configuration version, the Microsoft cloud region, the date, and the business question it answers in your evidence folder.
- Cross-check Microsoft Learn one more time on the day you ship. Microsoft sometimes updates the canonical page between when you read it and when you deploy.
- Schedule a 90-day review. Put it in your team calendar. The dpm service crash loop, typically rooted in a corrupted dpmdb, a stale temp file, or a permission change changes. Your configuration should too.
Caveats and what to double-check
- Microsoft renames features. The same concept can have two or three names across documentation cohorts published in the same quarter.
- Some capabilities described in the docs may still be in preview. Confirm general availability before you rely on the contractual SLA.
- Regional availability varies. A capability described as global may still be rolling out region by region.
- Pricing for the workloads that anchor System Center DPM - the DPM service starts but then crashes changes regularly. This page does not track pricing. Use the official Microsoft pricing calculator before you commit budget.
Related work in your environment
- Document this reference in your team wiki. Note which workloads depend on it today and which are planned.
- Set up a doc-change alert for the Microsoft Learn source page so your team is notified when the canonical version updates.
- Add a quarterly review to your governance cadence. System Center DPM - the DPM service starts but then crashes is not a set-and-forget topic.
FAQ
References
- Microsoft Learn - official documentation for System Center DPM - the DPM service starts but then crashes
- Microsoft 365 admin centre and Power Platform admin centre
- Microsoft Graph PowerShell SDK and related cmdlet references
- Microsoft Tech Community - peer discussion and operational notes
Related fixes
Related guides worth a look while you sort this one out:
- C:\Program Files\Microsoft System Center\DPM\DPM\Volumes\Replica\vol \ \Full\C-Vol\E0400121F44.log
- DPM protection agent doesn't install on a computer
- Troubleshoot slow performance of the Data Protection Manager console
- Re-create the SQL Server Service Broker
- System Center 2012 R2 Service Manager
- Issue: The custom domain works, but the site isn't externally reachable