Hardware Failure

Check Point 1535 partial boot then reload loop: Diagnose & Fix

By Sai Kiran Pandrala · reviewed by Sai Kiran Pandrala, Editor Last verified: 2026-05-30

⚡ At a glance
VendorCheck Point
Operating systemGaia OS / SmartConsole
CategoryHardware Failure
Skill levelIntermediate to advanced
DIY-able?Yes with CLI access; some scenarios need Check Point TAC + RMA.

Hardware-class faults on Check Point kit fall into a tidy little matrix once you have seen a few. Gaia OS / SmartConsole gives you the building blocks via `show version all` and `show asset system`; the rest is pattern matching. The 1535 platform is one of the more common offenders only because the install base is large.

Do not skip the visible-and-audible inspection. Burnt-PCB smell and fan-tray rattle are diagnostic signals that no command will ever surface. I have caught more dying PSUs by ear than by `show asset system`.

If the chassis is dark and the console is silent, jump straight to the PSU/cable substitution path before opening a Check Point TAC ticket: it eliminates the most common cause in under five minutes.

What this guide covers

Real-world context. Last time I walked through this on a real machine, the budget shook out to ~Rs 0 INR under Check Point support, otherwise ~Rs 10,000 to Rs 1,50,000 INR for parts (around $120 to $1,800 USD). Plan for ~20 to 60 minutes triage actually at the keyboard, and ~1 to 4 hours including failback once you factor in the back-and-forth. Keep the appliance serial, a gateway backup, and SmartConsole access within arm’s reach before you start, stopping mid-step to hunt for them is how a 30-minute job turns into an afternoon.

Diagnose and recover from partial boot then reload loop on a Check Point 1535.

Repair sequence

  1. Capture the boot console output to a file. this is the single most useful diagnostic.
  2. Verify image integrity (md5sum or vendor checksum).
  3. If the image is corrupt, re-download from the vendor site and copy back.
  4. If the boot output references a hardware error (memory test fail, FPGA fail), open an RMA.
  5. Try booting an older known-good image stored on flash.

CLI / commands

# Verify hardware state
show version all
show asset all
show asset system

# Collect for Check Point TAC
cpinfo -z -o /var/log/cpinfo.tgz

When to RMA

Frequently asked questions

Will this work on my specific Gaia OS / SmartConsole version?

The procedure reflects current Gaia OS / SmartConsole behaviour. Older releases may need minor syntax adjustments, use the CLI help (? or tab-completion) to verify.

Should I open a Check Point TAC case immediately?

Open one if you suspect hardware failure or the symptom persists after a maintenance-window reload. Make sure your support entitlement is active first.

Where can I find the Check Point official documentation?

https://support.checkpoint.com/results: search the product family + feature name.

Is this procedure safe in production?

Test in a lab or maintenance window first. Capture pre-change state so you can roll back.

References


Reference material, not professional advice. Validate against your specific Gaia OS / SmartConsole version and test in a non-production environment before applying.

Why this matters for your day-to-day

A Check device that's misbehaving costs more than the fix itself: lost productivity, missed calls, security risk, even safety risk in some categories. Treating the symptom quickly with a documented procedure is cheaper than letting it persist. The steps above are written to get you back to working in under an hour where possible, and to flag clearly when escalation is the right call.

Cause analysis

A few things to confirm so the Check device fix goes cleanly:

Post-repair audit

On a Check device, the test is rarely "reboot and see". Use this list:

Escalation guide

For a Check device, the right escalation depends on impact:

More frequently asked questions

Are there safer alternatives for non-technical users?

Yes, the manufacturer's self-service troubleshooter (HP Smart, LG ThinQ, Samsung Members, similar) usually walks through the same steps in a guided UI. Use that first if you're not comfortable with menu paths.

Should I update firmware first or last?

Update firmware first if a release note specifically mentions your symptom. Otherwise, finish the troubleshooting flow first, then update; that way you can isolate whether the update or the underlying fix solved it.

Is it safe to apply during business hours?

If the device is in production use, apply during a scheduled maintenance window. Most procedures need 2-15 minutes of downtime. Capture pre-change state so you can roll back if needed.

How long does this fix usually take?

Most users complete the steps in 20-45 minutes the first time, and 5-10 minutes on subsequent runs once the menu paths are familiar.

What if my model isn't exactly the same revision?

Cross-check the model code on the rating plate against the manufacturer support page. Major firmware generations sometimes shift the menu path; the option is usually under a similarly-named section.

Field notes from real incidents on Check Point

When I work on Check Point 1535 partial boot then reload loop: Diagnose & Fix the rhythm I lean on is the one I have built over years of these tickets. Most spanning-tree storms I have walked into started with a user-side switch that nobody documented; topology audits pay off the day the loop forms. Counters lie if you do not clear them; clear counters, reproduce, and read the deltas, not the cumulative numbers. Half the BGP weirdness I have triaged was a route-map that someone copied from a template without reading what it actually filtered.

Tools I actually reach for

For Check Point 1535 partial boot then reload loop: Diagnose & Fix on Check Point the cheapest signal I can land usually comes from a known order of operations, not a kitchen-sink approach. I start with traceroute vrf <vrf> <target> because it is the lowest-friction way to confirm the failure is real and reproducible. If that returns ambiguous data, I escalate to show tech-support (capture for TAC), ping vrf <vrf> <target>, packet capture on the ingress interface (TAC will ask for it), and finally to show running-config | include <feature> only when the cheaper tools cannot reach the layer the failure lives in. That ordering matches the failure surfaces I have actually seen on Check Point units over the last few years, not an abstract taxonomy. The cheap signals gate the expensive ones so the investigation does not balloon into a multi-hour exercise.

Verification I run before I close the ticket

Before I mark Check Point 1535 partial boot then reload loop: Diagnose & Fix resolved on a Check Point unit, the verification loop below is what I actually run. Each step proves a different layer is green, and the order matters - the cheap checks gate the more expensive ones so I never burn an hour on a deep test that a shallow one would have failed in seconds.

show interfaces <int> | include errors|drops|CRC

If that one comes back clean, move to the next check. If it does not, stop and dig in there before layering more verification on top of a red signal.

show logging | include %LINK|%LINEPROTO|%BGP|%OSPF

If that one comes back clean, move to the next check. If it does not, stop and dig in there before layering more verification on top of a red signal.

show ip route <prefix>  # confirm best path post-change

Only when every line above runs clean do I close the ticket and update the runbook with the timestamps. A green verification that nobody can reproduce is not a fix, it is luck waiting to regress.

Where I check first when the docs disagree

When two sources contradict each other on a Check Point detail, the disambiguation order I lean on is stable across products and across years. vendor release notes for the running software version is where I start for the ground-truth view. vendor official command reference (Cisco DocCD, Arista EOS Central, Juniper TechLibrary, etc.) is where I start for the ground-truth view. RFCs for the protocol in question (rfc-editor.org) is where I start for the ground-truth view. Random blog posts and reseller wikis are signal, not ground truth, and I treat them as such until the references above either confirm or contradict the claim. The cost of trusting an unauthoritative source on Check Point 1535 partial boot then reload loop: Diagnose & Fix is rarely worth the time it saved.

Pitfalls I have walked into on this exact path

The shortcuts that look smart on Check Point 1535 partial boot then reload loop: Diagnose & Fix have a habit of biting back. The pitfalls below are the ones I have personally walked into on a Check Point unit, not things I read about. Counters lie if you do not clear them; clear counters, reproduce, and read the deltas, not the cumulative numbers. Show tech-support is the artifact TAC will ask for first: capture it before you change anything so the pre-change state is preserved. When in doubt I revert to the slower path that the manual prescribes - the time I save by skipping it is always smaller than the time I spend cleaning up afterwards.

What I tell the next on-call

When I hand Check Point 1535 partial boot then reload loop: Diagnose & Fix off to the next person on rotation, the three lines I leave in the runbook are these. First, the symptom signature on Check Point - not a paraphrase, the exact string that surfaces in logs or on the screen. Second, the diagnostic that gave the highest signal in the least time. Third, the exact verification command whose green output justified closing the ticket. That trio is what turns a one-off fix into a runbook entry the next engineer can use without paging me at three in the morning.

I also add a one-line note on the cost of getting this wrong. For Check Point 1535 partial boot then reload loop: Diagnose & Fix on a Check Point unit, the cost is rarely the replacement part or the patch itself. It is the downtime, the second site visit, and the trust deficit you spend with whoever owns the asset when the fix does not hold. That framing keeps the next on-call from choosing the cheap-looking shortcut that ends up costing the most in elapsed hours and goodwill.

Related guides worth a look while you sort this one out:

People also ask

Will this work on my specific Gaia OS / SmartConsole version?

The procedure reflects current Gaia OS / SmartConsole behaviour. Older releases may need minor syntax adjustments, use the CLI help (`?` or tab-completion) to verify.

Should I open a Check Point TAC case immediately?

Open one if you suspect hardware failure or the symptom persists after a maintenance-window reload. Make sure your support entitlement is active first.

Where can I find the Check Point official documentation?

https://support.checkpoint.com/results. search the product family + feature name.

Is this procedure safe in production?

Test in a lab or maintenance window first. Capture pre-change state so you can roll back.