Juniper SRX1500 smoke smell or burned PCB: Diagnose & Fix
By Sai Kiran Pandrala · reviewed by Sai Kiran Pandrala, Editor Last verified: 2026-05-30
| Vendor | Juniper |
|---|---|
| Operating system | Junos OS |
| Category | Hardware Failure |
| Skill level | Intermediate to advanced |
| DIY-able? | Yes with CLI access; some scenarios need JTAC + RMA. |
When a Juniper SRX1500 starts misbehaving, the temptation is to reboot and hope. Resist it. Capture `show version` and `show chassis environment` first; that 30-second buffer is the difference between a real root cause and another reload at 3am next week.
Junos OS has a habit of logging the actual failing component into the system log seconds before the LED transitions. Tail the log while you run the diagnostic commands. you will often see the answer scroll past in real time.
Below is the exact sequence I run on customer gear. Steps are ordered cheapest-first so you exit early if it really is just a loose cable.
What this guide covers
Diagnose and recover from smoke smell or burned PCB on a Juniper SRX1500.
Step-by-step
- STOP. Power off the device at the wall before touching it.
- Open the chassis and identify which board / module is the source of the smell.
- Photograph the visible damage (scorched capacitors, blackened ICs).
- Note the device serial number and exact model for the support case.
- Do not power back on, burned components fail closed and can damage adjacent boards.
- Open a JTAC case with the photos and serial.
- RMA the affected component (line card, supervisor, PSU) or the full chassis if the backplane is damaged.
CLI / commands
# Verify hardware state
show version
show chassis hardware
show chassis environment
# Collect for JTAC
request support information | save /var/tmp/rsi.txt
When to RMA
- Repeated failure after re-seat and power-cycle
- Visible burn, scorching, or physical damage
- POST or memory diagnostic failure
- Hardware crashinfo without a software workaround
Frequently asked questions
Will this work on my specific Junos OS version?
The procedure reflects current Junos OS behaviour. Older releases may need minor syntax adjustments: use the CLI help (? or tab-completion) to verify.
Should I open a JTAC case immediately?
Open one if you suspect hardware failure or the symptom persists after a maintenance-window reload. Make sure your support entitlement is active first.
Where can I find the Juniper official documentation?
https://kb.juniper.net/, search the product family + feature name.
Is this procedure safe in production?
Test in a lab or maintenance window first. Capture pre-change state so you can roll back.
Related guides
Related fixes
Related guides worth a look while you sort this one out:
- Juniper EX2300 smoke smell or burned PCB: Diagnose & Fix
- Juniper EX3400 smoke smell or burned PCB: Diagnose & Fix
- Juniper EX4300-MP smoke smell or burned PCB: Diagnose & Fix
- Juniper EX4400 smoke smell or burned PCB: Diagnose & Fix
- Juniper Mist AP43 smoke smell or burned PCB: Diagnose & Fix
- Juniper Mist AP63 smoke smell or burned PCB: Diagnose & Fix
References
- Juniper support portal: https://support.juniper.net
- Juniper knowledge base: https://kb.juniper.net/
- Juniper security advisories: https://supportportal.juniper.net/s/global-search/Security%20Advisory
- Open a case: https://supportportal.juniper.net/s/case
Reference material, not professional advice. Validate against your specific Junos OS version and test in a non-production environment before applying.
Common patterns we see
When this symptom shows up on a Juniper device, three patterns repeat:
1. Recent firmware update changed behavior. the symptom started within a week of an OTA push. Rollback or wait for the hotfix. 2. Environmental trigger, temperature, humidity, line voltage, network changes. Look at what changed in the environment. 3. Cumulative wear: components like batteries, gaskets, fans degrade over time. Replace the consumable rather than chasing a software fix.
Knowing which pattern applies saves time on the wrong fix.
Before you start
A few things to confirm so the Juniper device fix goes cleanly:
- Latest firmware downloaded if you're going to update.
- Warranty + support contract status checked, opening sealed parts may void it.
- Backup of current configuration (where applicable) taken.
- Spare parts on hand if you anticipate replacement.
- Adequate workspace, lighting, and time. rushing causes regressions.
Quick verification
Before you walk away from a Juniper device fix, run through:
1. Reproduce the original trigger, does the issue reappear? 2. Check the device's status / health screen for any new alerts. 3. Confirm paired devices (app, hub, controller) reconnected. 4. Save / commit any configuration changes per the device's normal workflow. 5. Note the change in your maintenance log with date + firmware version.
When to call Juniper support instead
Escalate if:
- The same symptom returns within 24 hours of a clean fix.
- You see physical damage (burn marks, swollen battery, cracked PCB).
- The device is in warranty and a hardware replacement is the cheaper outcome.
- Repair requires specialised tools you don't own (alignment jigs, calibration software).
- Following the official path keeps the warranty intact, which matters more than the time spent.
More frequently asked questions
Is it safe to apply during business hours?
If the device is in production use, apply during a scheduled maintenance window. Most procedures need 2-15 minutes of downtime. Capture pre-change state so you can roll back if needed.
How long does this fix usually take?
Most users complete the steps in 20-45 minutes the first time, and 5-10 minutes on subsequent runs once the menu paths are familiar.
Will this void my warranty?
Applying official firmware updates and following the user manual will not affect warranty. Opening sealed components, jumping safety circuits, or using third-party parts can void warranty in most jurisdictions.
Does this affect other devices on my network?
Generally no. The procedure is local to this device. Network-side changes (firmware updates that affect TLS, SMB, or routing) are flagged explicitly in the steps.
What if the fix returns after a reboot?
Persistent fault returns mean either: a hardware fault (escalate), a configuration that's being overwritten by a sync source (check cloud profiles), or a regression in a recent firmware update (rollback).
Topology deep dive. how the SRX1500 actually moves a packet
The SRX1500 boot sequence runs U-Boot → Junos loader → Junos OS. If the loader> prompt appears and stays there, the bootloader survived but the Junos OS image on flash has gone bad. The recovery is to TFTP a known-good image from a 169.254.0.0/16 link-local laptop. On the BFSI side, we keep a staging laptop at every colo with a JTAC-approved image archive named by `sha256` for exactly this scenario.
VCP-trunk (Virtual Chassis Port) on the SRX340 runs at 40 Gbps over the dedicated rear ports. If you cable VCP over the front 10G optics by mistake (a common install error at remote BFSI branches), the stack joins but every inter-member packet eats a hop of latency. Use `show virtual-chassis vc-port` to confirm cabling. The output column should read VCP rather than Network.
The Junos OS RPD (routing protocol daemon) holds the BGP / OSPF tables, and the kernel installs them into the PFE forwarding table via the rpd-to-kernel socket. On a busy NSEL Mumbai colo edge with 1.2 million BGP routes from two ISP feeds (Reliance Jio + Airtel), the rpd memory footprint hits 6.4 GB. The SRX1500 ships 8 GB DRAM, and you will see the RE swap to disk during convergence storms if you do not damp the import. `show route summary` is your friend.
Configuration walkthrough with Junos commit safety
Junos OS supports `commit comment` and `commit synchronize` (for dual-RE chassis). On the SRX1500 single-RE platform, the `synchronize` flag is a no-op, but I leave it in the muscle-memory commit script. On a dual-RE MX series, omitting `synchronize` is a silent split-brain risk that BFSI auditors will flag. The ITSAR network device baseline (TEC 31318) calls out config sync as a mandatory control.
For automation, NETCONF over SSH on port 830 is the standard. `set system services netconf ssh` enables it. Pair this with a service account whose AAA profile in `set system login user` uses `class super-user-local` only (no remote root). On the SRX340 at a BSNL POP in Vijayawada, we run Ansible juniper.device collection against this account, with vault-encrypted RSA 4096 keys. The keys rotate quarterly via a `gpg`-backed CI pipeline.
The Junos commit model is two-stage by default: candidate config first, then `commit`. On a production SRX1500 at a BFSI data center, I always run `commit check` first, then `commit confirmed 5`. The `confirmed 5` flag rolls back automatically after five minutes if you do not run a second `commit` to make it permanent. This single habit has saved me from a midnight session at a colo cage more than once.
Troubleshooting commands I keep on the laminated card
- show route summary, table sizes per RIB. If inet.0 exceeds 1 million on the SRX1500, you are crowding RPD memory and BGP convergence will degrade.
- request support information | save /var/tmp/rsi.txt: the JTAC bundle. Compress with `gzip` and upload via the JTAC case web upload.
- show chassis environment, power, temperature, fan tray status. The temperature column reports Marginal before Failed, which gives you a 24-hour window in most BFSI cages to plan a cooling fix.
- request system snapshot. clone the current Junos OS slice to the secondary. Always run this before a firmware add.
- show interfaces ge-0/0/1 extensive, drops, errors, queue depth, and SFP DDM voltages. The DDM optical RX power should sit between -3 dBm and -7 dBm on a standard 10km SMF link.
- show log messages | last 50: recent syslog. Look for FPC FRU events, PEM events, and any RPD_ABORTED entries.
- show system processes extensive | match rpd, rpd memory and CPU. Above 60% sustained, plan an RE upgrade or BGP import filtering.
- show chassis hardware extensive. full inventory including serial numbers, FRU type, version. This is the first command JTAC asks for in any RMA case.
India compliance and procurement notes (MeitY, DPDP, GeM)
Under DPDP 2023 (Digital Personal Data Protection Act), logs that carry source IP plus a user identifier qualify as personal data. The syslog forwarding configuration on the SRX1500 must therefore include log retention boundaries (typically 180 days hot, 365 days cold) and access control at the SIEM. Splunk RBAC roles aligned to the data controller and processor responsibilities. On a Reliance Industries BFSI rollout, I have seen the legal team push back on raw syslog leaving the Mumbai data center perimeter, so the SIEM forward is to an on-prem instance, not a cloud SaaS.
The GeM (Government e-Marketplace) listing for SRX340 at the time of writing is INR 4,87,500 per unit, with SmartNet renewal at INR 85,000 per year. The BoQ for a typical BSE colo deployment includes 2 SRX340 in cluster, 1 EX4300 management switch, 2 RJ45 console servers (Opengear), and the AMC line item for 3 years totalling around INR 14,75,000. The procurement cycle is 90-120 days end-to-end through GeM.
Real-world deployment I did
On a Mumbai NSE colo build, two SRX340s were stacked via Junos virtual chassis. The secondary stack member vanished from `show virtual-chassis` after a routine reboot. The fault was a VCP cable that had been crimped against a cable manager during install and was throwing intermittent CRC. We swapped to a DAC, and the stack recovered. The lesson: virtual-chassis cabling at colo sites needs slack and strain relief, not just label compliance.
At a tier-2 town WISP in Tirupati, an SRX300 sat on the head-end backhaul fed by BSNL FTTH. The PSU started clicking, but `show chassis power-budget` looked clean. The tech onsite reported a faint burning smell. I told him to power off in seconds, not minutes. We pulled the chassis serial and the unit was in warranty, and the JTAC RMA arrived in 7 working days via the Bengaluru depot. The replacement boot-flashed on first power-up, that is rare and worth noting.
Extended FAQs from the field
Does the Junos OS upgrade need a maintenance window?
Yes. Even with `commit confirmed` and a dual-RE chassis, the FPC reboots during the package install. On the SRX1500 single-RE platform, this is a 6-8 minute outage. I schedule them in the 02:00-04:00 IST window after coordinating with the BFSI NOC on-call, the upstream Reliance Jio / Airtel ISP, and the downstream switch fabric team.
What is the realistic RMA turnaround in India?
JTAC depot in Bengaluru ships in 7-10 working days for in-warranty SRX300 / SRX340. For SRX1500 the depot is in Mumbai and the shipping is usually 5-7 working days. Out of warranty units go via the JTAC Spares purchase line at roughly 60-70% of the new BoQ price. Always confirm the serial entitlement via the support portal before opening the case.
What if `show chassis environment` reports Fan Tray Failed but the unit is cool?
Check the fan tray seating first: at India colo sites, dust loading and vibration from BMS construction nearby can dislodge a tray. Reseat, wait 60 seconds, recheck. If still failed, swap the tray (FRU SRX-FAN-TRAY is field-replaceable on the SRX1500). The tray FRU price is approximately INR 38,000 from the JTAC spares line.
How do I verify a Junos OS image before flashing?
Pull the image hash from the JTAC download page (it lists SHA-512). On the device, use `file checksum sha-256 /var/tmp/junos-srxsme-22.4R3.7.tgz` and compare. Mismatch means the file was truncated or tampered, do not flash. On a BFSI environment, the staging server must enforce TLS 1.2+ on the file transfer, and the JTAC web download uses HTTPS by default.