OT / ICS security · field research & frameworks

Writing · 2025-08-19 · 5 min read · troubleshooting · framework

The STREAM Method™: Advanced Industrial Network Troubleshooting

An advanced, cyclical troubleshooting methodology for complex industrial network problems. The deeper counterpart to the RIVER Method, run in iterative cycles until the fault is resolved.

River Caudle · rivercaudle.com

Advanced Industrial Network Troubleshooting Methodology

"Follow the STREAM in cycles until you reach the solution"


S - Scope the Symptom

  • Define the exact problem and its impact
  • Identify affected systems, users, and processes
  • Determine the blast radius and urgency level
  • Gather initial information from users and monitoring systems

"What is the exact problem and its blast radius?"

Addresses: The need for formal information gathering before jumping into troubleshooting


T - Test Direction

  • Assess whether this is likely physical (bottom-up) or logical (top-down)
  • Use experience and symptoms to choose starting point
  • Consider environmental factors and recent changes
  • Make an educated decision on investigation approach

"Is this likely a physical or logical issue?"

Addresses: Rigid linearity by allowing experts to jump to logical starting points


R - Replicate or Review

  • For active issues: Attempt to reproduce the problem on demand
  • For intermittent issues: Review logs, monitoring data, and historical patterns
  • Gather evidence from multiple sources
  • Build a timeline of events and symptoms

"Can I make it happen on demand, or do I need to review logs/data for clues?"

Addresses: Intermittent issues that can't be observed in real-time


E - Execute Targeted Action

  • Based on gathered data, perform one specific change or test
  • Make changes incrementally and deliberately
  • Document what you're about to do before doing it
  • Focus on single variables to isolate cause and effect

"Based on the data, what is the one specific change or test I will perform?"

Addresses: Overly broad "examine everything" approaches with focused action


A - Assess the Result

  • Evaluate whether the action fixed, changed, or had no effect
  • Document what you learned from this iteration
  • Determine next steps based on results
  • Decide whether to continue cycling or try different approach

"Did my action fix it, change it, or do nothing? What did I learn?"

Addresses: Missing feedback loop for complex problems requiring multiple iterations


M - Mitigate & Maintain

  • Implement permanent fix if temporary solution worked
  • Document the solution for future reference
  • Update procedures, monitoring, or preventive measures
  • Ensure solution is properly tested and communicated

"Is the fix in place? Is it documented for the future?"

Addresses: Combining immediate fixes with long-term documentation and prevention


The STREAM Cycle™

The Power of the Loop

Unlike linear methodologies, STREAM is designed as a cycle:

Execute → Assess → Execute → Assess

Continue cycling through E→A until the problem is resolved, each iteration providing new data to inform the next action.

Decision Points

  • After Assess: Return to Execute with new targeted action, or
  • After Assess: Return to Replicate if you need more data, or
  • After Assess: Move to Mitigate if problem is solved

Escape Conditions

  • Problem is resolved (move to Mitigate)
  • Problem requires escalation (document and hand off)
  • Problem requires scheduled maintenance window (plan and schedule)

STREAM vs. RIVER: When to Use Which

ScenarioMethodWhy
Device completely dead in cabinetRIVERSimple, physical-first approach
Intermittent network performanceSTREAMRequires data analysis and cycling
Remote troubleshootingSTREAMAll steps can be performed remotely
Complex multi-system issueSTREAMHandles scope and iteration well
New technician trainingRIVERSimpler, more linear learning
Experienced team investigationSTREAMLeverages expertise and flexibility

STREAM Advantages

1. Expert-Friendly

The Test Direction step allows experienced technicians to leverage their knowledge:

  • Skip physical checks for known software issues
  • Start with logs for familiar failure patterns
  • Use intuition built from years of experience

2. Handles Complexity

The Execute → Assess cycle manages multi-variable problems:

  • Each iteration builds knowledge
  • Failed attempts provide valuable data
  • Complex problems are broken into manageable steps

3. Remote-Capable

Every STREAM step works remotely:

  • Scope: Phone calls, tickets, monitoring dashboards
  • Test: Mental assessment based on symptoms
  • Replicate: Log analysis, remote monitoring tools
  • Execute: Configuration changes, remote commands
  • Assess: Remote testing, metric verification
  • Mitigate: Documentation updates, procedure changes

4. Intermittent-Issue Ready

The Replicate or Review step specifically addresses problems you can't see:

  • Historical log analysis
  • Pattern recognition in monitoring data
  • Timeline correlation with other events
  • Proactive investigation techniques

STREAM in Practice: Example Scenarios

Scenario 1: Intermittent HMI Disconnections

  • Scope: Three HMIs losing connection randomly, 2-3 times per day
  • Test: Likely network/logical issue (multiple devices, time-based pattern)
  • Replicate: Can't reproduce on demand → Review switch logs and network monitoring
  • Execute: Enable detailed logging on affected switch ports
  • Assess: Logs show CRC errors during disconnections
  • Execute: Test cable integrity on affected runs
  • Assess: Cable tests reveal intermittent opens
  • Mitigate: Replace cables, update preventive maintenance schedule

Scenario 2: Sudden Production Line Halt

  • Scope: Entire line stopped, PLC communication lost, production impact high
  • Test: Could be physical (power/connection) or logical (network storm)
  • Replicate: Issue is active → Immediate investigation possible
  • Execute: Check PLC status lights and power
  • Assess: Power good, status lights indicate network fault
  • Execute: Check switch port status and connectivity
  • Assess: Switch port down, cable issue suspected
  • Execute: Replace patch cable
  • Assess: Line restored, production resumed
  • Mitigate: Document cable failure, check other cables of same vintage

Quick Reference Card

StepFocusRemote?Key Output
SProblem DefinitionClear scope and impact
TInvestigation StrategyBottom-up or top-down approach
REvidence GatheringReproduction or historical data
EFocused ActionSingle, targeted change
AResults AnalysisLearning and next step decision
MSolution ImplementationPermanent fix and documentation

Integration with SHIP Framework

STREAM supports the SHIP (Standardize, Harden, Isolate, Protect) design philosophy:

  • Standardize: Consistent troubleshooting methodology across teams
  • Harden: Systematic approach prevents hasty changes that could cause more problems
  • Isolate: Scoped investigation respects network segmentation
  • Protect: Documentation and mitigation steps improve overall security posture

"The best troubleshooting methodology adapts to both the problem and the troubleshooter."

River Caudle · river@riverman.io · Houston, Texas