> ## Documentation Index
> Fetch the complete documentation index at: https://docs-sre.lightrun.com/llms.txt
> Use this file to discover all available pages before exploring further.

# Production Engineers

> AI SRE workflow for Production Engineers

Production Engineers are often pulled into incidents late with little evidence. AI SRE helps quickly understand the situation, gather evidence, and contribute effectively to incident resolution.

## Workflow stages

### Alert intake & triage

**Challenge:** Often pulled in late with little evidence

**How AI SRE helps:**

* Provides immediate context about the incident
* Summarizes evidence gathered so far
* Explains current state of investigation
* Enables rapid onboarding to incident

### Scope & impact assessment

**Challenge:** Asked to validate impact without runtime data

**How AI SRE helps:**

* Provides impact assessment from available evidence
* Correlates available data to assess impact
* Identifies what runtime data is missing
* Supports impact validation with available data

### Root cause investigation

**Challenge:** Attempts to reproduce locally

**How AI SRE helps:**

* Analyzes production data directly
* Gathers evidence from production systems
* Provides guidance on reproduction if needed
* Focuses on production evidence, not just reproduction

### Fix design

**Challenge:** Designs fixes based on hypotheses, not real failure states

**How AI SRE helps:**

* Provides evidence of actual failure states
* Uses production data to understand failures
* Enables fix design based on evidence
* Validates fix design against real failure states

### Deployment & verification

**Challenge:** Lacks direct proof fix addressed the root cause

**How AI SRE helps:**

* Verifies fixes address root cause
* Provides evidence that fix worked
* Validates fix addresses root cause
* Provides confidence in fix effectiveness

### Post-incident learning

**Challenge:** Knowledge remains undocumented or tribal

**How AI SRE helps:**

* Documents investigation automatically
* Captures knowledge from investigation
* Creates shareable documentation
* Retains knowledge for future reference

## Key workflows

### Rapid onboarding

1. Get pulled into incident
2. Ask AI SRE for incident summary
3. Get evidence and current state
4. Understand investigation progress
5. Contribute effectively

### Evidence gathering

1. Identify what evidence is needed
2. Ask AI SRE to gather evidence
3. Get evidence from available sources
4. Fill evidence gaps
5. Contribute to investigation

## Best practices

* Use AI SRE immediately when pulled into incident
* Get context fast
* Gather evidence systematically
* Validate fixes with AI SRE
* Document knowledge for future

## Next steps

<CardGroup cols={2}>
  <Card title="Working with AI SRE" icon="wrench" href="/working-with-ai-sre/overview/detection" />

  <Card title="FAQ" icon="question-circle" href="/faq" />
</CardGroup>
