
Driving Enterprise-Wide SRE Adoption Across a Global Automotive Portfolio with Observability and Automation
| Client | Industry | Solution Provided | Technologies Used |
|---|---|---|---|
| Global Automotive Manufacturer | Auto & Manufacturing | Platform DevOps & Site Reliability Engineering (SRE) | Dynatrace, Azure B2C, CI/CD Pipelines, Infrastructure Automation, QA Automation |
The Need
A global automotive manufacturer set out to scale Site Reliability Engineering (SRE) across its enterprise portfolio to strengthen platform observability and telemetry, with a particular focus on security.
Their goals included:
- Launching SRE adoption with the Azure B2C platform as a pilot
- Defining and operationalizing SLOs and SLIs for improved reliability tracking
- Working with an experienced engineering partner to guide the SRE rollout and embed best practices from day one
The Solution
Gorilla Logic partnered with engineering and operations leaders to build the foundational practices, governance, and tooling for scalable SRE adoption.
Key solution components included:
- SLO/SLI Definition: Defined metrics to measure service reliability and performance across products
- SRE Governance: Co-authored SRE policies and adoption playbooks with client stakeholders
- Observability Framework: Integrated Dynatrace to enable real-time telemetry and SLI/SLO tracking
- Phased Platform Rollout: Deployed changes to Azure B2C incrementally, in turn centralizing observability, automating infrastructure, enabling CI/CD pipelines, and advancing QA automation
- Cross-Functional Enablement: Facilitated collaboration between dev, ops, and security teams to operationalize observability and streamline incident remediation
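To make the SLO/SLI component above concrete, here is a minimal sketch of how an availability SLI and its remaining error budget are commonly calculated. It is illustrative only: the `Slo` class, the function names, the `b2c-sign-in-availability` label, and the 99.9% target are hypothetical and do not come from the engagement, and the calculation is the generic good-events-over-total-events form rather than any Dynatrace-specific API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Slo:
    """A service-level objective: a named reliability target (hypothetical shape)."""
    name: str
    target: float  # required fraction of good events, e.g. 0.999 for 99.9%


def availability_sli(good_events: int, total_events: int) -> float:
    """SLI: the fraction of events that were good over the measurement window."""
    if total_events == 0:
        return 1.0  # no traffic in the window, so nothing failed
    return good_events / total_events


def error_budget_remaining(slo: Slo, good_events: int, total_events: int) -> float:
    """Fraction of the error budget still unspent.

    1.0 means untouched, 0.0 means exhausted, negative means overspent.
    """
    allowed_failures = (1.0 - slo.target) * total_events
    actual_failures = total_events - good_events
    if allowed_failures == 0:
        # A 100% target (or an empty window) leaves no room for failures.
        return 1.0 if actual_failures == 0 else float("-inf")
    return 1.0 - actual_failures / allowed_failures


def is_slo_met(slo: Slo, good_events: int, total_events: int) -> bool:
    """True when the measured SLI meets or exceeds the SLO target."""
    return availability_sli(good_events, total_events) >= slo.target


if __name__ == "__main__":
    # Illustrative numbers only, not measurements from the client platform.
    sign_in = Slo(name="b2c-sign-in-availability", target=0.999)
    good, total = 999_500, 1_000_000
    print(f"SLI: {availability_sli(good, total):.4%}")
    print(f"SLO met: {is_slo_met(sign_in, good, total)}")
    print(f"Error budget remaining: {error_budget_remaining(sign_in, good, total):.1%}")
```

With these sample numbers, 500 failed requests against an allowance of about 1,000 leaves roughly half the error budget. A team could use a figure like this to decide whether to keep shipping changes or pause and prioritize reliability work.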
Results
- Improved Platform Visibility: Centralized observability enabled real-time insight into performance and issues
- Accelerated SRE Adoption: Established reusable SRE policies, processes, and telemetry patterns to scale across the enterprise
- Enhanced Collaboration: Unified development, operations, and security workflows through shared tooling and reliability metrics
- Continuous Optimization: Set the foundation for long-term improvement of reliability, release speed, and incident response