- Career Center Home
- Search Jobs
- Senior Technical Architect - Site Reliability Engineering & AIOps
Results
Job Details
Explore Location
Schwab
Austin, Texas, United States
(on-site)
Posted
3 days ago
Schwab
Austin, Texas, United States
(on-site)
Job Type
Full-Time
Senior Technical Architect - Site Reliability Engineering & AIOps
The insights provided are generated by AI and may contain inaccuracies. Please independently verify any critical information before relying on it.
Senior Technical Architect - Site Reliability Engineering & AIOps
The insights provided are generated by AI and may contain inaccuracies. Please independently verify any critical information before relying on it.
Description
Your OpportunityAt Schwab, you're empowered to make an impact on your career. Here, innovative thought meets creative problem solving, helping us "challenge the status quo" and transform the finance industry together.
We believe in the importance of in-office collaboration and fully intend for the selected candidate for this role to work on site in the specified location(s).
In this role, you'll lead the technical vision and architecture for our Site Reliability Engineering (SRE) and AIOps function, shaping how reliability, automation, and intelligent operations scale across the enterprise. This is not a traditional production support role. It requires engineering / coding experience. You'll work at the intersection of cloud-native platforms, distributed systems, and AI-driven operations-partnering closely with Engineering, Product, Security, and Infrastructure leaders to build resilient, self-healing systems that support millions of clients. This is a highly visible leadership role where your expertise influences both technology strategy and how teams operate day to day.
Key Responsibilities
- SRE Architecture & Reliability Strategy - Define and own the end-to-end reliability architecture, including SLO/SLI frameworks, error budget policies, observability standards, and resilience patterns across distributed microservices environments.
- AIOps Platform Architecture - Design and architect the AIOps platform encompassing ML-driven anomaly detection, predictive alerting, automated root cause analysis, event correlation, and intelligent remediation workflows.
- Infrastructure & Platform Design - Lead architecture decisions for cloud-native infrastructure (GCP/AWS/Azure), Kubernetes orchestration, service mesh (Istio/Envoy), infrastructure-as-code (Terraform/Pulumi), and multi-region disaster recovery strategies.
- Observability & Monitoring Architecture - Architect the unified observability stack integrating metrics, logs, traces, and events using technologies such as OpenTelemetry, Grafana, Datadog, and custom ML pipelines for intelligent alerting.
- Automation & Self-Healing Systems - Drive the architecture of automated remediation frameworks, self-healing infrastructure, chaos engineering pipelines, and progressive deployment strategies (canary, blue-green, feature flags) to achieve zero-touch operations.
- Technical Leadership & Governance - Establish architecture review boards, technical standards, design patterns, and reference architectures; lead technical due diligence and drive consistency across SRE and platform teams.
- Team Development & Mentorship - Build, mentor, and grow a team of senior SRE architects and engineers; foster a culture of engineering excellence, continuous learning, and innovation in reliability and AI-driven operations.
- Stakeholder & Executive Engagement - Partner with Engineering, Product, Security, and Infrastructure leadership to align reliability and AIOps investments with business priorities; present technical strategies to executive stakeholders.
What you have
Required Qualifications
- 12+ years of experience in software development and engineering, infrastructure, or SRE, with 5+ years in a senior architecture or technical leadership role.
- Deep expertise in distributed systems, cloud-native architectures, and large-scale production environments.
- Hands-on experience with Kubernetes, Docker, service mesh, CI/CD pipelines, and infrastructure-as-code tools.
- Strong understanding of ML/AI concepts and their application to operational intelligence - anomaly detection, predictive scaling, log analysis, and automated diagnostics.
- Proven experience designing observability platforms using OpenTelemetry, Prometheus, Grafana, Datadog, Splunk, or equivalent.
- Expertise in incident management frameworks, chaos engineering, and SLO-driven reliability practices.
- Experience with major cloud platforms (AWS, GCP, Azure) at scale.
- Strong communication and executive presence with the ability to translate complex technical concepts for non-technical stakeholders.
In addition to the salary range, this role is also eligible for bonus or incentive opportunities.
Requisition #: 2026-119491
r1d4rh5eu
Requirements
2026-119491
Job ID: 82697963

Schwab
United States
Schwab is a leader in financial services, helping millions of people make the most of their money. Most Schwab careers are based in one of our two main operating segments, Investor Services or Institutional Services. But across the entire Schwab organization, more than 12,000 employees share a passion for fulfilling our corporate purpose: to help everyone be financially fit.
View Full Profile
More Jobs from Schwab
Software Engineer - Full Stack
Austin, Texas, United States
7 hours ago
VP, Financial Consultant - Huntersville, NC
Huntersville, North Carolina, United States
7 hours ago
Manager, Financial Services IT Product Management
Austin, Texas, United States
6 hours ago
Jobs You May Like
Median Salary
Net Salary per month
$4,904
Cost of Living Index
67/100
67
Median Apartment Rent in City Center
(1-3 Bedroom)
$2,119
-
$3,831
$2,975
Safety Index
56/100
56
Utilities
Basic
(Electricity, heating, cooling, water, garbage for 915 sq ft apartment)
$101
-
$300
$190
High-Speed Internet
$50
-
$100
$67
Transportation
Gasoline
(1 gallon)
$2.73
Taxi Ride
(1 mile)
$2.61
Data is collected and updated regularly using reputable sources, including corporate websites and governmental reporting institutions.
Loading...
