
Website Capital One
Job Description:
We are looking for an experienced Site Reliability Engineer with operational and/or site reliability engineering background with a passion for providing superior system availability and customer experience. We are looking for candidates who can lead a 24/7 support organization, drive reliability and performance across a massive scale by mastering the full depth of the stack.
Job Responsibilities:
- Utilize production support expertise to influence and support new designs, architectures, standards and methods maintaining stability and availability for large-scale distributed systems
- Proactively monitor all of the applications and infrastructure behind Capital One’s external and internal customer facing services including their availability, latency, performance, and capacity
- Drive incident resolution through a systematic problem solving approach, coupled with a strong sense of ownership and drive
- Effectively manage troubleshooting and recovery of complex production incidents, ranging from low to critical impacts
- Create, manage and utilize appropriate technical procedural documentation (run books)
- Identify opportunities and develop proactive automated monitoring and alerting solutions by utilizing available tools (Splunk, DataDog, etc.)
- Influence resiliency and scalability in production environments in Amazon Web Services (AWS)
- Assist with conducting Root Cause Analysis (RCA) on critical production outages, develop and implement mitigation strategies
- Actively participate in teams’ Agile stories (project work) to streamline and enhance day to day operations of the team
Job Requirements:
- 2+ years experience with web API services
- 2+ years of experience with Linux, UNIX, python, Ruby, Go, JavaScript, or NoSQL
- 2+ years of experience with AWS, Azure or GCP
- Bachelor’s Degree
- At least 2 years of experience in technology production support
- 2+ years of experience with Splunk, New Relic, or DataDog monitoring and alerts
- AWS Associate level certification (Solutions Architect, SysOps Administrator, or Developer)
Job Details:
Company: Capital One
Vacancy Type: Full Time
Job Location: Glen Allen, VA, US
Application Deadline: N/A
Report Job