Capital One Employment – Site Reliability Engineer

Job Description:

We are looking for an experienced Site Reliability Engineer with an operations or site reliability engineering background and a passion for providing superior system availability and customer experience. The ideal candidate can lead a 24/7 support organization and drive reliability and performance at massive scale by mastering the full depth of the stack.

Job Responsibilities:

  • Use production support expertise to influence and support new designs, architectures, standards, and methods that maintain stability and availability for large-scale distributed systems
  • Proactively monitor all of the applications and infrastructure behind Capital One’s external and internal customer-facing services, including their availability, latency, performance, and capacity
  • Drive incident resolution through a systematic problem-solving approach and a strong sense of ownership
  • Effectively manage troubleshooting and recovery of complex production incidents, ranging from low to critical impact
  • Create, manage, and use appropriate technical procedural documentation (runbooks)
  • Identify opportunities for proactive, automated monitoring and alerting, and develop solutions using available tools (Splunk, Datadog, etc.)
  • Influence resiliency and scalability in production environments in Amazon Web Services (AWS)
  • Assist with Root Cause Analysis (RCA) of critical production outages, and develop and implement mitigation strategies
  • Actively participate in the team’s Agile stories (project work) to streamline and enhance the team’s day-to-day operations

Job Requirements:

  • 2+ years of experience with web API services
  • 2+ years of experience with Linux, UNIX, Python, Ruby, Go, JavaScript, or NoSQL
  • 2+ years of experience with AWS, Azure or GCP
  • Bachelor’s Degree
  • At least 2 years of experience in technology production support
  • 2+ years of experience with Splunk, New Relic, or Datadog monitoring and alerts
  • AWS Associate level certification (Solutions Architect, SysOps Administrator, or Developer)

Job Details:

Company: Capital One

Vacancy Type: Full Time

Job Location: Glen Allen, VA, US

Application Deadline: N/A
