Site Reliability Engineer (SRE) Job at TMS, San Jose, CA

  • TMS
  • San Jose, CA

Job Description

Job Title: Site Reliability Engineer (SRE), AWS & DevOps
Location: San Jose, CA (Local Only)
Duration: 12+ Months
Experience Needed: 12+ Years


Job Overview:

We are looking for a skilled Site Reliability Engineer (SRE) with strong expertise in AWS Cloud and DevOps practices to join our growing engineering team. This role is critical in ensuring the reliability, scalability, and performance of our cloud infrastructure and applications. You will focus on automation, observability, infrastructure as code, and operational excellence, working closely with development and operations teams to deliver a seamless production environment.

Key Responsibilities:

  • Design, implement, and manage highly available and scalable infrastructure on AWS.
  • Develop and maintain Infrastructure as Code using tools such as Terraform, CloudFormation, or AWS CDK.
  • Build, manage, and optimize CI/CD pipelines for fast and reliable application delivery using tools like Jenkins, GitLab CI/CD, CircleCI, or AWS CodePipeline.
  • Monitor system performance and availability using CloudWatch, Prometheus, Grafana, ELK, or Datadog.
  • Define and implement observability, logging, and alerting to ensure system health and quick incident resolution.
  • Improve system reliability through monitoring, incident response, post-mortems, and automation.
  • Support containerized applications using Docker, ECS, EKS, or Kubernetes.
  • Collaborate with development teams to ensure operational readiness and incorporate SRE principles into the software development lifecycle.
  • Implement security best practices across AWS resources, including IAM, encryption, VPC configuration, and compliance monitoring.
  • Participate in an on-call rotation and lead incident response and root cause analysis.

Required Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or related technical field (or equivalent experience).
  • 3+ years of experience in a Site Reliability Engineering, DevOps, or Cloud Engineering role.
  • Strong hands-on experience with AWS services including EC2, S3, RDS, Lambda, IAM, VPC, CloudFront, etc.
  • Proficient in scripting/programming (e.g., Python, Bash, Go).
  • Expertise in CI/CD tools and automation frameworks.
  • Experience with monitoring and observability tools.
  • Strong understanding of networking, DNS, firewalls, and load balancing.
  • Solid grasp of DevOps methodologies and agile practices.
  • Familiarity with containerization and orchestration tools.

Preferred Qualifications:

  • AWS Certifications (e.g., AWS Certified DevOps Engineer, Solutions Architect).
  • Experience with Kubernetes, Helm, or Service Mesh.
  • Background in security and compliance (SOC2, ISO 27001, etc.).
  • Exposure to chaos engineering, SRE principles, and error budgets.
  • Experience with configuration management tools like Ansible, Chef, or Puppet.
