DevJobs

Sr Staff Site Reliability Engineer

Overview
Skills
  • Python Python
  • Java Java
  • ML ML
  • Linux Linux ꞏ 5y
  • CI/CD CI/CD
  • AWS AWS
  • Azure Azure
  • GCP GCP
  • Kubernetes Kubernetes ꞏ 2y
  • Ansible Ansible
  • Terraform Terraform
  • Chef Chef
  • Puppet Puppet
  • Networking Networking
  • Unix ꞏ 5y
  • Observability
  • Alerting
  • Monitoring
  • Distributed Systems
  • ML Services
  • Load Balancing
  • LLM
  • Proxies
  • DNS
  • Routing
  • SASE
  • SD-WAN
  • TCP/IP
  • BGP
  • AI
  • VPNs
  • Zero Trust
Our Mission

At Palo Alto Networks®, we’re united by a shared mission—to protect our digital way of life. We thrive at the intersection of innovation and impact, solving real-world problems with cutting-edge technology and bold thinking. Here, everyone has a voice, and every idea counts. If you’re ready to do the most meaningful work of your career alongside people who are just as passionate as you are, you’re in the right place.

Who We Are

In order to be the cybersecurity partner of choice, we must trailblaze the path and shape the future of our industry. This is something our employees work at each day and is defined by our values: Disruption, Collaboration, Execution, Integrity, and Inclusion. We weave AI into the fabric of everything we do and use it to augment the impact every individual can have. If you are passionate about solving real-world problems and ideating beside the best and the brightest, we invite you to join us!

This role is remote, but distance is no barrier to impact. Our hybrid teams collaborate across geographies to solve big problems, stay close to our customers, and grow together. You will be part of a culture that values trust, accountability, and shared success where your work truly matters.

Job Description

Your Career The SASE Platform team builds and operates highly available, secure, and globally distributed services that protect users, applications, and data for some of the world’s largest enterprises. Our mission is to deliver cloud-native security and networking capabilities that seamlessly converge networking and security at scale. As enterprises accelerate adoption of cloud, remote work, and AI-driven workloads, the need for resilient, observable, and secure SASE platforms has never been greater. As an SRE, you will play a critical role in ensuring our platform is reliable, scalable, performant, and secure from day one. Your Impact As a Site Reliability Engineer, you will be an integral part of the product and platform lifecycle, partnering closely with software engineers, security experts, and infrastructure teams. You will: Collaborate with development teams to embed reliability, scalability, and operability into services from the earliest design stages Design, review, and evolve cloud-native architectures to improve availability, performance, cost efficiency, and fault tolerance Build and operate automation for provisioning, deploying, and managing infrastructure at global scale using Infrastructure as Code Improve CI/CD pipelines and release processes to enable safe, fast, and repeatable deployments Drive observability best practices, including metrics, logs, traces, SLIs/SLOs, and data-driven incident analysis Participate in on-call rotations, continuously reducing MTTR through automation, runbooks, and proactive reliability improvements Mentor and guide engineers on large-scale cloud and SASE deployments, fostering a strong SRE culture Participate in architecture and design reviews, bringing a reliability and operational excellence mindset Champion reliability, security, and operational maturity across the organization

Qualifications (Additional Job Description)

Your Experience Bachelor’s degree in Engineering, Computer Science, or a related technical field (or equivalent practical experience) 5+ years of experience working with Unix/Linux systems (shell, tools, networking, storage, kernel concepts) 2+ years of hands-on experience with microservices architectures running on Kubernetes and container platforms Strong understanding of distributed systems design, fault tolerance, scalability patterns, and high-availability architectures Experience operating workloads in public cloud environments (AWS, GCP, Azure, or hybrid) at medium to large scale Proficiency in building automation and tools in Python, Java, or similar languages for production environments Strong Infrastructure as Code experience (Terraform, Ansible, Chef, Puppet, or similar) Experience designing and operating monitoring, alerting, and observability systems at scale A tools-first mindset with a passion for reducing toil and increasing engineering efficiency Excellent communication skills and the ability to lead discussions across engineering and security teams Experience applying reliability and security frameworks to design, review, and operate production systems Nice to have: Networking expertise, including TCP/IP, DNS, BGP, routing, load balancing, proxies, VPNs, and cloud networking concepts—especially relevant to SASE architectures Experience operating or supporting SASE, SD-WAN, Zero Trust, or network security platforms Familiarity with AI/LLM technologies, including: Using LLMs to improve operational workflows (incident analysis, alert enrichment, runbooks, automation) Experience integrating AI/ML services into production systems Understanding of reliability, security, and governance considerations for AI-driven services

Our Commitment

We’re trailblazers that dream big, take risks, and challenge cybersecurity’s status quo. It’s simple: we can’t accomplish our mission without diverse teams innovating, together.

We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at [email protected] .

Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.

All your information will be kept confidential according to EEO guidelines.

Is role eligible for Immigration Sponsorship? No. Please note that we will not sponsor applicants for work visas for this position.
Palo Alto Networks