We are seeking an experienced Senior Site Reliability Engineer to play a critical role in our organization as the integrating hub for all Microsoft 365 workloads. The ideal candidate will have a strong background in Microsoft Entra, Python development, and proven experience implementing solutions using the Microsoft Graph API.
Responsibilities:
- Serve as the central point of coordination for multiple Microsoft 365 workloads
- Collaborate with architects and administrators responsible for various aspects of Microsoft 365, including infrastructure, identity, security, compliance, endpoints, and applications
- Design, implement, and maintain highly available and scalable systems using Microsoft Entra
- Develop automation scripts and tools using Python to streamline processes and improve system reliability
- Implement and integrate solutions leveraging the Microsoft Graph API to enhance functionality and user experience
- Perform user management activities, including provisioning, deprovisioning, and access control
- Monitor and implement security measures to safeguard systems and data
- Negotiate with vendors to procure necessary tools, services, and support
- Troubleshoot and resolve complex technical issues in a timely manner
- Mentor and guide junior team members, promoting best practices and fostering a culture of continuous improvement
- Participate in agile development processes, including sprint planning, standups, and retrospectives
- Manage tasks and projects using Jira, ensuring effective collaboration and timely delivery.
Job Requirements:
- Minimum of 5 years of experience as an SRE or in a similar role
- Bachelor's degree in Computer Science, Engineering, or a related technical field
- Deep expertise in Microsoft Entra, including deployment, configuration, and troubleshooting
- Strong proficiency in Python programming, with the ability to write clean, efficient, and maintainable code
- Proven track record of developing and implementing solutions using the Microsoft Graph API
- Working knowledge of Jira and agile methodologies
- Solid understanding of Git version control, with experience using Gerrit
- Familiarity with user management practices and security monitoring techniques
- Excellent problem-solving skills and ability to debug complex distributed systems
- Strong negotiation and vendor management skills
- Exceptional communication and collaboration abilities, with the ability to work effectively with cross-functional teams
- Demonstrated experience mentoring and developing junior team members
- Knowledge of additional technologies such as Kubernetes, Docker, Azure, and/or PowerShell is a plus