<
Join Our Official Channels ✔️
-->
Zoho is inviting applications for the role of Site Reliability Engineer (SRE). This position offers an excellent opportunity to work with modern cloud infrastructure, DevOps practices, automation, and large-scale system reliability. Candidates passionate about cloud, scripting, and performance engineering will find this role highly rewarding.
Job Description
Zoho is looking for Site Reliability Engineers (SREs) to support cloud infrastructure, improve automation workflows, and ensure highly available services across multiple environments. The role focuses on performance, monitoring, cloud adoption, and operational excellence.
Roles and Responsibilities
1. Automation
- Develop and maintain automation tools and frameworks.
- Reduce manual operations (toil) and improve deployment efficiency.
- Enhance CI/CD pipelines to ensure consistent software delivery.
2. Infrastructure & Cloud Management
- Support configuration, deployment, and monitoring of cloud infrastructure.
- Monitor system performance and plan capacity for scalability.
- Ensure robust, secure, and optimized cloud operations.
3. Build Deployment & Maintenance
- Assist in build deployments across global data centers.
- Work with Zoho’s in-house CI/CD ecosystem.
- Maintain high service uptime and reliability.
4. Performance & Monitoring
- Build and maintain monitoring dashboards, alerts, and logging systems.
- Detect and resolve issues proactively to maintain reliability standards.
5. Security & Compliance
- Follow cloud and infrastructure security policies.
- Maintain access control systems and adhere to compliance requirements.
6. Incident Handling
- Respond to alerts and system failures.
- Participate in root-cause analysis and implement preventive measures.
Eligibility Criteria / Requirements
Experience
- 0–2 years of experience in Cloud Automation, Cloud Infrastructure, DevOps, or SRE roles.
Technical Skills
- Scripting: Bash, Shell, or Python scripting experience.
- Programming: Understanding of at least one language – C, C++, Java, or Python.
- Cloud & Networking: Knowledge of cloud networking, security concepts, IAM basics.
- Networking Tools: netstat, ping, traceroute, nc, ssh/scp, wireshark, tcpdump.
- CI/CD: Basic understanding of CI/CD tools or processes.
- SRE Concepts: Knowledge of SLIs, SLOs, SLAs, and reliability engineering.
- Shift: Ability to work in rotational shifts (24×7 support).
Key Skills
- Strong understanding of networking fundamentals.
- Good knowledge of cloud networking and infrastructure.
- Basic understanding of Kubernetes and containerized deployments.
- Familiarity with Terraform or any Infrastructure-as-Code tool.
- Strong analytical and problem-solving skills.
- Good communication skills.
- Eagerness to learn and adapt in a fast-paced environment.
- Exposure to AWS, OCI, or Azure is an added advantage.
Important Note
- Only shortlisted candidates will be notified for the interview process.
- 2026 graduates are not eligible for this recruitment.