Back to Job Board
S

Site Reliability Engineer

SID Global SolutionsTS, IndiaApril 17, 2026

Job Description

Job Information

Job Opening ID

ZR_1164_JOBDate Opened

04/17/2026Industry

IT ServicesJob Type

Full timeCity

HyderabadState/Province

TelanganaCountry

IndiaZip/Postal Code

500082### Job Description

About SIDGS

SIDGS is a premium global systems integrator and global implementation partner of Google corporation, providing Digital Solutions & Services to Fortune 500 companies. Our Digital solutions go across following domains: User Experience, CMS, API Management, Microservices, DevOps, Cloud, Service Mesh, Artificial Intelligence, RPA domains.

We create innovative solutions in Digital, API Management, Cloud and DevOps space in partnership with Google. We understand that every business has a unique set of challenges and opportunities, and we leverage our unique industry insights, honed through decades of combined experience in the technology sector, to deliver the products, solutions, and services necessary to achieve best customer satisfaction and delivering positive impact to the communities.

About SIDGS:

SID Global Solutions is a premier AI-first digital transformation company, delivering intelligent full-stack solutions and services worldwide. We specialize in embedding AI across the enterprise to accelerate transformation, modernize technology, and unlock new revenue streams. Our core expertise spans AI-powered modernization, cloud and infrastructure solutions, application innovation, and advanced data analytics. As a leading Google Apigee implementation partner, we also bring you SAMi-Smart API Monetization Platform our flagship platform for API management. With a global footprint and 1000+ professionals, SID Global Solutions is committed to building a smarter, AI-driven future through innovation and impact.

Job Role: Site Reliability Engineer (SRE) – GCP

The Site Reliability Engineer (SRE) Level 1/ Level 2/ Level 3 will be responsible for maintaining and improving the reliability, availability, and performance of the systems.

We are looking for someone who passionate about learning and developing their skills in system reliability, automation, and incident response. You will work closely with senior SREs, DevOps teams, and other stakeholders to ensure the services meet the highest standards of reliability and performance.

Key Responsibilities:

  • Site Reliability Engineers (SREs) are responsible for building and operating critical internal and external services.
  • Our SREs use their skills as software engineers and their knowledge of infrastructure to further develop our reliability practice in order to keep these services running smoothly.
  • They do this with the ultimate goal of delivering a great customer experience through services that are observable, scalable and highly available.
  • In this role you will be learning complex systems, specifically how they behave and fail in production.
  • If you are inclined to drive those failure modes out of our systems then you might be a good fit for the team. This important work not only drives growth for but is also critical for the growth of customers.

You might be a good fit for our team if the following describes you:

  • You are experienced with infrastructure as code practices
  • You consistently use your programming skills to automate tasks
  • You are comfortable working in a GCP Cloud environment
  • You think of software and infrastructure coming together to form a larger system
  • You dig deep into incidents/problems and come up with unique solutions
  • You are enthusiastic about learning new technologies and spreading your knowledge
  • You battle ruthlessly to fix what's broken and protect the customer experience
  • You are compelled to leave a situation better than you found it

Skills Required:

  • Increasing the observability of our various applications, services, and infrastructure using:
  • Open Telemetry
  • Grafana eco-system (Grafana, Loki, Mimir, Tempo)
  • Fluentd
  • Automating our applications and infrastructure using:
  • Terraform
  • Kubernetes
  • Puppet
  • Creating CI/CD pipelines for these services using:
  • Gitlab
  • ArgoCD
  • Kustomize
  • Working with our Product teams and helping them capture the user experience in SLOs
  • Reducing the impact of service disruptions through our incident, problem, change management programs
  • Continuously learn and apply new skills and technologies relevant to the SRE role.
  • Participate in training sessions and workshops to enhance knowledge.

Preparing for this role?

Practice with an AI interviewer tailored to Site Reliability Engineer at SID Global Solutions.