Full Time
Anywhere
Posted 1 week ago

Supersourcing

About the job

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that our services, both internally critical and externally visible systems, have reliability and uptime appropriate to customers’ needs and a fast rate of improvement. Additionally, SREs keep an ever-watchful eye on our systems’ capacity and performance.

As a Site Reliability Engineer, you will have the opportunity to manage the complex challenges of scale that are unique to Digitization, while using your expertise in coding, algorithms, complexity analysis, and large-scale system design. You will provide scalable, reliable, durable, and secure applications for our customers and internal users. You will help build highly reliable applications using a customer-first approach while innovating technically. You will understand our customers’ needs and how we can meet them.

Responsibilities

Work with the Site Reliability Engineering team, the Development team, and other partner teams to ensure that application reliability, efficiency, and performance meet our customers’ needs, while keeping service operations reliable, scalable, and automated.
Develop and implement projects that improve system reliability, efficiency, and performance.
Partner with development teams on feature launches to ensure our customers receive reliable and scalable functionality.
Build deep knowledge of the production infrastructure and use it to debug distributed systems problems and identify improvements to the system.
Operations, SLO, SLA management
Metrics reporting and progress tracking
Be on-call, responding to and managing incidents.
Observability (Alarms, monitoring, synthetics).
Error management

Qualifications

Bachelor’s degree in Computer Science or a related engineering degree
8+ years of IT industry experience
Strong Experience in

Java, Spring Boot, Node.js, microservices, RDBMS, NoSQL
AWS EC2, S3, Lambda, IAM, ECS, EKS, SQS, Kinesis
Observability using Splunk, New Relic
Infrastructure as Code using Terraform
APIs and event-driven approaches
Security patterns
Unix/Linux systems administration. Familiarity with Docker is a must.

Strong experience in analysing and troubleshooting large-scale distributed systems. Quick reaction to high-severity, customer-impacting incidents.
Ability to debug and optimize code and automate routine tasks.
Knowledge of modern software engineering practices and tools (Agile and DevOps).
Strong communication skills and the ability to explain complex technical matters in an easy-to-understand way.

Originally posted on Himalayas

To apply for this job please visit himalayas.app.

About this role & career path

Working in India

India, officially the Republic of India, is a country in South Asia. It is the seventh-largest country by area, the most populous country in the world and, since its independence in 1947, the world's most populous democracy. Bounded by the Indian Ocean on the south, the Arabian Sea on the southwest, and the Bay of Bengal on the southeast, it shares land borders with Pakistan to the west; China, Nepal and Bhutan to the north; Bangladesh and Myanmar to the east. In the Indian Ocean, India is near Sri Lanka and the Maldives. Its Andaman and Nicobar Islands share a maritime border with Myanmar, Th


SRE (Site Reliability Engineer)

