company-logo-image

Site Reliability Engineer (8K up - 2 years & above experience!)

Hiredly X

RECRUITMENT firm

ashley-avatar-image

AI-generated summary

beta

This job is a Site Reliability Engineer, focusing on keeping tech systems running smoothly. You might like this job because you’ll work on building scalable systems, improving tools, and collaborating to enhance performance and reliability.

RM 8K - RM 12K

Selangor

Job Description

The Site Reliability Engineer (SRE) ensures the reliability and performance of critical services, bridging development and operations. The role focuses on scalable infrastructure, SRE practices such as SLOs and SLIs, and reducing operational toil. Collaboration with teams to improve reliability and foster a continuous learning culture is key.

  • Design and implement resilient system architectures for high availability and scalability.
  • Develop automation tools and scripts to improve operational efficiency.
  • Define, track, and analyze SLOs and SLIs for performance and reliability.
  • Conduct post-mortem analyses and implement improvements based on findings.
  • Collaborate on best practices for system reliability and incident management.
  • Troubleshoot and resolve database, network, and deployment issues.
  • Ensure issue resolution meets Service Level Agreements (SLAs).
  • Identify and address system performance bottlenecks with actionable recommendations.
  • Maintain documentation for processes and incident responses.

Job Requirements

  • Proficiency in programming languages like Python, Golang, or Java.
  • Experience in system architecture with a focus on reliability and scalability.
  • Strong understanding of SRE principles (SLOs, SLIs, toil reduction).
  • Experience with cloud environments (AWS, Azure, Google Cloud).
  • Expertise in Linux system administration.
  • Problem-solving skills with a proactive approach to operational challenges.
  • Ability to work independently and collaborate in a team environment.
  • Able to speak, read, and write in Mandarin to support communication and collaboration across teams.

Preferred skills:

  • Familiarity with monitoring tools and performance optimisation.
  • Experience with system administration automation and scripting.
  • Knowledge of networking concepts and troubleshooting.
  • Hands-on experience with cloud platforms and services.
  • Familiarity with DevOps practices (CI/CD, infrastructure as code, containerisation).

Skills

Site Reliability Engineering
Python (Programming Language)
Cloud Technologies
Docker Container
Kubernetes
Linux

Additional Info

Company Activity

Last active - few minutes ago

Career Level

Junior Executive


Company Profile

Hiredly X-logo-image

Hiredly X

Hiredly X, the headhunting team of Hiredly, makes headhunting accessible and affordable for every employer, no matter the size or industry. We help employers screen and source the best candidates through exclusive access to our job portal database.Assisted with AI, we make the headhunting process fast and accurate, allowing us to be competitive with our fees.