View Jobs at Decagon |
Full Time |
Remote |
Posted 2 years ago |
JOB TITLE: Site Reliability Engineer
JOB LOCATION: Remote
Employment Type: Contract
JOB DETAILS:
- We are looking for talented & experienced engineers to help us build and support our cloud-native infrastructure.
- As a Site Reliability Engineer, you’ll be joining a team of technologists from mixed backgrounds.
- Our mandate is to provide secure, flexible and stable platform solutions that empower our feature development teams to create the highest quality services for our customers.
Duties & Responsibilities
- Develop, deploy, and operate cloud-native infrastructure in support of our SaaS platform
- Develop and improve instrumentation for understanding and troubleshooting the health and availability of services
- Bring a mindset of standards and best practices to help create observability solutions that the team would want to adopt
- Participate in an on-call rotation
- Drive a culture of automation, both within the team and throughout the organization, in order to scale efficiently and reliably
- Participate in technical discussions to aid system design, analysis, and troubleshooting
- Help engineering teams to develop, test, debug and release scalable, resilient and highly available cloud-native applications
Skills / Requirements
- 4+ years of experience with implementation, operations, and maintenance of cloud services
- A drive to inspire adoption through enthusiasm
- An understanding of the importance of a strong feedback loop with other teams and individuals across the organization
- A deep understanding of cloud computing concepts and solutions, specifically with Google Cloud Platform
- A solid understanding of Identity and Access Management, as well as setting and auditing access policies
- Experience with cloud-native approaches to security concerns
- Hands-on experience with container and container orchestration technologies: Kubernetes, Docker, Podman, etc.
- Experience working with Infrastructure-as-Code tools
- Intimate understanding of one or more of these monitoring and observability tools: DataDog, Prometheus, Grafana, Jaeger, Honeycomb
- Very strong problem solving & troubleshooting skills, including the ability to perform root cause analysis and preventative analysis
Nice to have:
- Experience building systems in a microservice environment, and an understanding of the basic building blocks of resilient and scalable software
- Experience with web applications developed in Python or Ruby
- Knowledge of some or all of: web/network protocols, security, data persistence, and CI/CD pipelines
- An understanding of modern software development practices: TDD/BDD, hexagonal design, etc.
- An understanding of Linux primitives: process scheduling, signals, namespaces, authentication/authorization, etc.
Perks of the Role
- Become a crucial part of a fast-growing, dynamic company and its expansion.
- The monthly pay rate is in USD.
- Great/flexible work culture.
Remuneration
USD 2,000 – 2,200 monthly.
Deadline: August 31, 2022
Job Features
Job Category: Engineering / Technical