
CloudDevs: Senior Site Reliability Engineer (SRE)


Headquarters: San Francisco

URL: https://clouddevs.com/

Location: LATAM, Europe

 

CloudDevs works with fast-moving, venture-backed startups across the US. We’re building a pool of world-class Site Reliability Engineers for current roles and for upcoming opportunities. You’ll either be placed directly into one of our partner startups or added to our vetted SRE network for future projects.

This role is ideal for engineers who care about reliability, metrics, performance, and building simple, scalable systems. If you enjoy designing for scale and improving how teams ship software, you’ll fit right in.

Key Responsibilities
Work as a hands-on engineer focused on system reliability, performance, and observability.
Define and track SLIs, SLOs, and error budgets.
Optimize monitoring cost and signal quality across metrics, logs, and traces.
Improve deployment safety, canary rollouts, and UAT pipelines.
Build tools for automated and local performance testing and track benchmarks.
Lead resilience work like failover drills, chaos tests, and redundancy checks.
Partner with engineering teams to improve scaling patterns and architecture as the product grows.
Support incident response processes and help reduce operational noise.
Write clean, maintainable code in Go, Python, or Node.js.
Contribute to CI/CD improvements and automation efforts.
Collaborate with engineers across teams to raise reliability standards.

Requirements
5+ years in SRE, DevOps, or Platform Engineering roles.
Strong experience with cloud infrastructure (AWS preferred), Terraform, and Kubernetes.
Deep knowledge of observability tools like Datadog, Prometheus, or OpenTelemetry.
Strong debugging skills across services, networking, and data layers.
Hands-on experience designing and monitoring SLIs/SLOs.
Experience with CI/CD tools such as GitHub Actions, Jenkins, or ArgoCD.
Ability to write production-grade code in Go, Python, or Node.js.
Comfortable working independently in fast-paced environments.

Nice to Have
Experience tuning observability costs and optimizing data ingestion.
Exposure to chaos engineering and progressive deployments.
Background with high-throughput or latency-sensitive systems.
AWS at scale (EKS, Lambda, DynamoDB, S3).
Experience in regulated industries like fintech, payments, or SOC 2 environments.
Performance testing pipelines or load-testing automation.
Experience handling systems processing tens of millions of API calls.

Open Pool for SREs
Even if you don’t meet every requirement or aren’t a fit for the current role, strong SREs with real production experience are welcome to join our talent pool. We regularly place engineers with different strengths across reliability, DevOps, platform, observability, backend, and infrastructure engineering.

 

To apply: https://weworkremotely.com/remote-jobs/clouddevs-senior-site-reliability-engineer-sre

