DevOps / Site Reliability Engineer
Job Location: remote in Romania
Recruitment process:
- HR discussion
- Technical discussion
Role description:
We have a new opportunity for a seasoned Site Reliability Engineer who will work alongside the development, architecture and service management teams.
This role is instrumental in bridging the gap between development and operations, applying engineering principles to operational challenges to drive continuous improvement and innovation.
Responsibilities:
- Infrastructure Management: Design, build, and maintain scalable and resilient AWS cloud infrastructure.
- Reliability and Performance: Implement monitoring, alerting, and remediation strategies to maintain system health and performance.
- Automation: Create and manage CI/CD pipelines, automate routine tasks using Infrastructure as Code (IaC) tools such as Terraform and CloudFormation.
- Incident Management: Proactively monitor and respond to system reliability issues, ensuring high availability and reducing downtime.
- Collaboration: Work with development, security, and operations teams to ensure seamless integration and operational reliability of applications.
- Security: Ensure best practices in security and compliance, addressing vulnerabilities in AWS environments.
- Cost Optimization: Analyze and optimize AWS resource usage to balance performance, scalability, and cost.
- Documentation: Write and maintain technical documentation, including architecture diagrams, runbooks, and incident reports.
Profile:
- Knowledge and hands-on experience with cloud platforms and Infrastructure as a Service (IaaS) offerings, preferably Amazon Web Services (AWS) or Microsoft Azure.
- AWS certification (e.g. AWS Solutions Architect Associate or Professional) or other industry certification is beneficial.
- Significant experience in DevOps and SRE implementation, and in evolving practices and ways of working through multi-disciplinary teams, business frameworks and culture.
- AWS Expertise: Strong hands-on experience with AWS services, including EC2, S3, RDS, Lambda, VPC, IAM, and CloudWatch.
- Automation Tools: Proficiency in Infrastructure as Code (IaC) tools like Terraform and CloudFormation, and in scripting languages such as Python, Bash, or similar.
- CI/CD Pipelines: Experience with CI/CD tools such as Jenkins, GitLab CI, or AWS CodePipeline.
- Strong and proven Java skills
- Linux and networking fundamentals
- Experience with containerisation, ideally using Docker and Kubernetes
- Expertise in observability principles and practices, encompassing monitoring, logging, tracing, and alerting systems to ensure transparency and actionable insights into system performance and health. Tools: Dynatrace, Datadog.
- Demonstrated ability to diagnose and resolve performance and optimization issues efficiently.
- Experience across the entire stack: hardware, application and network