Senior SRE Engineer
Description
As a Senior Site Reliability Engineer (SRE), you will play a crucial role in designing, implementing, and maintaining highly scalable and reliable systems and services. Your primary focus will be on ensuring the availability, performance, and efficiency of the company's infrastructure and applications. You will collaborate with
- functional teams, including product, devops & qa, cloud infrastructure teams to drive improvements and solve complex technical challenges.
Responsibilities:
System Design and Architecture: Contribute to the design and architecture of scalable and highly available systems and services, considering factors such as reliability, performance, security, and
- effectiveness.
Infrastructure Automation: Develop and maintain infrastructure automation tools and frameworks, leveraging technologies such as
-
- code (Ia
C) and configuration management tools. Automate deployment, monitoring, and management processes to increase efficiency and reduce manual effort.
Monitoring and Alerting: Implement effective monitoring and alerting systems to proactively identify and resolve issues. Develop and maintain monitoring tools and dashboards to provide
- time visibility into system performance and availability.
Incident Response and Troubleshooting: Respond to critical incidents, perform root cause analysis, and implement preventive measures to minimize the impact of future incidents. Work closely with development teams to address performance bottlenecks and reliability issues.
Capacity Planning and Performance Optimization: Analyze system performance and capacity metrics to identify areas for improvement. Collaborate with teams to optimize resource utilization, enhance system performance, and plan for future growth.
Continuous Improvement and Best Practices: Stay
-
- date with industry best practices, emerging technologies, and trends in Site Reliability Engineering. Drive continuous improvement initiatives, implement best practices, and mentor junior team members.
Collaboration and Communication: Collaborate with
- functional teams, including developers, operations, and product managers, to understand requirements, provide technical guidance, and ensure the reliability and scalability of systems. Communicate effectively with stakeholders, both technical and
- technical, to provide updates and address concerns.
Coaching and Mentoring: Provide guidance and support to junior colleagues, helping them develop their skills and grow in their careers. Share knowledge, review code, and assist with technical challenges to foster a collaborative and
- oriented environment.
Job Qualifications
Requirements:
Experience: Several years of experience in a similar role as a Site Reliability Engineer or Dev
Ops Engineer, with a focus on building and maintaining highly available and scalable systems.
Strong Programming and Scripting Skills: Proficiency in programming languages such as Python, and scripting languages like Bash or Power
Shell. Experience with infrastructure automation tools like Terraform is desirable.
Cloud Computing: Strong knowledge and
- on experience with cloud platforms such as GCP, AWS. Familiarity with containerization technologies like Docker and orchestration tools like Kubernetes.
System Administration: Solid understanding of Linux/Unix systems, networking, and system administration. Experience with managing distributed systems and troubleshooting performance issues.
Monitoring and Logging: Experience with monitoring and logging tools such as Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), or Splunk.
Problem-solving and Analytical Skills: Ability to analyze complex systems, identify problems, and propose effective solutions. Strong troubleshooting and debugging skills.
Collaboration and Communication: Excellent collaboration and communication skills to work effectively with
- functional teams and stakeholders. Ability to explain technical concepts to both technical and
- technical audiences.
Becoming part of P&G, you will benefit of:
Competitive salary package, annual bonus and vacation bonus
Private medical insurance & life insurance & 24 Hours Accident Insurance
Stock Ownership Plan - You have the opportunity to buy P&G shares quoted on the New York Stock Exchange to participate in the profits of the Company and the company will match part of the investments
Flexible working schedule & Hybrid Work from Home/Work from Office option
Meal allowance and access to our private canteen
Sports Program – will allow you to reimburse sport activities
Dental & Glasses Plan - will allow you to reimburse dental and ophthalmological care
Access to a variety of learning and development platforms, including Bookster, Linked
In Learning, Harvard Business Review etc.
Fresh fruits every day in the office
Employee Assistance Program – confidential expert guidance and specialist support on any work, wellbeing, emotional, financial, physical or family issue
Maternity/ paternity support - additional salary protection during maternity leave & additional paternity leave
Watch this video to learn more about our full recruiting process:
Kindly be advised that at P&G, employment is exclusively extended on the basis of an Full-time Employment Contract. Apply only if you agree to these conditions.
Job Schedule
Full timeJob Number
R000111840Job Segmentation
Experienced Professionals (Job Segmentation)Fii primul, care se va înregistra la oferta de muncă respectivă!
-
De ce să cauți de muncă pe Lucrezi.ro?
În fiecare zi oferte noi de muncă Puteți alege dintr-o gamă largă de locuri de muncă: Scopul nostru este de a oferi o gamă cât mai largă de opțiuni Lasă să-ți fie trimise noile oferte prin e-mail Fii primul care răspunde la noile oferte de muncă Toate ofertele de muncă într-un singur loc (de la angajatori, agenții și alte portaluri) Toate serviciile pentru persoanele aflate în căutarea unui loc de muncă sunt gratuite Vă vom ajuta să găsiți un nou loc de muncă