Site Reliability Engineer | Cloud Team
We are a global company with offices in the US, Europe and Asia. In these centers, we carry out the various stages of product development, from initial concept to mass production of ready-to-sell units. We embrace a vertically integrated business model with strategic design, manufacturing, distribution, sales and support centers around the world to maximize our value to customers.
Garmin Private Cloud (GPC) will be our internal cloud, developed entirely using open-source technologies such as OpenStack and Kubernetes. GPC will enable Garmin to fully manage the technology, staffing, and costs associated with our evolving product platform.
The GPC team will be responsible for building and maintaining the platform that supports well-known Garmin services like Garmin Connect, Connect IQ Appstore, Garmin Golf, and many other services.
We believe that collaboration leads to the best ideas, and we rely heavily on team interaction. As a hybrid role based in Cluj-Napoca, this position will require at least 3 days in the office each week.
Responsibilities
- Ensure the integrity of Garmin's production environment is maintained and that all releases into the environment are organized, communicated, and managed.
- Author and lead process improvements to the whole project lifecycle and release process.
- Establish and provide training to development teams on operational processes and automations that promote software integrity and stability.
- Lead design/definition activities for […]- and […]-complexity systems, features, and/or processes.
- Champion the shift-left culture of reliability and delivery performance within software development teams.
- Monitor and support […]- and […]-complexity software releases.
- Design and implement improvements to the software lifecycle and production pipeline through automated tools/systems that align with industry best practices.
- Coordinate and improve monitoring practices across software applications and infrastructure.
- Build and/or maintain tools to generate reports.
- Maintain accurate data to facilitate reporting on key reliability SLOs for multiple products/systems.
- Improve the team’s incident response by nurturing incident playbooks.
- Through post-incident activities, proactively identify and/or implement reliability improvements and automated mitigations of recurrence.
- Cultivate engagement in the SRE community to nurture standards, best practices, and training across product owners, software engineers, and other SREs.
- Participate in capacity planning to ensure software can scale sufficiently at peak times.
- Work collaboratively and professionally in a team environment with other Garmin associates to achieve goals.
Requirements
- Experience with public cloud infrastructures, tools, and processes (Azure, AWS, GCP).
- Experience with designing, developing, and deploying containerized applications (Kubernetes).
- Experience with moderately complex build and deployment automation.
- Experience with scaling cloud native applications in large, high-availability environments.
- Experience with DevOps-style tools such as Jenkins, Maven, GitLab, Nexus, RunDeck.
- Experience with scripting languages such as Python, Groovy.
- Experience with Infrastructure as Code such as Ansible, Terraform, Salt, Chef, Puppet.
- Good understanding of Linux system administration.
- Configuration of complex multi-tiered server applications.
- Effective judgment, discretion, and decision-making abilities.
- Demonstrate strong and effective verbal, written, and interpersonal communication skills.
- Team-oriented, possessing a positive attitude and working well with others.
- Minimum 4 years of relevant work experience.
Would be a plus:
- Proficiency in application languages/frameworks such as Java, Spring Boot, C#, JavaScript, React, Angular.
- Some knowledge of RabbitMQ and Kafka.
- Familiarity with data storage technologies such as RDBMS and NoSQL.
- Experience with OpenStack cloud computing infrastructure and related technologies.
- Experience with APM monitoring tools such as Zabbix, AppDynamics, New Relic, Dynatrace.
- Experience with CDN Providers such as Akamai/Cloudflare.
- Experience with observability tools such as Uptrends, Splunk, Kibana.
Benefits
Benefits to enhance your experience:
- 24 days off each year plus extra vacation days based on years at Garmin and compensation for legal holidays.
- Health package subscription and yearly budget for glasses.
- Monthly budget for sports and wellbeing activities.
- Local and global career development programs (training, mentorship, technical and leadership development, and more).
- Access to e-learning platforms and support for attending technical conferences.
- Loyalty bonus within the company, plus other special bonuses (for holidays and personal life events).
- Meal tickets.
Yours exclusively when part of our team:
- Significant discount on Garmin products.
- Employee stock purchase plan.
- Contribution to the retirement plan (Pillar 3).
- Garmin products available for testing and borrowing.
- A comprehensive event series championing wellbeing, sports, and community tailored to foster holistic health (featuring sports events, classes, hackathons, parties, and more).
- Other benefits which we invite you to discover during the recruitment process.
Garmin Cluj is an equal opportunity employer. Qualified applicants will receive consideration for employment without regard to race, religion, national origin, sex, age, or disability.