Senior Embedded Engineer/Architect with AI for Edge - Romania Location
Răspunde la anunț- on experience in local Large Language Models (LLM) inference with embedded GPU/TPU architectures. As Principal Engineer specializing in Edge AI, you will play a crucial role in shaping the future Edge AI solution, leveraging the power of GPU/TPU acceleration and enterprise grade, large scale edge compute.
- Influence the Edge AI strategy by providing expert advice on design and architecture.
- Make critical decisions regarding technical directions, scalability, and system performance.
- Develop and optimize AI inference models for deployment on edge devices with embedded GPU/TPU accelerators, focusing on local Low Latency Model (LLM) inference.
- Implement and
- tune
- latency model inference pipelines to meet
- time performance requirements. - Collaborate with
- functional teams to integrate AI inference solutions into edge computing platforms and applications. - Collaborate with the GPU Hardware Design Team to design and optimize GPUs that power
- generation devices. - Conduct performance profiling and optimization to maximize the efficiency of GPU/TPU acceleration for local LLM inference.
- Work on
- architecture development, ensuring efficient execution of graphics, compute, and AI workloads within energy and area constraints. - Stay current with advancements in GPU/TPU technologies and edge AI frameworks, incorporating them into solution designs as appropriate.
- Provide technical expertise and support to project teams, ensuring successful implementation and deployment of edge AI solutions.
- Lead and inspire a team of engineers, providing guidance, setting goals, and ensuring collaboration.
- Oversee project planning, execution, and delivery, ensuring alignment with business objectives.
- Manage all phases of technical projects, from conception to completion.
- Develop project specifications, track progress, and control costs.
- Foster a positive work environment, encouraging professional growth and knowledge sharing.
- Bachelor’s degree in computer science, Engineering, or a related field; Master’s degree preferred.
- 5+ years of
- on experience in AI model development and deployment, with a focus on edge computing and local LLM inference. - Strong programming skills in languages such as Python and C++
- Proficiency in LLM frameworks (e. g. , v
LLM, Text generation inference, Open
LLM, Ray Serve, and Hugging
Face Transformers) and deep learning libraries. - Extensive experience with GPU/TPU acceleration for AI inference, including optimization techniques (tensor, pipeline, data, sharded data parallelism) and performance tuning,
- Hands on experience with one or more GPU frameworks: CUDA, Vulkan, Open
CL - Deep knowledge of GPU memory layout, familiarity with NVIDIA Jetson, ARM Mali or relevant So
C configurations. - Knowledge of parallel computation, memory scheduling, and structural optimization
- Excellent
- solving and analytical skills, with a passion for innovation and continuous learning.
- Experience with edge device hardware and software integration.
- Familiarity with edge computing architectures and Io
T platforms. - Experience with edge AI applications in domains such as robotics, autonomous vehicles, or industrial automation.
About R Systems:
WE SPEAK DIGITAL. Overview
Formerly known as Computaris, we are the European branch of R Systems - global technology and analytics services company. We help our clients achieve
-
- market, overcome digital barriers, and create business value with our specialized service offerings and consultative business approach. We speak the language of business as fluently as we do the language of technology. In other words: We speak Digital. Our goal: accelerate our clients’ digital leadership. Our clients choose to partner with us for Cloud transformation, automation, data science, analytics, and product engineering, thanks to our technical expertise, domain knowledge, global presence and a record of delivering
- class solutions consistently for over 28 years. With a global workforce of 4, 800+ employees spread across 16
- centers and 26 offices, we continue to empower organizations with
- edge technologies. In Europe, R Systems has offices in the UK, Romania, Poland, Moldova and Switzerland, providing digital transformation services and extensive telecom expertise.
Fii primul, care se va înregistra la oferta de muncă respectivă!
-
De ce să cauți de muncă pe Lucrezi.ro?
În fiecare zi oferte noi de muncă Puteți alege dintr-o gamă largă de locuri de muncă: Scopul nostru este de a oferi o gamă cât mai largă de opțiuni Lasă să-ți fie trimise noile oferte prin e-mail Fii primul care răspunde la noile oferte de muncă Toate ofertele de muncă într-un singur loc (de la angajatori, agenții și alte portaluri) Toate serviciile pentru persoanele aflate în căutarea unui loc de muncă sunt gratuite Vă vom ajuta să găsiți un nou loc de muncă