Lead Edge AI Engineer

  • Job types information is not available.
  • Germany

Lead Edge AI Engineer

Velotio Technologies


About Velotio:

Velotio Technologies is a product engineering company working with innovative startups and enterprises. We have provided full-stack product development for 110+ startups across the globe, building products in the cloud-native, data engineering, B2B SaaS, IoT & Machine Learning space. Our team of 400+ elite software engineers solves hard technical problems while transforming customer ideas into successful products.

We are seeking a talented Edge AI Engineer with specialized expertise in GPU/TPU acceleration to join our team. The ideal candidate will have extensive hands-on experience in local Large Language Models (LLM) inference with embedded GPU/TPU architectures. As Principal Engineer specializing in Edge AI, you will play a crucial role in shaping the future Edge AI solution, leveraging the power of GPU/TPU acceleration and enterprise grade, large scale edge compute.

The successful candidate will combine technical excellence with effective leadership, creating a positive impact on both projects and team dynamics.



Requirements

  • High-Level Design and Architecture
  • Influence the Edge AI strategy by providing expert advice on design and architecture.
  • Make critical decisions regarding technical directions, scalability, and system performance.
  • Develop and optimize AI inference models for deployment on edge devices with embedded GPU/TPU accelerators, focusing on local Low Latency Model (LLM) inference.
  • Implement and fine-tune low-latency model inference pipelines to meet real-time performance requirements.
  • Collaborate with cross-functional teams to integrate AI inference solutions into edge computing platforms and applications.
  • Collaborate with the GPU Hardware Design Team to design and optimize GPUs that power next-generation devices.
  • Conduct performance profiling and optimization to maximize the efficiency of GPU/TPU acceleration for local LLM inference.
  • Work on micro-architecture development, ensuring efficient execution of graphics, compute, and AI workloads within energy and area constraints.
  • Stay current with advancements in GPU/TPU technologies and edge AI frameworks, incorporating them into solution designs as appropriate.
  • Provide technical expertise and support to project teams, ensuring successful implementation and deployment of edge AI solutions.

You will enjoy this role if you…

  • Lead and inspire a team of engineers, providing guidance, setting goals, and ensuring collaboration.
  • Oversee project planning, execution, and delivery, ensuring alignment with business objectives.
  • Manage all phases of technical projects, from conception to completion.
  • Develop project specifications, track progress, and control costs.
  • Foster a positive work environment, encouraging professional growth and knowledge sharing.

Desired Skills & Experience:

  • Bachelor’s degree in computer science, Engineering, or a related field; Master’s degree preferred.
  • 5+ years of hands-on experience in AI model development and deployment, with a focus on edge computing and local LLM inference.
  • Strong programming skills in languages such as Python and C++.
  • Proficiency in LLM frameworks (e.g., vLLM, Text generation inference, OpenLLM, Ray Serve, and HuggingFace Transformers) and deep learning libraries.
  • Extensive experience with GPU/TPU acceleration for AI inference, including optimization techniques (tensor, pipeline, data, sharded data parallelism) and performance tuning.
  • Hands on experience with one or more GPU frameworks: CUDA, Vulkan, OpenCL .
  • Deep knowledge of GPU memory layout, familiarity with NVIDIA Jatison, ARM Mali or relevant SoC configurations.
  • Knowledge of parallel computation, memory scheduling, and structural optimization.
  • Excellent problem-solving and analytical skills, with a passion for innovation and continuous learning.

Bonus points if you:


  • Experience with edge device hardware and software integration.
  • Familiarity with edge computing architectures and IoT platforms.
  • Experience with edge AI applications in domains such as robotics, autonomous vehicles, or industrial automation.


Benefits


Our Culture:

  • We have an autonomous and empowered work culture encouraging individuals to take ownership and grow quickly.
  • Flat hierarchy with fast decision making and a startup-oriented “get things done” culture.
  • A strong, fun & positive environment with regular celebrations of our success. We pride ourselves in creating an inclusive, diverse & authentic environment.

We want to hire smart, curious, and ambitious folks, so please reach out even if you do not have all of the requisite experience. We are looking for engineers with the potential to grow!

At Velotio, we embrace diversity. Inclusion is a priority for us, and we are eager to foster an environment where everyone feels valued. We welcome applications regardless of ethnicity or cultural background, age, gender, nationality, religion, disability or sexual orientation.



Source
remotive.com

Comments are closed.