Head of Performance Architecture
Join us ! đ
We usually respond within a day
đ Join FlexAI:Â
FlexAI is at the forefront of revolutionizing AI computing by reengineering infrastructure at the system level. Our groundbreaking architecture, combined with sophisticated software intelligence, abstraction, and an orchestration layer, allows developers to leverage a diverse array of compute, resulting in efficient, more reliable computing at a fraction of the cost.Â
The rapid evolution of machine intelligence has created a need for a new system architecture capable of handling high memory capacity and bandwidth. These are critical bottlenecks in pushing machine intelligence to the next level, where compute demand is expected to increase up to 1000 times current levels.
FlexAI has pioneered a groundbreaking solution to tackle these memory challenges. Our innovative compute architecture ensures a well-balanced distribution of memory bandwidth, capacity, and compute density, ensuring maximum utilization of system resources. This architecture is the cornerstone of our datacenter-in-a-box concept, which is enabled by our universal AI compute cloud service. Our hardware solutions are built for seamless deployment with our own AI cloud offerings and other cloud service providers worldwide, setting new standards in performance and efficiency.
 We are looking for a Head of Performance Architecture who is not afraid of pushing boundaries and reimagining whatâs possible. In this role, you will lead the performance architecture team, optimize system performance, and architect solutions that propel our platforms to deliver exceptional speed, efficiency, and reliability. If you're ready to take on a challenge that will leave an indelible mark on the industry, we want to hear from you.
Our innovative compute architecture, coupled with sophisticated software intelligence and orchestration, allows developers to leverage a diverse array of compute, resulting in efficient, more reliable computing at a fraction of the cost. This architecture ensures a well-balanced distribution of memory bandwidth, capacity, and compute densityâforming the backbone of our datacenter-in-a-box concept. Enabled by our universal AI compute cloud service, our hardware solutions set new benchmarks in performance and efficiency, seamlessly integrating with our AI cloud offerings and other cloud service providers worldwide.
Position Overview:
As the Head of Performance Architecture, you will oversee the analysis, design, and optimization of AI systems and infrastructure performance, essentially architecting AI efficiency. This role requires a technical understanding of system architectures and hardware acceleration and the ability to collaborate with experts across multiple disciplines to identify performance bottlenecks, improve system throughput, and ensure that AI models and workloads operate at peak performance, even at scale. The ideal candidate will have a deep understanding of AI architectures, hardware acceleration, performance optimization techniques, and strong leadership skills.
Success at FlexAI requires an entrepreneurial spirit and startup mindset: the ability to rapidly iterate and make meaningful progress while staying focused on our mission to deliver more compute with less complexity. Your proven expertise in cultivating influence, aligning diverse stakeholders, and driving efficient operationsâ while fostering a supportive environment through mentorship and thoughtful leadership of a growing teamâwill be critical to being a highly effective leader.
What youâll do:
Lead and mentor the performance architecture team, fostering a culture of excellence, innovation, and collaboration.
Define and execute the overall performance strategy, ensuring alignment with business goals and technical requirements.
Oversee the analysis and optimization of AI systems' performance, including hardware and software components, to support AI workloads.
Manage system architecture, design, development, and optimization to ensure high performance, scalability, and reliability.
Identify and address performance bottlenecks across AI infrastructures, including CPUs, GPUs/TPUs, memory, storage, and networking.
Collaborate with AI researchers, data scientists, and infrastructure teams, including software engineering, hardware engineering, and infrastructure teams, to optimize system performance across the stack.
Lead performance reviews and architecture evaluations to guide the design of new systems to ensure they meet performance requirements.
Provide guidance on best practices for AI system performance, including workload distribution, resource allocation, and hardware utilization.
Stay current with the latest advancements in AI hardware and software, integrating cutting-edge technologies to drive performance improvements.
What youâll need to be successful:
Bachelorâs or Masterâs degree in Computer Science, Electrical Engineering, or a related field. Advanced degrees are a plus.
10+ years of experience in system performance engineering, with a focus on AI or high-performance computing (HPC), with at least five years in a leadership role.
Proven experience optimizing AI systemsâ performance, including hardware and software components.
Deep knowledge of AI architectures, including GPUs, TPUs, and specialized AI accelerators.
Strong understanding of AI frameworks (e.g., TensorFlow, PyTorch) and how they interact with hardware.
Experience with performance analysis tools, including profiling, benchmarking, and monitoring tools.
Expertise in hardware acceleration technologies and techniques for optimizing AI workloads.
Ability to work with cross-functional teams, including AI researchers, data scientists, and software engineers, to drive performance improvements.
Strong problem-solving skills and the ability to make data-driven decisions.
Model inclusive behaviors and contribute to a culture that respects different backgrounds and perspectives.
Preferred Skills
Experience with distributed AI systems and scaling AI workloads across large-scale infrastructure.
Knowledge of cloud-based AI platforms and performance optimization in cloud environments.
Familiarity with containerized environments (e.g., Kubernetes) and AI performance in these contexts.
Experience with low-level optimization, including assembly-level tuning and compiler optimization for AI workloads.
Strong background in networking performance, including low-latency, high-throughput communication architectures.
What we offer:
- A competitive salary and benefits package, tailored to recognize your dedication and contributions.
- The opportunity to collaborate with leading experts in AI and cloud computing, learning from the best and the brightest, fostering continuous growth.
- An environment that values innovation, collaboration, and mutual respect.
- Support for personal and professional development, empowering you with the tools and resources to elevate your skills and leave a lasting impact.
- A pivotal role in the AI revolution, shaping the technologies that power the innovations of tomorrow.
đ¤ About FlexAI:
Founded by Brijesh Tripathi and Dali Kilani, who bring experience from Nvidia, Apple, Tesla, Intel, Lifen, and Zoox, FlexAI is not just building a product â weâre shaping the future of AI.
đ Offices :
Our teams are strategically distributed across three continentsâEurope, North America, and Asiaâunited by a shared mission: to deliver more compute with less complexity.
- Paris - HQ
- San Francisco (Bay Area) - US office
- Bangalore - India office
đđź Apply NOW!
Youâve seen what this role entails. Now we want to hear from you! Does this opportunity align with your aspirations? If youâre even slightly curious, we encourage you to apply â it could be the start of something extraordinary!
At FlexAI, we believe diverse teams are the most innovative teams. Weâre committed to creating an inclusive environment where everyone feels valued, and we proudly offer equal opportunities regardless of gender, sexual orientation, origin, disabilities, veteran status, or any other facets of your identity that make you uniquely you.
- Department
- R&D HW
- Locations
- San Francisco (Bay Area)
- Remote status
- Hybrid
- Employment type
- Full-time
Head of Performance Architecture
Join us ! đ
Loading application form
Already working at FlexAI?
Letâs recruit together and find your next colleague.