
Job Details

Principal Engineer Distributed AI Systems Architecture (Heterogeneous Compute)

2026-05-31     Intel Corporation     all cities, TX
Description:

Principal Engineer

We are seeking a Principal Engineer to define and architect the next generation of distributed AI systems across heterogeneous compute platforms, including CPUs, GPUs, IPUs/FNICs, and emerging dataflow accelerators. This role focuses on one of the hardest problems in modern computing: how to dynamically execute and optimize large-scale AI computation graphs across diverse hardware while managing state, locality, and performance at system scale. You will operate at the intersection of systems architecture, high-performance computing, and AI infrastructure, defining the execution model, runtime abstractions, and placement strategies that turn a rack of heterogeneous devices into a coherent, programmable system.

Key Responsibilities

- Dynamic Execution of Distributed Computation Graphs
  - Define a runtime model for executing AI workloads as distributed computation graphs across heterogeneous resources
  - Design abstractions for graph representation, dependencies, and execution semantics
  - Enable dynamic scheduling and execution across CPUs, GPUs/specialized accelerators, and IPUs/FNICs
- Stateful Scheduling and Memory-Centric Architecture
  - Architect systems where state (e.g., KV cache) is a first-class concern in scheduling and execution
  - Distributed inferencing solution: define models for data locality, memory hierarchy, and state ownership
  - Optimize for minimal data movement and efficient access to distributed state
- Graph Introspection and Automated Partitioning
  - Develop mechanisms to analyze AI computation graphs and classify stages by:
    - Compute intensity
    - Memory bandwidth requirements
    - Communication cost
    - Latency sensitivity
  - Drive automated or semi-automated partitioning of workloads across heterogeneous compute
- Integration of Specialized Accelerators
  - Architect frameworks that treat specialized accelerators (e.g., dataflow engines) as first-class execution targets
  - Define execution boundaries, data exchange models, and integration strategies across device classes
  - Enable interoperability across diverse compute paradigms without sacrificing performance
- MoE-Aware Execution and Adaptive Placement
  - Design runtime strategies for Mixture-of-Experts (MoE) models, including:
    - Expert placement
    - Routing locality
    - Load balancing vs. data movement trade-offs
  - Enhance existing frameworks for MoE and optimize the communication path with IPUs/FNICs and the compute path with Intel Accelerators
  - Enable adaptive execution based on real-time system signals (latency, utilization, skew)
- Adaptive Runtime and Feedback-Driven Optimization
  - Define observability and telemetry models for distributed AI execution
  - Build feedback loops that continuously optimize placement, scheduling, and resource utilization
  - Drive system-level performance across latency, throughput, and efficiency metrics

Qualifications

Minimum Qualifications:

- Bachelor's or BS degree in Computer Science, Software Engineering, or a related specialized field, or equivalent experience per business needs
- 12-plus years of experience with a Bachelor's degree
- Proven expertise in defining and implementing software architectures for AI frameworks, protocols, and algorithms
- Deep experience in systems architecture, high-performance computing, or distributed systems
- Strong background in parallel or data-parallel computation models
- Experience with heterogeneous compute environments (CPU, GPU, DSP, or accelerators)
- Proven ability to design end-to-end systems from abstraction through implementation
- Strong understanding of performance trade-offs across compute, memory, and interconnect

Preferred Qualifications:

- 8-plus years of experience with a Master's degree, or 6-plus years of experience with a PhD
- Experience with AI/ML systems, inference infrastructure, or large-scale model serving
- Familiarity with stream processing, dataflow models, or graph execution systems
- Knowledge of modern AI frameworks or runtimes
- Experience building developer-facing SDKs or programming models
- Background in performance optimization and benchmarking

Requirements listed would be obtained through a combination of industry-relevant job experience, internship experiences, and/or schoolwork/classes/research.

- Operate as a technical leader and architect, not just an implementer
- Drive cross-team alignment across hardware, software, and infrastructure
- Influence long-term system design and platform direction
- Mentor engineers and shape architectural thinking across the organization

Job Type

Experienced Hire

Shift

Shift 1 (United States of America)

Primary Location:

US, California, Santa Clara

Additional Locations:

US, Oregon, Hillsboro; US, Texas, Austin

Business Group:

At the Data Center Group (DCG), we're committed to delivering exceptional products and delighting our customers. We offer both broad-market Xeon-based solutions and custom x86-based products, ensuring tailored innovation for diverse needs across general-purpose compute, web services, HPC, and AI-accelerated systems. Our charter encompasses defining business strategy and roadmaps, product management, developing ecosystems and business opportunities, delivering strong financial performance, and reinvigorating x86 leadership. Join us as we transform the data center segment through workload-driven leadership products and close collaboration with our partners.

Posting Statement:

All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation, or any other characteristic protected by local law, regulation, or ordinance.

Position of Trust

This role is a Position of Trust. Should you accept this position, you must consent to and pass an extended Background Investigation, which includes (subject to country law) extended education, SEC sanctions, and additional criminal and civil checks. For internals, this investigation may or may not be completed prior to starting the position. For additional questions, please contact your Recruiter.

Benefits

We offer a total compensation package that ranks among the best in the industry. It consists of competitive pay, stock bonuses, and benefit programs which include health, retirement, and vacation. Find out more about the benefits of working at Intel.

Annual Salary Range for jobs which could be performed in the US: $255,850.00-361,200.00 USD

The range displayed on this job posting reflects the minimum and maximum target compensation for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific compensation range for your preferred location during the hiring process.

Work Model for this Role

This role will require an on-site presence.

* Job posting details (such as work model, location or time type) are subject to change.

ADDITIONAL INFORMATION: Intel is committed to Responsible Business Alliance (RBA) compliance and ethical hiring practices. We do not charge any fees during our hiring process. Candidates should never be required to pay recruitment fees, medical examination fees, or any other charges as a condition of employment. If you are asked to pay any fees during our hiring process, please report this immediately to your recruiter.


