AI Container Platform Engineer
Intel Corporation, Chandler, AZ 85248
Posted 3 days ago

Job Description

Intel is hiring a senior AI Container Platform Engineer to develop and optimize its AI infrastructure. The role involves collaborating with other team members to design and build a highly resilient Kubernetes container platform optimized for AI workloads on Intel accelerator hardware. Successful candidates will bring a strong desire to solve hard problems, attention to detail, and a focus on building a highly reliable and scalable container platform. A foundational understanding of software development principles and testing methodologies, along with a high degree of independence, adaptability, drive, and willingness to accept new challenges, is a must. Candidates should also have demonstrated software development skills in a range of languages and strong Linux systems expertise.

Responsibilities:

Work with others to design, build, and maintain a Kubernetes container platform on bare-metal, on-premises hardware.
Participate in building advanced tooling for deployment, testing, monitoring, logging, administration, auditing, and operations of multiple Kubernetes clusters in distributed data centers.
Research and implement solutions related to Kubernetes container RBAC, networking, storage, scheduling, registries, certificate management, and more to build a highly reliable, scalable, secure, and resource-optimized AI container platform.
Evaluate and select third-party commercial and open-source components for the AI container platform.

Qualifications

You must meet the requirements below to be initially considered for this position. Preferred qualifications are in addition to the requirements and are considered a plus factor in identifying top candidates. The experience listed below may be obtained through a combination of schoolwork, classes, research, and relevant previous job or internship experience.

Minimum Qualifications:
The candidate must possess a Bachelor’s degree or Master’s degree in Computer Engineering, Computer Science, Information Systems, or a related field with 8+ years of relevant work experience.

5+ years of experience in the areas below:

Python, Golang or another modern programming language
Linux-based operating systems such as CentOS, Ubuntu, SUSE, or Rocky
Bash shell scripting and Linux command-line acumen

2+ years of experience in the areas below:

Working on a software engineering team in a cloud or on-premises data center environment supporting critical services
Linux containers and container runtimes (Docker, containerd, CRI-O)
Kubernetes
IP networking, load balancing, DNS
Pod scheduling and node topology management
Environment as code via configuration management tools such as Ansible, Terraform, Salt, Chef, or Puppet
Container Network Interface (CNI), Container Storage Interface (CSI), and Kubernetes schedulers
Istio and/or other service meshes
AI/ML workloads
Performance benchmarking
Hardware accelerators and specialized devices (GPU, HPU, HPC)
Git development workflow
Kubespray, kops, or kubeadm

Preferred Qualifications:

Slurm, Volcano, MPI, PyTorch, TensorFlow or other schedulers and AI domain frameworks
On-premises data center networking
Cloud development or architecture (AWS, GCP, Azure, etc.)
Secret vault integration with Kubernetes
Identity provider configuration with SSO
Ability to communicate detailed technical concepts in a clear and concise manner.

Job Details

Seniority Level: Experienced (5+ years, non-manager)
Field of Interest: Manufacturing
Employment Type: Full Time
Number of openings: 1


While all employers are vetted to meet the Maricopa Guidelines, the job postings are not individually reviewed. Students should be diligent in ensuring they are applying for positions that meet their needs and are not in violation of the Maricopa guidelines.