Humberger Nav
mployee.me logo
HPC Linux System Administrator
KLA
linkedin
Chennai, Tamil Nadu, India
5-7 years
Not Disclosed
Full time
30 April 2026
Top Skills:
AnsibleApacheArchitectureAutomationBalancingBiosChefCloudConfiguration Management ToolCustomer SupportDevopsDhcpDnsDockerDoeElectronicsGitGrafanaJenkinsKubernetesLinuxLinux DistributionNetworkingNginxOperating SystemPrinted Circuit BoardPrometheusPuppetPythonR&dResearch And DevelopmentSchedulingSemiconductorSubsystemTcp/ipTroubleshootingUbuntu

96

Get Personalized Job Matches with 1 Click

Job Description iconJob Description
Download Resume iconDownload Resume

KLA – Chennai, India

KLA Overview

KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice‑controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us.

KLA invents systems and solutions for the manufacturing of wafers and reticles, integrated circuits, packaging, printed circuit boards and flat panel displays. The innovative ideas and devices that are advancing humanity all begin with inspiration, research and development. KLA focuses more than average on innovation and invests significantly in R&D. Our expert teams of engineers, physicists, and problem‑solvers work together with the world’s leading technology providers to accelerate the delivery of tomorrow’s electronic devices.

Life at KLA is fast‑paced and collaborative, with teams solving complex, real‑world engineering problems.


Key Responsibilities

  • Design, implement, and support on‑prem high‑performance Linux clusters, including initial cluster setup, configuration, and deployment.
  • Demonstrate strong knowledge of on‑prem cluster infrastructure, including CPU/GPU architecture, scalable and robust storage, and high‑bandwidth interconnects.
  • Generate hardware BOMs for on‑prem clusters, work with vendors, and oversee hardware qualification and release activities.
  • Use strong Linux OS skills to install, image, configure, and maintain operating systems for cluster systems.
  • Understand system‑level and subsystem‑level requirements and drive execution to meet project timelines.
  • Support design and release of new products to manufacturing and customers by delivering production‑ready golden images, procedures, scripts, and documentation to manufacturing and customer support teams.

This role focuses on infrastructure, imaging, and cluster bring‑up. It does not require HPC workload scheduling, tuning, or application‑level optimization.


Required Qualifications

  • In‑depth, Linux distribution‑agnostic experience (SUSE, RedHat, Rocky, Ubuntu).
  • Experience designing, deploying, and maintaining storage systems used in on‑prem cluster environments.
  • Strong hardware knowledge across servers, GPUs, networking, storage, BIOS, and BMC, with hands‑on on‑prem setup experience.
  • Experience with systemd, netboot/PXE, and Linux cluster provisioning concepts.
  • Strong understanding of TCP/IP fundamentals and common protocols (DNS, DHCP, HTTP, LDAP, SMTP).
  • Ability to develop and maintain Shell and Python scripts.
  • Experience with one or more configuration management tools (Salt, Chef, Puppet, or similar).


Preferred Qualifications

  • Experience setting up and troubleshooting on‑prem Linux clusters, including hardware bring‑up and OS installation.
  • Exposure to Linux imaging or golden image creation frameworks – packer, ansible,cloud-init, kiwi, ubuntu-image used for repeatable deployments.
  • Experience with GPU‑based systems and high‑performance interconnects (InfiniBand or equivalent) will be an added bonus.
  • DevOps‑oriented mindset with experience using Jenkins, Git‑based repositories, and automation workflows.
  • Familiarity with container technologies (Singularity, Docker) from an infrastructure perspective.
  • Exposure to Kubernetes, Prometheus, and Grafana is a plus.
  • Knowledge of Apache/Nginx, proxy or reverse‑proxy configuration, and load‑balancing concepts (HAProxy).


Skills and Abilities

  • Strong team orientation and ability to collaborate across engineering, manufacturing, and support teams.
  • Excellent organizational and time‑management skills.
  • Ability to troubleshoot complex issues across OS, hardware, networking, and storage layers.
  • Adaptable and effective in fast‑changing environments.
  • Clear written and verbal communication skills.


Minimum Qualifications

  • Doctorate degree with 5+ years of related experience, OR
  • Bachelor’s or Master’s degree with 8+ years of related experience.


Equal Employment Opportunity

KLA offers a competitive, family‑friendly total rewards package and is committed to maintaining an inclusive work environment. KLA is an Equal Opportunity Employer and does not discriminate based on any protected status under applicable law.