HPC Engineer Job Description
HPC Engineer Duties & Responsibilities
To write an effective HPC engineer job description, begin by listing detailed duties, responsibilities and expectations. We have included HPC engineer job description templates that you can modify and use.
Sample responsibilities for this position include:
HPC Engineer Qualifications
Qualifications for a job description may include education, certification, and experience.
Licensing or Certifications for HPC Engineer
List any licenses or certifications required by the position: YARN, AWS, ITIL, DHCP, DNS, IAT
Education for HPC Engineer
Typically a job would require a certain level of education.
Employers hiring for an HPC engineer most commonly prefer candidates with a relevant degree, such as a Bachelor's or Master's degree in Computer Science, Engineering, Education, Computer Engineering, Technical, Science, Mathematics, Software Engineering, Information Systems, or Electrical Engineering.
Skills for HPC Engineer
Desired skills for HPC engineer include:
Desired experience for HPC engineer includes:
HPC Engineer Examples
HPC Engineer Job Description
- Automating benchmarking infrastructure and reporting scores
- Integrating the software with standard TCG validation, test, and automation infrastructures
- Developing several internal and customer-facing demos at various stages of the product cycle
- Automate and orchestrate delivery of product-based solutions
- Perform day-to-day application deployment and integration tasks in support of the Ford HPC environment
- Develop and facilitate HPC application deployment project plans
- Develop and maintain application launch and management middleware and tools
- Develop methodology and tools to integrate Ford HPC services within the Product Development CAE processes and methods
- Support Ford application development teams developing and integrating HPC applications
- Perform CAE application performance benchmarking and tuning
- SQL database programming
- Reliability, availability, serviceability (RAS) experience
- Experience with MPI
- XML transformation
- Prototype and develop innovative features for cluster system software components
- Evaluate, test, and validate cluster systems components
HPC Engineer Job Description
- Work closely with business partners to establish functional and performance requirements
- Design and build configuration management leveraging DSC/Chef
- Streamline deployment of infrastructure into consumable services through automation and self-service
- Automation of everything (Infrastructure-as-Code methodologies)
- Help design and build CI pipelines around this automation
- Build orchestration to loosely couple complex and heterogeneous systems
- Design and build IaaS control plane and compute fabric
- Documentation for handoff to operational teams
- Serve as a principal collaborator on major projects, and be responsible for fostering broader community-wide benchmarking efforts where organizations outside of NERSC will use NERSC benchmarks and report back results
- Develop and optimize codes for HPC architecture, developing libraries and tools for Research team
- Provide rapid response and support for stakeholders
- C/C++ programming skills and/or Perl or Python scripting
- HPC cluster building and/or cluster provisioning
- MPI and other parallel programming models
- Hands-on server/hardware debugging
- The OpenMP programming model
HPC Engineer Job Description
- Champion best practices, stay ahead of the curve on new HPC technologies, and plan code for future growth
- Provide primary support to the High Performance Computing Site Lead
- Provide support for remote HPC systems: debug workflow, system, and application issues; optimize HPC user job performance; develop scripts and procedures for system, job, and process monitoring; maintain allocation reporting infrastructure; automate generation of charts of system metrics; contribute to development of system test infrastructure; develop test cases based on use scenarios and user requirements
- Tiered user support: respond to moderately complex customer inquiries (Tier-2); assist customers in resolving technical problems by providing guidance regarding software and hardware problems; identify, evaluate, and prioritize customer problems to ensure that inquiries are resolved appropriately; write, update, and enhance system documentation concerning problem resolutions and HPC assets
- Act as primary point of contact for all licensing, installation and HPC use queries across the Americas
- Day-to-day system administration of several Linux based HPC systems
- Develop system tools for automation and monitoring
- Proactively monitor, analyze and correct system issues
- Architect and design HPC systems to meet requirements
- Diagnose and troubleshoot complex large-scale system issues
- Expert systems
- Design and implement "HPC appliances" to lower the barrier of entry to HPC for scientists and researchers working in various fields, including bioinformatics, next-generation gene sequencing, and drug research
- Deep understanding of parallel programming concepts and experience with tuning/optimizing parallel codes, particularly in the life sciences field
- Hands-on expertise architecting systems for gene sequencing, molecular dynamics, computational chemistry, or similar applications
- Experience with benchmarking, performance modelling, and performance extrapolation
- Domain knowledge in HPC and system software such as cluster management/provisioning tools, job schedulers, MPI, etc
HPC Engineer Job Description
- This position will require occasional night and weekend support
- Work with the CFSAN HPC Program Technical lead and team to manage, recommend and implement (if required) significant HPC architecture changes to support the diverse scientific disciplines within FDA
- Manage, support, install, upgrade, and configure applications, software-packages, and Linux OS on HPC environments
- Troubleshoot and resolve issues related to applications/software, Linux OS, Grid Engine scheduler, and wrapper scripts used in the HPC environments
- Manage and support HPC peripheral services such as Nagios, Ganglia, DNS, Repositories, XDMoD, GridFTP
- Support the HPC user community
- Support scientific data center and operation logistics for assigned areas
- Manage, configure and support High Availability services
- Position will require a self-starter who has the initiative to identify issues on their own, develop solutions, identify needed services, and recommend changes, including new technologies or new ways to leverage existing technologies
- Articulate customer requirements to Product Management, Marketing and R&D teams
- MS or Ph.D. in CS/CE or a relevant engineering field and 10+ years of experience
- Design and implement "HPC appliances" to lower the barrier of entry to HPC for engineers designing and manufacturing cars, aircraft, and engines
- Deep understanding of parallel programming concepts and experience with tuning/optimizing parallel codes, particularly in CFD, CAE, and structural analysis
- Hands-on expertise architecting systems for CFD, CAE, structural analysis, or similar applications
- Experience with benchmarking, performance modelling, and performance extrapolation, particularly with ISV applications such as Ansys Fluent, Ansys Mechanical, LS-Dyna, Nastran, Abaqus, StarCD, etc
- MS or Ph.D. in CS/CE/EE and 5+ years of experience
HPC Engineer Job Description
- Manage the coverage of tests including automation for the Code Advisor product using Selenium and the Robot test framework
- You will need to be willing to learn REST APIs and the Go language in order to technically assess and potentially write code
- Conducting or participating in multidisciplinary research and collaborating with design, layout and/or hardware engineers in the design
- Collaborating with partners and customers on performance requirements and strategy
- Successful candidates will be flexible, with a variety of skills and a commitment to extend the breadth and depth of their knowledge
- This job may involve travel-time to customers and conferences for face-to-face communication and relationship building
- Participate in the design and implementation of multiple Linux-based HPC, Infrastructure and Parallel file system servers and clusters
- Build, configure, and maintain multiple RAID controllers and disk enclosure systems
- Deploy and maintain InfiniBand fabrics for compute and storage networks
- Monitor installation of software releases, patches of the operating system, and third-party utilities with emphasis on overall system security
- Hands-on experience with high performance computing cluster architecture and underlying building blocks
- Must be familiar with the Linux operating system as an integration engineer
- Should be familiar with integration and system testing
- Exposure to MPI architecture is desired
- Understanding of cluster management/monitoring frameworks (Ganglia, Nagios) and HPC cluster deployment is required
- Understanding of Intel or AMD Processor Architecture is required