Education and Experience Requirements:
Bachelor’s degree in Engineering, Computer Science, Information Technology, or a related curriculum, or an equivalent combination of education and experience sufficient to successfully perform the essential functions of the job. A Master’s degree may be used to offset one year of the experience requirement; a PhD may offset two years. Two (2) years of experience required in an HPC or scientific computing environment, including the installation, configuration, and maintenance of RPM-based Linux distributions (RedHat, SuSE).
Provides operational support for the production High Performance Computing (HPC) environment in the Advanced Computing Technologies (ACT) department. Drawing upon the operating plan, design specifications, and technical oversight, leverages enabling technologies to meet the goals, objectives, and strategies of the Computational Fluid Dynamics, Simulation, and Modeling engineering business areas. Responsible for the optimum integration of scientific applications with high performance computing technology.
Principal Duties and Responsibilities:
Essential Functions:
1. Assist with the day-to-day operations of production HPC clusters.
2. Troubleshoot and maintain the InfiniBand network.
3. Assist end users running applications on the HPC cluster(s).
4. Manage, maintain, monitor, and control interactive and batch processes, both scheduled and unscheduled (including on-request processing).
5. Complete engineering-defined batch processing and backups in the correct sequence and within the established time periods.
6. Perform proactive failure trend analysis and root cause analysis for all system failures.
7. Produce trend reports to highlight production issues, and follow predetermined action and escalation procedures when issues are encountered.
8. Monitor, verify, and suggest appropriate adjustments to support proper application execution.
9. Provide technical solutions that meet the performance and processing objectives of the business areas.
10. Follow upgrade plans to ensure compliance with corporate policies and industry best practices.
11. Provide support during data center upgrades and outages.
12. Assist with performance tuning and benchmarking activities.
Additional Functions:
1. Maintain technical relationships with multiple hardware and software vendors.
2. Work multiple operational windows as required to support business objectives.
3. Provide 24x7 on-call support.
4. Comply with technical, hardware, and software standards.
Perform other duties as assigned.
Other Requirements:
1. Experience with the management of Linux-based HPC clusters.
2. Familiarity with high performance/parallel storage.
3. Familiarity with the configuration and management of cluster scheduling software.
Languages Required (in addition to English): None
Requisition Number: 117794
Category: Information Systems
Percentage of Travel: Up to 25%
Employment Type: Full-time
Posting End Date: 10/18/2017
Gulfstream does not provide work visa sponsorship for this position, unless the applicant is a currently sponsored Gulfstream employee.