Skip to Main Content

IT Manager

Below you will find the details for the position including any supplementary documentation and questions you should review before applying for the opening.  To apply for the position, please click the Apply for this Job link/button.

If you would like to bookmark this position for later review, click on the Bookmark link.  If you would like to print a copy of this position for your records, click on the Print Preview link.

Position Details

Position Information

Recruitment/Posting Title IT Manager
Job Category Staff & Executive - Information Technology
Department Office of Economic Development
Overview

The Rutgers Discovery Informatics Institute (RDI2) Advanced Computational and data Infrastructure (ACI) Ecosystem permeates all aspects of RDI2 and includes facilities for high-performance computing and communications, data management, computational services, and advanced analytics, and supports a diverse user community at RDI2, across Rutgers, and beyond. The goal is to support both science and engineering enabled by ACI, as well as the science and engineering required to develop and advance ACI. The RDI2 ACI should also provide global linkages to the national and international ACI that will be used, for example, to link observational instruments, data streams, experimental tools, and testbeds.

Posting Summary

Rutgers, The State University of New Jersey, is seeking an IT Manager for The Office of Research and Economic Development. This position will be under the leadership of the Associate Director for Technical Operations and, in turn, of the Director of the Rutgers Discovery Informatics Institute (RDI2) as per the attached organizational chart.

Among the key duties of this position are the following:
• Coordinates and executes the setup, configuration, and support of parallel computing systems (hardware and software) and other similar computational and data systems.
• Administers ELF and Caliburn clusters other research infrastructure such as the CAPER and Spring clusters.
• Consults with faculty to understand their needs and help them use the system intelligently, provide training for users, and develop policies for use of the system.
• Supports the operation and maintenance of these system as well as the RDI2 community in user support issues. While this position is not a programming position, some knowledge of parallel programming and the associated software stacks will be required, in order to help faculty use the system to its fullest.

Position Status Full Time
Hours Per Week
Daily Work Shift Day
FLSA Exempt
Grade 08
Position Salary
Annual Minimum Salary 96072.000
Annual Mid Range Salary 121529.500
Annual Maximum Salary 146987.000
Standard Hours 37.50
Union Description Admin Assembly (MPSC)
Payroll Designation PeopleSoft
Benefits

Information about the benefits provided to Rutgers employees can be found at: http://uhr.rutgers.edu/benefits/benefits-overview

Seniority Unit
Terms of Appointment Staff - 12 month
Position Pension Eligibility ABP

Qualifications

Minimum Education and Experience

• Requires a bachelor’s degree in computer science or an equivalent combination of education and experience and six or more years of related systems administration, including configuration and operation of complex, interconnected systems, networks, and support software such as scheduling systems.

Required Knowledge, Skills, and Abilities

• Must have successful project planning and management experience, supervisory experience, and good interpersonal communication skills.
• Requires the ability to correctly identify and diagnose complex system and network problems.
• Must be able to handle multiple, shifting priorities; a large volume of technical problem resolution; and work well with senior staff in a team oriented, and cooperative environment.
• Must be willing to work with academic researchers on executing their applications on the systems.

Preferred Qualifications

• Linux system administration: local file systems, kernel modules, SNMP, node monitoring, performance tuning.
• High Performance Computing runtime, libraries and tools deployment, configuration and administration
• Batch system (e.g., SLURM, PBS, etc.) setup, configuration and scheduler policy management.
• Networking: IB (and OPA, preferably) and ethernet driver installation, tuning and administration
• Advanced network administration (security/monitoring appliances)
HPC- and SAN-based storage set up and administration (e.g., GPFS, Lustre)
• User management and accounting.
• Scientific applications and libraries installation, configuration and administration.
• Scripting (Shell, Python, etc.)
• Parallel computing concepts including shared memory, distributed memory and hardware acceleration (e.g., GPU)
• Parallel programming paradigms and runtimes (e.g., MPI, OpenMP, CUDA, etc.) and programming environments (including compilers, debuggers, profilers, etc.)
• Profiling and performance monitoring
• Automation tools such as Ansible, Puppet and Chef
• Project management tools and other RDI2-related services administration (e.g., Redmine, Jira, LDAP, etc.)

Equipment Utilized

• This position will be initially responsible for the administration of ELF and Caliburn clusters and other research infrastructure such as the CAPER and Spring clusters, and supporting their user communities. ELF is a 144-node x86-based cluster with an IB-based interconnect, delivering 3,456 Intel cores, 38TB of RAM memory, 1PB usable storage and 16 GPU nodes. Caliburn is a 560-node x86-based cluster with Intel OPA interconnect, delivering 20,160 Intel cores, 140TB of RAM memory, 218TB of non-volatile memory. ELF and Caliburn together deliver ~800TFlops peak performance.
• The Caliburn and ELF systems (like other large parallel systems) are not single computers. They contain a large number of systems, with special-purpose networks connecting them, a high-performance storage subsystem. Programming such a system is a specialized task.
• These clusters use SLURM queuing system.
• This position requires significant experience using queueing systems, including SLURM.

Physical Demands and Work Environment

• Physical demands are those typical for the management of computer systems and their setup such as racking servers, cabling equipment, datacenter work and other similar tasks
• While this position is a permanent technical staff member; however, we expect graduate assistants and postdoctoral research associates to be placed in departments to help faculty adapt and optimize code for these systems.

Special Conditions

Posting Details

Posting Number 17ST2434
Posting Open Date
Special Instructions to Applicants
Regional Campus Rutgers University-New Brunswick
Home Location Campus Busch (RU-New Brunswick)
Location Details

Affirmative Action/Equal Employment Opportunity Statement

It is university policy to provide equal employment opportunity to all its employees and applicants for employment regardless of their race, creed, color, national origin, age, ancestry, nationality, marital or domestic partnership or civil union status, sex, pregnancy, gender identity or expression, disability status, liability for military service, protected veteran status, affectional or sexual orientation, atypical cellular or blood trait, genetic information (including the refusal to submit to genetic testing), or any other category protected by law. As an institution, we value diversity of background and opinion, and prohibit discrimination or harassment on the basis of any legally protected class in the areas of hiring, recruitment, promotion, transfer, demotion, training, compensation, pay, fringe benefits, layoff, termination or any other terms and conditions of employment. For additional information please see the Non-Discrimination Statement at the following web address: http://uhr.rutgers.edu/non-discrimination-statement

Supplemental Questions

Required fields are indicated with an asterisk (*).

Applicant Documents

Required Documents
  1. Resume
  2. Cover Letter/Letter of Application
  3. List of References
Optional Documents