Research Computing Systems (RCS), in the UAF Geophysical Institute, is looking for an experienced Linux Systems Administrator or Storage Engineer to join our team of research cyberinfrastructure analysts and engineers. We are a dynamic group of IT professionals supporting a core set of cyberinfrastructure services for a wide variety of UA research projects, as well as developing and maintaining specific systems for researchers. RCS team members work together and independently to ensure UAF researchers have high-performing and adaptable network, storage, compute, web, and hosted systems services they can depend on to conduct their research. The successful candidate will work with team members to support and enhance a multi-petabyte network of Linux servers and disk arrays, a multi-petabyte tape library, a Versity-based HSM, and Lustre-based parallel filesystems. We have a very flexible office environment and collaborate with multiple groups throughout the GI and within UA’s Office of Information Technology.
Maintain up-to-date system procedures that can be followed by system, storage and network staff as required
Utilize the RT ticketing system for documenting system updates/issues
Essential
Research and develop tools for the collection and analysis of system performance data
Examine, document and report on profiles of system utilization
Monitor system performance and make recommendations for improvements
Develop and utilize capacity planning tools to anticipate future needs and make recommendations concerning future upgrades
Essential
10% System Upgrades and Modernization:
Develop utilities and problem solving tools
Perform analysis, and develop/modify software and systems to circumvent problems and enhance access
Develop regression test cases to solve problems, to implement robust systems and to validate system changes
Support faculty and staff in programming projects as needed
Follow industry hardware and software releases to determine when upgrades are advantageous
Provide procurement support as part of a team
Develop requirements and identify potential recipients for RFPs
Evaluate proposals and make product recommendations
Essential
Provide consulting for staff and users
Assist, or provide guidance to, users and staff with system questions/issues including problem diagnosis
Ensure that staff are notified in advance of system downtime
Alert services staff to potential system and user issues observed in performance of duties
Follow and enhance internal change management and project management standards and practices
Provide input, guidance, and review on system and usage policies
Provide direct support as required to other units on campus sharing infrastructure
Provide staff support for Linux workstations
Essential
20% Maintain Availability and Functionality of Systems:
Perform day-to-day system status reviews and troubleshooting; work with vendors as necessary to ensure fully functioning systems
Develop and maintain baseline of expertise required to support systems and participate in on-call rotation
Maintain technical currency and awareness of technological advances relating to HPC and server technologies and operating systems
Participate in machine room power outages
Install, inventory, modify, maintain, test and integrate system hardware and software to provide access to systems and software tools with minimal negative impact to users by making effective use of downtime
Install and update user software as requested by staff
Take a leading role in overall system responsibility or particular system roles or tasks
Essential
Ensure systems are operated, managed, secured and used in accordance with organizational policies and procedures
Serve as, or support, the Systems Security Officer
Monitor system security advisories and respond appropriately
Monitor system logs and take appropriate action when anomalies are detected
Perform system security scans, analyze results and correct deficiencies
Essential
Maintain professional security certification as appropriate
Achieve and maintain technical professional certification
Author and present papers/posters at professional conferences
Participate in funding/grant writing activities as appropriate
Represent organization at campus, local, state, national and international meetings
Essential
5% Research and Planning:
Track industry trends and share information concerning new technologies of interest
Evaluate and report on new technologies of interest
Work with organization and other units on research and/or implementation of new technologies
Essential
Advanced knowledge of information technologies including hardware and software, network configuration, system administration, database development and administration, data and network security, programming, and system analysis and integration.
Expert level knowledge of analysis, database administration, security, systems administration, and engineering.
Knowledge of multiple systems and ability to understand how systems relate to one another.
Knowledge of, and ability to combine, inter-relationships between disparate problems and formulate situations.
Ability to formulate problem resolution.
Ability to analyze unusual, non-routine or complex situations and problems and devise alternate strategies for solutions.
Advanced knowledge of a specialized area.
Advanced knowledge of managing enterprise level technology.
Ability to understand needs of end users.
Ability to deliver results for the organization.
Ability to extrapolate abstract business problems into a successful targeted architecture benefiting UA.
Ability to guide and mentor peers on tasks related to their area of expertise.
Ability to make decisions on matters of significance and implement these decisions on behalf of the University.
Advanced knowledge of policies, standards and the computing environment.
Project management skills.
Ability to work in a team environment with developers, testers and analysts.
Effective verbal and technical writing communication skills.
Prioritization and organization skills.
Ability to write and maintain good technical documentation.
Ability to lead a deployment or troubleshooting team.
Ability to work scheduled periods of 24x7 on-call support.
Experience troubleshooting and deploying computer hardware and software.
Technical skills to include advanced computer system design and testing.
Advanced system administration skills.
Experience implementing or training with IT security best practices and methods.
Experience writing basic shell scripts.
Experience troubleshooting and deploying peripheral hardware systems.
Required Education and Experience:
A bachelor's degree in an information technology, software development, or science research field, or equivalent experience.
Red Hat Certified Engineer (RHCE) certification or equivalent as determined by supervisor.
Four years of experience in high performance computing and/or in an IT field.
As a public, regional, comprehensive university, UAF is committed to building a culturally diverse and inclusive organization and strongly encourages women, minorities, individuals with disabilities, and veterans to apply.
To ensure consideration, please apply prior to the review date.
To apply for this position, please provide a cover letter, resume, and contact information for three professional references with your application. Please submit your application no later than 7/13/2020 at 11:55 PM Alaska Time. The review of this posting will start on 7/14/2020. This posting will be closed once it is filled.
If you need assistance applying to this posting, please contact GI - Office of Human Resources at 907-474-6498.
Applicants needing a reasonable accommodation to participate in the application and screening process should contact the UA Human Resources office at 907-450-8200.
This position is a term-funded position and is reviewed annually for contract renewal at the University's discretion.
Affirmative Action Statement:
UA is an AA/EO employer and educational institution and prohibits illegal discrimination against any individual: www.alaska.edu/nondiscrimination
The successful applicant is required to complete a background check. Any offer of employment is contingent on the background check.
Pursuant to University Regulation 04.07.020, new employees of the University are employed in an at-will probationary status for the first six months of employment. During the probationary period, employment may be terminated for no reason or any reason. Promoted employees also serve a probationary period with limited rights of retreat.
Public Disclosure Statement:
Your application for employment with the University of Alaska is subject to public disclosure under the Alaska Public Records Act.
University of Alaska is a Drug Free Workplace.
It is the policy of the University of Alaska (UA) that all employees are required to complete training to meet the requirements of the positions they hold, and to complete the required training within a specified period to remain employed at the UA.
If you have any questions regarding this position, please contact GI HR at 907-474-6498.