Systems Monitoring and Scheduling Developer

University of Chicago

Software Engineering
Chicago, IL, USA
Posted on Friday, October 27, 2023

Department

F&A ITS - Monitoring and Reporting


About the Department

IT Services collaborates with campus partners to support the mission of the University of Chicago through the consistent delivery of high-quality solutions and services.

• We provide secure, stable, and reliable infrastructure and applications to support the mission of the University.
• We support and enable faculty research and teaching with the effective use of technology.
• We simplify the technology experience for faculty, students, alumni, and staff, and we ensure technology is mobile-friendly and accessible.
• We identify, manage, and mitigate the technology risks of the University.


Job Summary

Under limited direction, the Systems Monitoring and Scheduling Developer will support and enhance the University of Chicago’s critical IT infrastructure monitoring and job scheduling environments. System monitoring will support alerts, automation, and metrics for early warning of system problems and for service-impacting events. This role will also support both mainframe and open system job scheduling, including scripts, error correction, and enhancements. The developer will work closely with network, voice, security, and server teams to advance monitoring automation and application integration, creating a comprehensive monitoring environment. These services are key to the 24x7x365 global operations of the University of Chicago’s IT Services offerings.

At a minimum, the Systems Monitoring and Scheduling Developer will:
• Engage with faculty, staff, and campus divisions, providing monitoring, recovery, and escalation of systems and servers.
• Participate in regular team meetings, developing and providing appropriate status reporting and technical documentation.
• Suggest and implement approved best practices and information technology policies that result in the highest quality systems administration.
• Provide hands-on development, working independently and as part of a team.

Responsibilities

  • Responsible for day-to-day service delivery in a monitoring environment that operates 24 hours a day, 7 days a week, 365 days a year.
  • Participates in IT Services Disaster Recovery and Business Continuity initiatives.
  • Collaborates with SMEs to document the root cause of service-impacting events.
  • Schedules and manages mission-critical production job streams utilizing shell script programming and automated scheduling packages.
  • Develops event management through the tools and processes available.
  • Provides valuable support in managing critical outage processes as required.
  • Develops and maintains documentation for current services and support, including process documentation and knowledge articles.
  • Ensures consistent documentation, procedures, and process management.
  • Enhances monitoring and scheduling logic, optimizes monitoring parameters, and elevates alarm capabilities.
  • Works with stakeholders to compile and assess user feedback to improve monitoring automation and job scheduling.
  • Manages and develops data-analysis tools to optimize and trend system performance.
  • Engages with clients to formulate and execute monitoring processes tailored to their specific requirements, encompassing supervisory control and data acquisition.
  • Assesses customer and regional monitoring protocols to support compliant data processing.
  • Generates comprehensive status reports and establishes a well-structured communication rhythm to ensure seamless alignment among all stakeholders.
  • Participates in a range of processes and procedures related to the monitoring and control of minicomputers or mainframe computers and peripheral equipment, such as printers and tape and disk drives, by operating the central console or on-line terminals.
  • Analyzes operating records such as machine performance and production reports.
  • Configures, installs, upgrades, and maintains server applications and hardware. Works to safeguard the integrity of computer software. Implements operating system enhancements to improve the reliability and performance of the system.
  • Administers operating systems, maintains security, and implements backup procedures for the organization's information systems and peripheral equipment, such as servers, desktops, printers, and storage devices.
  • Plans and installs necessary patches and upgrades for servers and their associated storage, network, communications, and peripheral sub-systems. Installs and maintains an appropriate level of intrusion detection, monitoring, and auditing software as required.
  • Tracks compliance and maintains documentation for hardware, software, and service inventories for management reports.
  • Performs other related work as needed.


Minimum Qualifications

Education:

Minimum requirements include a college or university degree in a related field.

---
Work Experience:

Minimum requirements include knowledge and skills developed through 5-7 years of work experience in a related job discipline.

---
Certifications:

---

Preferred Qualifications

Education:

  • Bachelor’s degree in computer science, engineering, or other technical fields.
  • Working knowledge of industry monitoring tools such as CA Spectrum, SCOM, Nagios, or similar.

Experience:

  • Minimum two years of experience with CA Workload Automation tools.
  • Minimum two years of experience with CA Spectrum monitoring tools.
  • Minimum two years of experience with major DCIM software.
  • Minimum five years of experience managing monitoring and operation tools in an enterprise environment.
  • At least three years of experience providing support within a university environment.

Technical Skills or Knowledge:

  • Running tests/validation of products and solutions; demonstrating solutions, documentation, and handover to support teams.
  • Technical knowledge of Windows, Linux, and/or Networking.
  • Working knowledge of CA Spectrum and other types of monitoring tools.
  • Technical knowledge of and experience with an IBM/MVS environment.
  • Working knowledge of mainframe and open system job scheduling tools.
  • Working knowledge of system and process automation.
  • Proficiency with Microsoft Word, Excel, and Visio.
  • Minimum five years of experience with high-level programming languages, such as Python, Go, Java, or C/C++.
  • Understanding of monitoring programs.
  • Understanding of how to achieve and leverage network-wide visibility tools and data, in both virtual and physical network environments (cloud experience a plus).
  • System admin/team lead level of knowledge and experience supporting Enterprise Linux (server and desktop) and Windows environments.
  • Working knowledge of general computer processes.

Preferred Competencies

  • Effective time-management, organizational, and analytical skills.
  • Accomplished written and verbal communication skills.
  • Excellent decision-making and problem-solving skills.
  • Outstanding listening skills.
  • Strong interpersonal and people skills.
  • Prioritize multiple projects and independently follow through with detail for project assignments.
  • Communicate technical information to a non-technical audience in a clear and coherent manner.
  • Empathize and focus on client service.
  • Handle confidential material and work with personnel in a tactful and courteous manner.
  • Must be customer friendly and willing to be a strong customer advocate.
  • Demonstrated ability to act on own initiative.

Working Conditions

  • Standard office environment.
  • Hybrid work schedule.
  • On-call responsibilities.
  • Must be able to work after hours and on-call.
  • Available for minimal travel.

Application Documents

  • Resume/CV (required)


When applying, the document(s) MUST be uploaded via the My Experience page, in the section titled Application Documents of the application.


Job Family

Information Technology


Role Impact

Individual Contributor


FLSA Status

Exempt


Pay Frequency

Monthly


Scheduled Weekly Hours

37.5


Benefits Eligible

Yes


Drug Test Required

No


Health Screen Required

No


Motor Vehicle Record Inquiry Required

No


Posting Statement

The University of Chicago is an Affirmative Action/Equal Opportunity/Disabled/Veterans Employer and does not discriminate on the basis of race, color, religion, sex, sexual orientation, gender, gender identity, national or ethnic origin, age, status as an individual with a disability, military or veteran status, genetic information, or other protected classes under the law. For additional information please see the University's Notice of Nondiscrimination.

Staff job seekers in need of a reasonable accommodation to complete the application process should call 773-702-5800 or submit a request via the Applicant Inquiry Form.

We seek a diverse pool of applicants who wish to join an academic community that places the highest value on rigorous inquiry and encourages a diversity of perspectives, experiences, groups of individuals, and ideas to inform and stimulate intellectual challenge, engagement, and exchange.

All offers of employment are contingent upon a background check that includes a review of conviction history. A conviction does not automatically preclude University employment. Rather, the University considers conviction information on a case-by-case basis and assesses the nature of the offense, the circumstances surrounding it, the proximity in time of the conviction, and its relevance to the position.

The University of Chicago's Annual Security & Fire Safety Report (Report) provides information about University offices and programs that provide safety support, crime and fire statistics, emergency response and communications plans, and other policies and information. The Report can be accessed online at: http://securityreport.uchicago.edu. Paper copies of the Report are available, upon request, from the University of Chicago Police Department, 850 E. 61st Street, Chicago, IL 60637.