Disaster Recovery Manager


Job Details


Job Description

Computer World Services Corp (CWS) is seeking an exceptional candidate to serve as the Disaster Recovery (DR) Manager for the National Institutes of Health (NIH) Center for Information Technology (CIT) Operations Management Services (OMS) project. CIT requires support for information technology (IT) service monitoring and continuous improvement of centralization and consolidation efforts to improve the quality of monitoring IT services, and to increase event management efficiencies in creating meaningful alerts leading to identification and resolution of incidents and root causes of problems in an expedited manner. The 24x7x365 IT operations teams are consolidated to create the centralized Operations Management Services (OMS) Monitoring Team Technology Operations Center (TOC) for all CIT services. Currently, the OMS Monitoring Team operates and maintains CIT monitoring tools including SL1, SCOM, and SiteScope, and utilizes xMatters/Everbridge to automatically send out notifications. The overall task includes monitoring approximately 12,000 individual devices that serve approximately 59,000 local area network (LAN) ports.

The Disaster Recovery Manager will play a vital role in ensuring the Continuity of Operations (COOP) and Disaster Recovery (DR) capabilities for OMS. This position requires strong coordination skills, expertise in DR planning, security assessments, and the ability to interface effectively with various stakeholders including CIT Service Areas and the NIH Information Security (InfoSec) team.

Key Tasks and Responsibilities

Manage Continuity of Operations Plan and Disaster Recovery

  • Manage the Continuity of Operations Plan (COOP) and act as the Disaster Recovery (DR) manager for OMS.
  • Coordinate with other CIT Service Areas on DR planning and develop/maintain a comprehensive DR Plan for OMS critical IT services.
  • Review existing documentation and processes, perform security assessments of OMS systems, and interface with the NIH InfoSec team.
  • Interface with Service Areas to ensure up-to-date DR plans and reflect modifications to CIT OMS Critical IT services infrastructure.
  • Attend weekly Change Technical Review Board (TRB) and Change Advisory Board (CAB) meetings to review major changes impacting disaster recovery failover setup.

    Coordinate Annual DR Tests and Reports
  • Plan, coordinate, and document annual DR tests for OMS services and facilitate tabletop exercises for critical systems.
  • Coordinate OMS security-related issues such as vulnerabilities, audit response, and coordination with other Service Areas.
  • Provide Remediation Reports and respond to Audits on OMS security vulnerabilities.
  • Collate test results and report findings and develop 'lessons learned' reports with recommended actions to improve CIT Continuity of Operation.

    Coordinate and Manage Resolution of Security Issues
  • Manage the resolution of security issues/vulnerabilities for the CIT infrastructure identified by the NIH InfoSec department.
  • Interface with all CIT Service Areas to coordinate the resolution of issues/vulnerabilities and report progress to the NIH InfoSec department.
  • Escalate issues to CIT Operations Management Services management as necessary.




Education & General Experience

Bachelor's degree in Information Technology, Computer Science, or related field.

Minimum of 5 years of experience in disaster recovery planning and coordination, preferably in a large-scale IT environment.

Strong understanding of IT security principles, practices, and technologies.

Experience with coordinating and conducting DR tests, tabletop exercises, and vulnerability assessments.

Excellent communication skills with the ability to interface effectively with stakeholders at all levels.

Strong analytical and problem-solving skills, with the ability to prioritize tasks effectively.

Proven ability to work both independently and collaboratively within a team environment.

Certifications

ITIL 4 Certification (Preferred)

Certified Business Continuity Professional (CBCP) or equivalent certification preferred

Security Clearance

Public Trust Moderate (Tier 2)

Other (Travel, Work Environment, DoD 8570 Requirements, Administrative Notes, etc.)

May require sitting/standing for extended periods and performing tasks involving bending, stooping, and reaching.

May require lifting and carrying heavy equipment.

Computer World Services is an affirmative action and equal employment opportunity employer. Current employees and/or qualified applicants will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, disability, protected veteran status, genetic information or any other characteristic protected by local, state, or federal laws, rules, or regulations.

Computer World Services is committed to the full inclusion of all qualified individuals. As part of this commitment, Computer World Services will ensure that individuals with disabilities (IWD) are provided reasonable accommodations. If reasonable accommodation is needed to participate in the job application or interview process, to perform essential job functions, and/or to receive other benefits and privileges of employment, please contact Aaron McClellan in Human Resources at
314.###.####
or





 Computer World Services (CWS)Corporation

 04/29/2024

 Bethesda,MD