logo

View all jobs

IT Resiliency Analyst

White Plains, NY · Information Technology
We are looking for IT Resiliency Analyst with responsibilities for ensuring an organization's IT infrastructure and applications can withstand disruptions, recover quickly, and maintain continuous operations. They focus on disaster recovery planning, business continuity, risk assessment, and implementing strategies to enhance the overall resilience of IT systems. This role involves collaboration with IT, security, and business teams to identify vulnerabilities, design recovery solutions, and conduct testing to ensure readiness in the event of an outage or disaster.
Core Responsibilities
  1. Disaster Recovery (DR) Planning and Management
    • Develop and maintain disaster recovery plans for critical IT systems and applications.
    • Define recovery objectives, such as Recovery Time Objectives (RTO) and Recovery Point Objectives (RPO), to align with business requirements.
    • Coordinate with IT teams to implement recovery solutions, such as backup systems, failover sites, and redundant infrastructure.
  2. Business Continuity Planning (BCP) Support
    • Work closely with business units to assess continuity requirements and develop strategies for maintaining critical operations during disruptions.
    • Develop business continuity plans that align with IT recovery capabilities and ensure seamless resumption of business activities.
    • Assist in the documentation and updating of BCP procedures to keep plans current with changes in business and IT environments.
  3. Risk Assessment and Impact Analysis
    • Conduct risk assessments to identify vulnerabilities and potential failure points in the IT infrastructure.
    • Perform Business Impact Analysis (BIA) to evaluate the effects of potential disruptions on critical processes.
    • Identify high-risk areas and prioritize resources and efforts to address critical vulnerabilities.
  4. Testing and Validation of DR and BCP Plans
    • Organize and conduct regular disaster recovery and business continuity tests, including tabletop exercises, simulations, and full-scale tests.
    • Evaluate test results, document lessons learned, and make recommendations for improvements.
    • Collaborate with stakeholders to remediate issues discovered during testing and update recovery plans as needed.
  5. Incident Response and Recovery Coordination
    • Serve as a key contact in disaster recovery and incident response situations, coordinating response efforts across IT teams.
    • Execute recovery procedures in the event of a disruption, monitoring the recovery process and providing updates to stakeholders.
    • Document incidents and recovery efforts, identifying areas for improvement in processes and procedures.
  6. Data Backup and Recovery Management
    • Monitor and manage data backup processes, ensuring regular and complete backups for critical systems.
    • Verify the integrity and accessibility of backup data, testing recovery procedures to confirm effectiveness.
    • Implement retention policies and procedures for data recovery that comply with regulatory and organizational requirements.
  7. Compliance and Regulatory Adherence
    • Ensure that disaster recovery and business continuity plans comply with industry standards and regulatory requirements, such as ISO 22301, NIST, or GDPR.
    • Maintain documentation and audit trails of testing, changes, and updates to meet compliance standards.
    • Work with internal audit teams to address compliance gaps and improve resiliency posture.
  8. Reporting and Documentation
    • Create and maintain detailed documentation for DR and BCP plans, including recovery steps, communication protocols, and contact lists.
    • Generate reports on resiliency metrics, including RTO/RPO status, backup success rates, and test results.
    • Present findings to management and stakeholders, highlighting key risks, readiness levels, and areas for improvement.
Key Skills
  1. Disaster Recovery and Business Continuity Planning
    • Strong knowledge of disaster recovery methodologies, including backup and restore processes, failover mechanisms, and high availability setups.
    • Experience in developing and maintaining DR and BCP plans for complex IT environments.
    • Familiarity with key concepts like RTO, RPO, and Recovery Levels to set achievable objectives.
  2. Risk Management and Assessment
    • Proficiency in conducting risk assessments to identify vulnerabilities and mitigate potential failures.
    • Ability to perform Business Impact Analysis (BIA) and evaluate risks from an operational and financial perspective.
    • Skills in prioritizing recovery efforts based on risk levels and impact assessments.
  3. Testing and Validation
    • Expertise in planning and executing DR and BCP tests, such as failover simulations, data restoration tests, and emergency response exercises.
    • Analytical skills to assess test results, identify gaps, and document recommendations for improvement.
    • Experience in coordinating with IT teams and business units to carry out tests effectively.
  4. Incident Response and Crisis Management
    • Ability to coordinate response efforts during an IT outage, natural disaster, or cyber incident.
    • Skills in communication and decision-making to facilitate effective and timely recovery actions.
    • Experience documenting incidents and identifying lessons learned for continuous improvement.
  5. Data Backup and Recovery Solutions
    • Familiarity with backup and recovery tools and technologies, including snapshot backups, tape backups, and cloud-based recovery solutions.
    • Knowledge of best practices for data retention, offsite storage, and data recovery testing.
    • Proficiency in verifying backup success rates, data integrity, and recovery point compliance.
  6. Regulatory Knowledge and Compliance
    • Understanding of industry standards and regulations for business continuity, such as ISO 22301, NIST, SOX, or GDPR.
    • Ability to ensure DR and BCP plans comply with regulatory requirements and prepare for audits.
    • Skills in documenting changes and maintaining compliance records to demonstrate resiliency readiness.
  7. Communication and Collaboration Skills
    • Strong communication skills to work with technical teams, business units, and executive management.
    • Ability to translate complex technical concepts into business-friendly language for stakeholders.
    • Skills in creating documentation, reports, and presentations to communicate resiliency objectives and progress.




If you are interested in getting more information about this opportunity, please contact Irina Rozenberg  Recruiting@arielpartners.com at your earliest convenience.
 
At Ariel Partners, we solve the most difficult problems that inhibit technology from enabling our customers to achieve their goals. Our vision is to be recognized by our stakeholders as an elite provider of IT solutions, so when they have their biggest challenges, we are on their short list. We are looking for team members who share our values of: Integrity to do the right thing even when it hurts; Commitment to the long-term success and happiness of our customers, our people, and our partners; Courage to take on difficult challenges, accept new ideas, and accept incremental failure; and the constant pursuit of Excellence. Ariel Partners is an Equal Opportunity Employer in accordance with federal, state, and local laws.


 

Share This Job

Powered by