Job Details
Experience Needed:
Career Level:
Education Level:
Salary:
Job Categories:
Skills And Tools:
Job Description
- Team Leadership and Management:
- Leading, mentoring, and developing a team of NOC engineers and technicians.
- Managing staffing, scheduling (including 24/7 coverage), and performance of the NOC team.
- Fostering a culture of continuous learning, collaboration, and proactive incident management.
- Conducting training sessions and workshops to keep the team updated on tools, technologies, and best practices.
- Incident and Problem Management:
- Overseeing 24/7 monitoring, incident response, and service restoration activities.
- Implementing, overseeing, and coordinating incident detection, response, resolution, and escalation procedures.
- Ensuring incidents are resolved in a timely manner to minimize downtime and impact on services and customers.
- Leading Major Incident Management efforts, coordinating cross-functional teams to resolve high-priority incidents swiftly.
- Conducting root cause analysis (RCA) for major incidents and implementing preventive measures to reduce recurrence (Problem Management).
- Operational Excellence and Process Improvement (ITIL Focus):
- Developing, implementing, and continuously improving operational procedures and protocols, aligned with ITIL best practices (e.g., Incident Management, Problem Management, Change Management, Service Request Fulfillment, Availability Management, Capacity Management).
- Identifying opportunities for automation, process optimization, and cost reduction within NOC functions.
- Driving the adoption of automation tools and technologies to streamline service management processes.
- Ensuring the ITSM toolset is effectively utilized to manage workflows, track incidents, and generate actionable insights.
- Monitoring and Reporting:
- Implementing and maintaining monitoring tools and dashboards to track the performance, availability, and security of the network and IT infrastructure.
- Generating and analyzing reports on network performance, incidents, and operational metrics.
- Providing insights and recommendations to senior management for service enhancement and operational efficiency.
- Monitoring and managing SLAs, KPIs, and operational metrics to ensure service quality and performance targets are met.
- Collaboration and Vendor Management:
- Collaborating with other departments (Cloud Engineering, Security, Customer Support, etc.) to ensure seamless operations and effective communication during incidents.
- Liaising with third-party service providers and vendors to ensure the quality and reliability of external services and support.
- Compliance and Security:
- Ensuring all operations adhere to industry standards, regulatory requirements, and company policies.
- Monitoring the network for security threats and taking action to mitigate them.
Job Requirements
- Education: Bachelor's degree in Information Technology, Computer Science, Engineering, Networking, or a related field.
- Experience:
- Minimum of 5-10+ years of experience in network operations or IT service management, with at least 2-5+ years in a managerial or supervisory role, preferably within a 24/7 operations environment.
- Proven experience leading a large-scale service delivery organization is highly desirable.
- Strong understanding and practical experience in implementing ITSM processes.
- Experience with business process improvement methodologies (e.g., Six Sigma) is a plus.
- Demonstrated experience in leading ITSM solutions, process and teams.
- Certifications:
- ITIL Certification (required): ITIL Foundation is essential; advanced ITIL certifications (e.g., ITIL Expert, ITIL v4 Managing Professional) are highly advantageous.
- Other relevant certifications (e.g., CCNA, PMP, CompTIA Network+, Security+) are beneficial.