Available for new opportunities

Luis Afonso IT Service Delivery Manager

IT Service Delivery & Operations Manager with 10+ years of experience leading IT teams, driving ITSM transformation, and managing complex multi-vendor environments across the cultural, professional services, and technology sectors. Specialist in ITIL, Microsoft 365, Azure, cybersecurity and service management.

Career at a glance
10+
Years in IT
MSc
Telecoms & CS
5+
Certifications
Core technologies
Windows & Server macOS & iOS Office 365 Azure Admin PowerBI Intune / MDM ITIL Prince2 ServiceNow Freshservice Jira SolarWinds PCI DSS Cyber Essentials Rapid7 Sophos Mimecast Qualys PowerShell Python Java Cloudflare DigiCert / SSL Bluecoat CodeTwo SharePoint PowerApps
Current role
Service Delivery Manager
National Portrait Gallery, London (2023–2026)
Leading IT service operations, ITIL processes, cybersecurity, cloud transformation and strategic planning.

Bridging technology
and people

I'm a highly motivated IT Manager with a decade of experience across cultural institutions, professional services and consulting in London. My work sits at the intersection of IT governance, cloud infrastructure, security and service excellence.

From managing helpdesks and security operations at the Museum of London to leading IT service delivery at the National Portrait Gallery, I've built teams, driven automation and ensured compliance across complex environments.

My academic background in Telecommunications and Computer Science (ISCTE, Lisbon) gives me a technical edge — I also have a published IEEE paper on UAV communications. I hold certifications including Prince2 Practitioner and ITIL, and I'm passionate about leveraging cloud solutions and AI to drive continuous improvement.

  • 🤖
    Google AI Professional
    Google · 2026
  • 🏆
    Prince2 Practitioner & Foundation
    QA · 2024
  • 📡
    Amateur Radio Intermediate
    RSGB · 2020
  • ⚙️
    ITIL Continual Service Improvement
    2019
  • 🍎
    Jamf 200 — Apple Management
    Amsys, London · 2018
  • ⚙️
    ITIL Foundation
    BCS (QA) · 2017
  • 🎓
    MSc · Telecommunications & Computer Science
    ISCTE – University Institute of Lisbon · 2015

Where I've worked

A decade of progression across service management, security operations and infrastructure delivery.

Feb 2023 — Mar 2026
Service Delivery Manager
National Portrait Gallery · London
  • Lead IT service delivery operations, ensuring alignment with strategic goals, OLAs/SLAs and KPIs
  • Manage IT support and infrastructure teams, vendor relationships, and recruitment
  • Drive ITIL processes and service improvements through automation, cybersecurity and cloud solutions
  • Strategic planning, budget forecasting and reporting with PowerBI
  • Cyber incident response and compliance oversight
Nov 2021 — Jan 2023
ICT Helpdesk Manager
Museum of London · London
  • Led and developed a high-performing ICT Helpdesk team, driving service excellence
  • Managed incident response across enterprise tools and environment
  • Coordinated IT operations governance, including budget management, procurement, vendor and contract management
  • Supported strategic projects across the organisation
Aug 2020 — Nov 2021
ICT Helpdesk Technician
Museum of London · London
  • 1st, 2nd and 3rd line support across three sites, up to 400 users
  • Azure AD, Intune, Office 365 and Exchange administration
  • Windows 10, macOS, hybrid environment support
Dec 2016 — Jun 2020
Senior Technical Analyst
Deloitte LLP · London
  • 2nd and 3rd line support in a high-pace corporate environment
  • macOS / Jamf champion; ServiceNow incident, asset and change management
  • AV events, major incident, VIP and escalation support
  • Mentoring and supervising small teams
Mar 2015 — Sep 2016
Infrastructure System Technician
Everis Consultancy Ltd · London
  • 1st and 2nd line support including site management, MDM and AV systems
  • Assisted with multinational projects; BMC Remedy incident support

Skills & expertise

A broad skill set spanning infrastructure, security, governance and leadership.

☁️
Microsoft 365 & Azure
Administration of M365 tenants, Azure AD, Intune/Endpoint Manager, Exchange Online, SharePoint, Teams and Azure Security Centre.
🛡️
Cybersecurity & Compliance
Cyber Essentials, PCI DSS, EDR, MDR, SIEM, email filtering, web filtering, vulnerability management, disaster recovery, threat detection and response.
🤖
AI in IT Service Management
Leveraging AI tools to enhance service delivery and automation: ServiceNow AI, Freshservice AI and Claude for intelligent ticket routing, knowledge management and workflow optimisation.
⚙️
ITIL & Service Management
ITIL Foundation & Continual Service Improvement certified. SLA/OLA governance, incident and change management, service desk leadership. Tools: ServiceNow, Jira, SolarWinds, Freshservice.
📊
Project Management
Prince2 Practitioner & Foundation. Infrastructure rollouts, system migrations, vendor onboarding and cross-functional project delivery.
📱
Mobile Device Management
End-to-end MDM for iOS and Android using Microsoft Intune and Jamf. Device enrolment, policy enforcement, app deployment and compliance reporting across hybrid fleets.
📈
Reporting & Analytics
PowerBI dashboards for IT KPIs, budget forecasting and service performance. Data-driven decision making for continual improvement.
💻
Scripting & Automation
Automating IT operations and reducing manual toil with PowerShell, Bash and Python — covering deployments, reporting, user provisioning and system maintenance.
📡
Monitoring & Performance
Proactive infrastructure and end-user experience monitoring using Nagios, Zabbix, ControlUp and Nexthink — enabling real-time visibility and rapid incident detection.

Service maturity
& performance

Measured through modern engineering frameworks — DORA, SRE reliability targets and XLA human-centred experience metrics — not just ticket counts.

Deployment Frequency ELITE
0
Deploy Frequency📈 KPI
Average deployments per week across managed services. Elite threshold: on-demand / multiple per day.
↑ Elite tier
Lead Time for Changes ELITE
0
Lead Time📈 KPI
Time from code commit to production. Elite: less than one hour. High: less than one day.
↑ High performer
Change Failure Rate ELITE
0
Change Failure Rate📈 KPI
Percentage of deployments causing service degradation. Elite target: 0–5%. Industry average: 15–45%.
↓ Well below avg
Mean Time to Restore HIGH
0
MTTR📈 KPI
Average time to restore service after an incident. Elite: less than one hour. High: less than one day.
↑ High performer
DORA Performance Band
Overall DORA maturity assessed across all four key metrics. Elite band requires excellence in deployment frequency, lead time, failure rate and MTTR.
90% Score
Deploy FreqElite
Lead TimeHigh
Change FRElite
MTTRHigh
Change Success by Type
Deployment success rates segmented by change category — standard, emergency, and normal changes over the past 12 months.
Standard Changes99%
Normal Changes97%
Emergency Changes91%
Rollback Rate3%
Deployment Frequency (6 months)
Monthly deployment count showing consistent delivery cadence and improvement trajectory over the past half year.
Service Availability SRE
0
SLA Uptime📈 KPI
Measured availability across all production services. Target SLA 99.9% — current exceeds with 5h error budget remaining.
↑ Exceeds SLA
Error Budget Remaining SRE
0
Error Budget⚠️ KRI
Percentage of the monthly error budget still available. Enables reliable feature development while protecting reliability targets.
↑ Healthy buffer
P1 Incident Rate SRE
0
P1 Frequency⚠️ KRI
Average critical incidents per month. Sustained reduction driven by proactive monitoring, chaos engineering and automated alerting.
↓ 62% reduction
Toil Reduction SRE
0
Toil Eliminated🛡️ KCI
Percentage of manual repetitive operations automated. SRE target: keep toil below 50% of engineering time through continuous automation.
→ Target: 50%
SLO Compliance by Service
Service Level Objective compliance across the four core service tiers. Measured monthly against agreed reliability targets.
95% Avg SLO
Tier 1 Critical99.9%
Tier 2 High99.5%
Tier 3 Medium98%
Tier 4 Low95%
MTTR by Incident Priority
Mean time to restore broken down by incident severity. Tracks against SRE-aligned response and resolution objectives.
P1 · Critical (<1h target)52min
P2 · High (<4h target)1.8h
P3 · Medium (<8h target)3.2h
Runbook Coverage84%
Incident Volume Trend (6 months)
Total incidents by month showing sustained downward trend driven by proactive monitoring and automation investments.
CSAT Score XLA
0
Customer Satisfaction📈 KPI
Overall CSAT across all IT support interactions. XLA target: 85%+. Exceeding benchmark by 6 percentage points.
↑ +6% vs target
Net Promoter Score XLA
0
IT NPS📈 KPI
Internal Net Promoter Score — how likely users are to recommend IT services. Excellent: 50+. Industry average: +35.
↑ Excellent
First Contact Resolution XLA
0
FCR Rate📈 KPI
Tickets resolved without escalation or re-contact. XLA benchmark: 70–75%. Current performance 12 points above benchmark.
↑ +12% vs bench
Experience Score XLA
0
Digital EX Score📈 KPI
Composite digital employee experience score from device performance, app reliability and support quality. Measured via Nexthink.
↑ Above 8.0 target
CSAT Satisfaction Breakdown
Distribution of satisfaction ratings across all IT interactions over the past 30 days. XLA focus: maximise Excellent + Good, drive Neutral and Poor toward zero.
91% CSAT
Excellent78%
Good13%
Neutral6%
Poor3%
XLA Dimensions Score
Employee experience scored across five XLA dimensions: responsiveness, empathy, knowledge, resolution quality and proactivity.
Resolution Quality94%
Responsiveness91%
Technical Knowledge89%
Empathy & Communication88%
Proactive Outreach81%
NPS Trend (6 months)
Internal IT Net Promoter Score by month. Sustained growth reflects continuous service improvements and XLA programme investment.
First Call Resolution FCR
0
FCR Rate📈 KPI
Tickets fully resolved on first contact without escalation or callback. Industry benchmark: 70–75%.
↑ +12% vs benchmark
Avg Handle Time AHT
0
Avg Handle Time📈 KPI
Mean time spent per ticket from open to resolution. Reduction driven by AI-assisted triage and knowledge base improvements.
↓ 18% improvement
SLA Compliance SLA
0
SLA Compliance🛡️ KCI
Percentage of tickets resolved within agreed SLA windows across all priority tiers. Target: 98%.
↑ Exceeds 98% target
Ticket Reopen Rate QUALITY
0
Reopen Rate🛡️ KCI
Tickets reopened after closure. Lower is better — industry average 5–8%. Reflects resolution quality and root cause identification.
↓ Below 5% benchmark
SLA Performance by Priority
Resolution compliance across all four priority tiers. P1 critical incidents tracked against a 4-hour resolution target.
99% Overall
P1 · Critical98%
P2 · High99%
P3 · Medium99.5%
P4 · Low100%
Ticket Volume by Channel
Distribution of inbound support requests by channel. Self-service adoption is a key continual improvement target to reduce agent load.
Self-Service Portal38%
Email28%
Phone20%
Chat / Teams14%
Monthly Ticket Volume (6 months)
Total tickets logged per month. Reduction trend reflects improved self-service adoption, proactive monitoring and knowledge base quality.
Ticket Backlog AGEING
0
Open Backlog⚠️ KRI
Total open tickets outside SLA window. Ageing split: <3 days 68% · 3–7 days 22% · 7+ days 10%.
→ Target: <40
Cost Per Ticket EFFICIENCY
0
Cost Per Ticket📈 KPI
Fully-loaded cost per resolved ticket including staffing, tooling and overhead. Industry average: £18–£25. AI triage driving savings.
↓ 22% below avg
Escalation Ratio QUALITY
0
Escalation Rate⚠️ KRI
Percentage of tickets escalated from L1 to L2/L3. Benchmark: <15%. Reduction driven by knowledge base improvements and agent coaching.
↓ Below 15% target
Repeated Incidents PROBLEM MGMT
0
Recurrence Rate⚠️ KRI
Percentage of incidents that recur within 30 days for the same root cause. Drives problem management and permanent fix prioritisation.
↓ Down from 11%
Automation & Self-Service
Breakdown of ticket deflection and automation success. Self-service adoption and AI resolution are key levers for reducing cost-per-ticket and improving experience.
65% Deflected
Self-Service38%
AI Resolution27%
Automation Rules18%
Agent Handled17%
AI Resolution & Business Impact
AI-assisted resolution rate by category, and business downtime minutes prevented through proactive monitoring and automated remediation.
AI Resolution Rate27%
AI Triage Accuracy91%
Downtime Prevented (auto-heal)840min
Business Downtime (actual)42min
Downtime vs Target (<60min/mo)
Backlog Ageing Breakdown
Open ticket distribution by age band. Tickets aged 7+ days are escalated to a dedicated backlog clearance process with daily management review.
0
Active Projects
Across all portfolios
0
On-Time Delivery
Last 12 months
0
On-Budget Rate
Within 10% variance
0
Stakeholder Sat.
End-of-project survey
0
Benefits Realised
vs Business Case
📋
Business Case
95%
🏛️
Organisation
92%
🗺️
Quality
89%
📅
Plans
91%
⚠️
Risk
84%
🔄
Change
88%
📊
Progress
93%
Project
Overall
Schedule
Budget
Scope
Risk
M365 Tenant Migration
Infrastructure · Q1 2026
On Track
+2d
-3%
Locked
Medium
ITSM Platform Upgrade
Service Mgmt · Q2 2026
On Track
On time
+1%
Change CR
Low
Cyber Essentials Plus
Security · Q2 2026
At Risk
+8d
On budget
Stable
High
Network Infrastructure Refresh
Infrastructure · Q3 2026
On Track
On time
+6%
Locked
Low
AI Helpdesk Automation
Innovation · Q3 2026
Planning
TBD
TBD
In draft
Medium
Portfolio Delivery Score
Composite score across on-time, on-budget, scope integrity and benefits realisation — weighted equally across all closed projects in the past 12 months.
90% Score
On Time87%
On Budget92%
Scope Integrity88%
Benefits96%
Risk Register — Open Risks by Category
Distribution of open project risks across PRINCE2 risk categories. Risk appetite maintained within agreed tolerance thresholds.
Technical / Infrastructure34%
Resource & Capacity28%
Supplier & Third Party22%
Scope / Change16%
Projects Completed (6 months)
Monthly project completions with successful handover to BAU operations, tracked against the agreed project roadmap.
Indicator Types
📈 KPI
Key Performance Indicator

Measures how well objectives are being achieved — the "how are we doing?" metric.

⚠️ KRI
Key Risk Indicator

Early warning signals that a risk threshold is being approached — the "what could go wrong?" metric.

🛡️ KCI
Key Control Indicator

Measures the effectiveness of a control — the "are our safeguards working?" metric.

Indicator
Type
Current
Target / Threshold
Status
Framework
SLA Compliance Rate
% tickets resolved within agreed SLA window
📈 KPI
99.2%
Target ≥ 98%
Red < 95%
● Exceeding
ITIL 4 / ISO 20000
First Call Resolution
% tickets resolved without escalation
📈 KPI
87%
Target ≥ 75%
Red < 60%
● Exceeding
HDI / ITIL 4
Customer Satisfaction (CSAT)
Composite satisfaction score across all interactions
📈 KPI
91%
Target ≥ 85%
Red < 75%
● Exceeding
XLA / ISO 20000
Net Promoter Score (NPS)
Internal IT NPS — likelihood to recommend
📈 KPI
+62
Target ≥ +50
Red < +20
● Exceeding
XLA Framework
Cost Per Ticket
Fully-loaded cost per resolved ticket
📈 KPI
£14.20
Target ≤ £18
Red > £25
● Exceeding
FinOps / ITIL
DORA Deployment Frequency
Weekly deployments across managed services
📈 KPI
4.2/wk
Elite ≥ multiple/day
Red < 1/month
● Elite band
DORA / DevOps
Risk Indicator
Type
Current
Amber / Red Threshold
Status
Owner
P1 Incident Rate
Critical incidents per month — leading indicator of reliability degradation
⚠️ KRI
0.3/mo
Amber ≥ 2/mo
Red ≥ 4/mo
● Low Risk
Service Delivery Mgr
SRE Error Budget Burn Rate
Rate at which monthly error budget is being consumed
⚠️ KRI
27%
Amber ≥ 50% burn
Red ≥ 80% burn
● Low Risk
SRE Lead
Escalation Ratio
L1 to L2/L3 escalation rate — signals knowledge gaps or staffing issues
⚠️ KRI
8.4%
Amber ≥ 15%
Red ≥ 25%
● Low Risk
Service Desk Manager
Ticket Backlog (7d+ aged)
Tickets breaching 7-day ageing — signals capacity or process failure
⚠️ KRI
5 tickets
Amber ≥ 5
Red ≥ 15
● Amber Watch
Service Delivery Mgr
Repeated Incident Rate
Incidents recurring within 30 days — signals ineffective root cause analysis
⚠️ KRI
6.2%
Amber ≥ 10%
Red ≥ 20%
● Low Risk
Problem Manager
Business Downtime
Monthly unplanned service downtime minutes — direct business impact indicator
⚠️ KRI
42 min
Amber ≥ 60 min
Red ≥ 240 min
● Low Risk
Service Delivery Mgr
Control Indicator
Type
Score
Effectiveness Target
Status
Framework
Change Success Rate
Effectiveness of change management controls — % changes with no adverse impact
🛡️ KCI
97%
Effective ≥ 95%
Failing < 85%
● Effective
ITIL 4 / PRINCE2
Ticket Reopen Rate
Effectiveness of resolution quality controls — low reopen = good root cause fix
🛡️ KCI
3.1%
Effective ≤ 5%
Failing > 15%
● Effective
ITIL 4 / HDI
Toil Elimination Rate
Effectiveness of automation controls — % of repetitive tasks eliminated via automation
🛡️ KCI
42%
Effective ≥ 50%
Failing < 20%
● Improving
SRE / Google
Runbook Coverage
% of P1/P2 incident types covered by documented runbooks — response readiness control
🛡️ KCI
84%
Effective ≥ 80%
Failing < 50%
● Effective
SRE / ISO 20000
PRINCE2 Theme Compliance
Average maturity score across the 7 PRINCE2 themes — project governance control
🛡️ KCI
90%
Effective ≥ 80%
Failing < 60%
● Effective
PRINCE2 Practitioner
AI Triage Accuracy
Effectiveness of AI classification controls — correct routing on first attempt
🛡️ KCI
91%
Effective ≥ 85%
Failing < 70%
● Effective
AI Governance
📊
DORA Metrics
DevOps Research
The four key metrics from Google's DevOps Research & Assessment programme — measuring software delivery performance through deployment frequency, lead time, change failure rate and MTTR.
Aligns with: COBIT APO · ISO/IEC 20000
⚙️
SRE Principles
Site Reliability Engineering
Google's SRE framework applied to IT operations — using SLOs, error budgets and toil reduction to balance reliability with the pace of innovation across managed services.
Aligns with: COBIT DSS · ISO/IEC 20000-1
🧭
XLA Mindset
Experience Level Agreements
Moving beyond SLA compliance to measure what matters — the human experience of IT. XLAs capture how users feel about services through CSAT, NPS, digital experience scores and outcome-based measurement.
Aligns with: COBIT BAI · ISO/IEC 20000-1 §8
📘
ITIL 4
IT Infrastructure Library v4
The globally recognised IT service management framework. ITIL 4 introduces the Service Value System, four dimensions model and 34 management practices — underpinning SLA governance, incident, change and problem management across all metrics shown.
Aligns with: COBIT DSS02–DSS06 · ISO/IEC 20000-1 full scope
🏆
PRINCE2 Practitioner
Project Management
Projects IN Controlled Environments — a structured, scalable project management methodology. PRINCE2 Practitioner certified, applying its seven principles, themes and processes to deliver consistent, governed and business-case-driven outcomes.
Aligns with: COBIT BAI01 · ISO/IEC 20000-1 §6

AI-curated insights across AI, cybersecurity, Azure & Microsoft 365, and IT service management — the topics that matter most in modern IT.

AI-powered news — updated on demand

Get in touch

Open to new opportunities, collaborations or just a conversation about IT strategy.