Skip to content

Scout AI System Profile — Worked Example

Project: Sable AI Ltd — AI Governance Framework Stage: Phase 2 — Worked Examples Status: Draft Version: v1 Date: 2026-03-01 Assumptions: Built on outline assumptions — not verified against real Sable AI Ltd data


How to Use This Document

This is the worked example version of L1-2.1-AI-System-Inventory-v1.md. Where the template provides field headings and instructions, this document provides the specific technical content for Scout as assumed for this framework.

Every field that relies on an unverified assumption is marked [ASSUMPTION: A-XXX]. Before using this profile operationally, all assumptions must be validated against real Sable AI Ltd data and reviewed by qualified legal counsel.

Cross-references: L1-2.1-AI-System-Inventory-v1.md (inventory template) | L1-2.3-Data-Flow-Map-v1.md (data flow template) | P2-Scout-UK-GDPR-Mapping-v1.md (GDPR obligations worked example)


1. System Overview

Field Value
System name Scout
System owner Sable AI Ltd
Version Production (current as at 2026-03-01)
Purpose AI-assisted CV screening and candidate shortlisting. Analyses candidate CVs against job descriptions and produces a ranked shortlist with match scores, matched/unmet criteria, human-review flags, and a one-paragraph candidate summary.
AI type Prompt-engineered AI skill — not a custom ML model. Scout is a structured wrapper around the Anthropic Claude API. No proprietary model training, no fine-tuning, no model weights owned by Sable AI Ltd.
Underlying model claude-sonnet-4-6 via Anthropic Claude API (commercial API access) [ASSUMPTION: A-002]
Model ownership Anthropic, PBC (San Francisco, CA, USA). Sable AI Ltd owns the system prompt, workflow orchestration, and user interface — not the underlying AI model.
Hosting AWS eu-west-2 (London region) throughout [ASSUMPTION: A-004]. All compute, storage, and database resources are UK-region.
Sub-processors Anthropic, PBC — processes extracted CV text and job descriptions on every API call [ASSUMPTION: A-005]. AWS — compute (EC2/Fargate), storage (S3), database (RDS), gateway (API Gateway + WAF).
Data subjects Job candidates whose CVs are uploaded to Scout for screening
Customer segments (1) UK recruitment agencies — screening candidates on behalf of employer clients [ASSUMPTION: A-003]; (2) UK in-house HR teams — direct hire screening [ASSUMPTION: A-003]
Intended user Recruiter or HR professional at a customer organisation. Scout is a B2B tool; candidates do not interact with Scout directly.
Human review requirement Mandatory. No candidate is contacted before a recruiter has reviewed and approved the AI's assessment [ASSUMPTION: A-007]. Human review is built into the Scout workflow and enforced at the UI level.
Risk classification High — see L1-2.2-Risk-Classification-Framework-v1.md for full classification rationale
DPIA required Yes — ICO confirms AI in recruitment triggers Art. 35 DPIA. Template at L2-3.4-DPIA-Template-v1.md. Deployment-specific DPIA not yet completed [ASSUMPTION: A-021].
Last review date 2026-03-01 (initial version — framework document)
Next review due 2026-09-01 [ASSUMPTION: A-023 — review cycle not yet confirmed]

2. Architecture Note

Scout is not a custom AI model. This distinction has significant compliance implications.

Scout operates as a prompt-engineered skill layered on top of the Anthropic Claude API — analogous to a structured use of Claude for a specific domain task. The AI capability (language understanding, reasoning, structured output) is provided entirely by Anthropic's model infrastructure. Sable AI Ltd provides:

  • The system prompt (defines Scout's role, scoring rubric, output schema, and operating constraints — a Sable AI Ltd trade secret)
  • Backend orchestration (document parsing, API call construction, output storage)
  • Recruiter-facing UI
  • Workflow logic (upload → screen → review → decision)

Compliance implications of this architecture:

Implication Detail
Candidate data reaches Anthropic on every API call Extracted CV text (including PII) is sent to Anthropic's API as the user message payload. There is currently no pre-processing step to strip PII before transmission. This is a compliance gap against UK GDPR Art. 5(1)(c) (data minimisation).
No control over model internals Sable AI Ltd cannot audit, inspect, or modify Anthropic's model weights or training data. Prompt engineering is the sole mechanism for bias control and output structuring at Sable AI Ltd's disposal.
No model training on candidate data Anthropic processes candidate data in-context for API responses only. Candidate data is not used to train or fine-tune any model [ASSUMPTION: A-005]. A Data Processing Agreement with Anthropic is assumed to be in place [ASSUMPTION: A-005].
International transfer Anthropic is a US-incorporated company. API calls may traverse to US infrastructure. The applicable UK international transfer mechanism has not been confirmed [ASSUMPTION: A-027].

3. Technical Stack

Layer Component Detail
AI core Anthropic Claude API Model: claude-sonnet-4-6. System prompt + structured user message per screening request. No fine-tuning. [ASSUMPTION: A-002]
Backend Python / FastAPI API server handling document parsing, Anthropic API orchestration, output storage, and recruiter-facing API endpoints [ASSUMPTION: A-030]
Document parsing pdfplumber (PDF) + python-docx (DOCX) Extracts plain text from candidate CV files server-side before sending to Claude. Raw CV files are not forwarded to Anthropic — only extracted text is sent. [ASSUMPTION: A-030]
Frontend React (TypeScript) Recruiter web UI: upload job descriptions and CVs; view ranked shortlist; read AI summaries; log review decisions; view raw AI output JSON [ASSUMPTION: A-030]
Database PostgreSQL on AWS RDS Stores job postings, candidate records, AI output JSON, recruiter decisions, and audit logs. All data in eu-west-2. [ASSUMPTION: A-004] [ASSUMPTION: A-030]
File storage AWS S3 Stores raw CV documents (PDF/DOCX). Encrypted at rest (AES-256). Access restricted to backend service role. [ASSUMPTION: A-004] [ASSUMPTION: A-030]
Hosting AWS eu-west-2 throughout All compute, storage, and database resources in UK region [ASSUMPTION: A-004]
API gateway AWS API Gateway + WAF Controls external access to backend; rate limiting; request logging [ASSUMPTION: A-030]

4. End-to-End Screening Workflow

The following describes Scout's processing flow for a single screening run. Each step is numbered for reference against the data flow map in Section 6.

Step Actor Action Data involved
1 Recruiter Creates a new role in Scout UI. Enters or pastes job description (role title, required skills, experience level, responsibilities, nice-to-haves). Job description text
2 Recruiter Uploads one or more CV files (PDF or DOCX). Batch uploads supported. Raw CV files (PDF/DOCX)
3 Scout backend Extracts plain text from each CV using pdfplumber / python-docx. Extraction performed server-side. Raw files stored in S3; extracted text held transiently in memory. Extracted CV text (including PII)
4 Scout backend Constructs an Anthropic API call for each CV: System message — Scout's operating constraints and output schema. User message — job description text + extracted CV text. System prompt; extracted CV text; job description text → all sent to Anthropic
5 Anthropic Claude API Processes the input and returns a structured JSON response per the schema in Section 5. AI output JSON returned to Scout backend
6 Scout backend Stores JSON output in PostgreSQL against the candidate record, with timestamp, model version, and an input hash (for audit). Extracted CV text is not stored in raw form post-call. AI output JSON; audit metadata
7 Scout UI Displays ranked shortlist to recruiter. For each candidate: match score, matched/unmet criteria, flags for human review, one-paragraph summary. Full raw AI output JSON is also accessible. AI output JSON; job data
8 Recruiter Reviews each candidate. Selects to progress, reject, or hold. Logs decision in UI. No candidate is contacted before this step is completed. [ASSUMPTION: A-007] Recruiter decision; assessor notes
9 Scout backend Logs all actions (upload, API call, recruiter decision) to audit log with timestamp and user ID. Audit log entries

5. AI Output Schema

Each Anthropic API call returns a structured JSON object. The schema below is the canonical output format. The system prompt instructs Claude to produce only this format; non-conforming outputs are rejected by the backend.

{
  "candidate_id": "[internal UUID]",
  "job_id": "[internal UUID]",
  "model_version": "claude-sonnet-4-6",
  "processed_at": "[ISO 8601 timestamp]",
  "match_score": 74,
  "score_confidence": "high",
  "matched_criteria": [
    "5+ years Python experience",
    "Experience with REST API design",
    "Prior work in regulated industry"
  ],
  "unmet_criteria": [
    "No evidence of team leadership experience"
  ],
  "flags_for_human_review": [
    "CV contains a gap in employment history 2021–2022 — recruiter to explore at interview"
  ],
  "summary": "Strong technical match for the role. Python and API experience well-evidenced. No leadership experience visible in CV — this may warrant a conversation. Recommend progressing to phone screen.",
  "assessor_notes": null
}

Field notes:

Field Compliance relevance
match_score Numerical output of AI assessment — subject to human review before any candidate action. Not used as a sole decision mechanism.
score_confidence Low-confidence outputs trigger additional human review flags.
flags_for_human_review Surfaced explicitly to recruiter for all aspects Claude cannot confidently assess. Supports Art. 22C right to human review of AI-assisted assessments.
assessor_notes Populated by recruiter post-review. Creates an auditable record of the human decision separate from the AI output.
model_version Records which Claude model version produced this output. Important for reproducibility and audit.

6. Data Flows — What Reaches Each System

The table below maps each data element to the systems that process or store it. This is the specific Scout realisation of the flow described in L1-2.3-Data-Flow-Map-v1.md.

Data element Scout backend (AWS) Anthropic Claude API AWS S3 AWS RDS PostgreSQL
Raw CV file (PDF/DOCX) ✓ received, parsed ✓ stored encrypted (AES-256)
Extracted CV text ✓ transiently (in-memory during processing) ✓ as part of user message ✗ not stored in raw form
Candidate PII (name, email, phone, address) ✓ extracted fields stored ✓ embedded in CV text — full PII reaches Anthropic ✓ extracted fields stored
Job description text ✓ as part of user message ✓ stored
System prompt ✓ as system field
AI output JSON ✓ returned ✓ stored
Recruiter decision + assessor notes ✓ stored
Audit log entries ✓ stored

Key compliance gap — data minimisation: Candidate PII (name, email, phone, address) is embedded in CV text and reaches Anthropic as part of every API call. There is currently no pre-processing step to strip PII before sending to Anthropic. This is a compliance gap against UK GDPR Art. 5(1)(c) (data minimisation principle) and is recorded in the action register at P2-Scout-UK-GDPR-Mapping-v1.md. [ASSUMPTION: A-030 — the absence of a PII-stripping step is assumed as the current implementation state]


7. Retention Schedule (Proposed — Not Legally Confirmed)

[ASSUMPTION: A-022] The following retention periods are proposed for Scout's data lifecycle. They have not been confirmed through a formal retention review or legal sign-off.

Data System Proposed retention period Deletion trigger Status
Raw CV files AWS S3 6 months after role filled or application closed Automated deletion job Not implemented
Extracted CV text Anthropic API (in-context only) Not retained post-call Anthropic processes in-context only [ASSUMPTION: A-005] Assumed — not verified
AI output JSON AWS RDS 6 months after role filled or application closed Automated deletion job Not implemented
Recruiter decisions + notes AWS RDS 6 months after role filled or application closed Automated deletion job Not implemented
Audit logs AWS RDS 2 years Regulatory retention for accountability Not implemented
Job description records AWS RDS Duration of customer contract + 1 year Contract management Not implemented

Action required: Retention periods must be legally confirmed, implemented as automated deletion processes, and communicated to data subjects via the candidate transparency notice (L4-5.2-Candidate-Transparency-Notice-v1.md).


8. System Prompt — Compliance-Relevant Constraints

The full Scout system prompt is a Sable AI Ltd trade secret. The constraints relevant to compliance are described below [ASSUMPTION: A-030].

Instructions given to Claude: - Score candidates against stated job requirements only - Produce output strictly in the JSON schema defined in Section 5 - Flag employment gaps, unexplained career changes, and any aspect of the CV it cannot confidently assess — all such items appear in flags_for_human_review - Recommend human review where score_confidence is "low"

Explicit prohibitions: - Do not consider, reference, or infer from any information that could be a proxy for a protected characteristic under the Equality Act 2010 (age, gender, nationality, religion, disability, pregnancy, race, sexual orientation, marriage/civil partnership) - Do not use candidate name, address, or personal contact details as scoring inputs - Do not produce output outside the defined JSON schema

Known limitations of prompt-level bias control:

Limitation Implication
Implicit associations in model training Claude cannot guarantee that implicit associations in its training data do not influence scoring, even when explicitly instructed not to use protected characteristics.
Proxy correlation The instruction not to use protected characteristics does not prevent their influence where they correlate with other CV features (e.g., institution names, career timelines, naming conventions, geographic indicators).
No demographic monitoring No automated demographic monitoring is in place to detect disparate impact in shortlisting outputs.
No independent audit Prompt-level bias controls have not been independently audited.

These limitations are addressed further in P2-Scout-Bias-Monitoring-Protocol-v1.md.


9. Adaptation Notes for Real Organisations

This worked example is built on assumed Sable AI Ltd data. Before using it as a real system profile, the following must be verified or replaced:

Item Action required
Anthropic model version Confirm current production model. Model versions change; profiles should be updated on model version changes.
Technical stack Confirm actual technology choices (Python/FastAPI, pdfplumber, PostgreSQL, React).
AWS region Confirm all resources are genuinely in eu-west-2 (or update if different).
Anthropic DPA status Confirm a Data Processing Agreement with Anthropic is executed and covers UK GDPR obligations. Confirm the international transfer mechanism.
Human review enforcement Confirm how the "no candidate contact before human review" obligation is technically enforced — not just a policy commitment.
Retention periods Confirm legally reviewed retention periods and implement automated deletion.
System prompt text Review full system prompt for compliance — the summary in Section 8 is not a substitute for a formal prompt review.

[LEGAL REVIEW REQUIRED] This profile should be reviewed by a qualified UK data protection lawyer before being used in an operational context.


This document is a worked example for educational purposes. It is not a legal compliance certification. All assumptions must be verified against real Sable AI Ltd data before operational use.