Somvilla Custom Software & Automation
Live Demo

Automated Document Processing

See what structured data extraction actually looks like in practice. Switch between document types to see how unstructured paperwork becomes clean, usable data — automatically.

Depending on your document type and how consistent its layout is, we pick the approach that fits: Python scripts for predictable formats, AI assistance for variable layouts. You get reliable structured data either way.

Get a quote →

01 — Original Document

NPL
Norwood Parts Ltd
Unit 4, Boucher Road, Belfast BT12 6HR
Tel: 028 9042 1234
INVOICE
NPL-2026-08847

Bill To
Lagan Valley Industrial Supplies Ltd
12 Sprucefield Park
Lisburn BT27 5JH
Invoice Date: 14 March 2026
Due Date: 14 April 2026
PO Reference: LV-PO-2891
Description | Qty | Unit | Total
M8 Hex Bolt S/Steel 316 | 500 | £0.175 | £87.50
Hydraulic Hose 1/2" BSP | 12 | £13.00 | £156.00
Cable Gland M20 Nylon IP68 | 100 | £0.432 | £43.20
BSP Elbow Fitting 3/8" | 50 | £1.356 | £67.80
Subtotal: £354.50
VAT (20%): £70.90
TOTAL: £425.40
Payment Details
BACS within 30 days
Sort: 12-34-56  |  Acc: 87654321

🇬🇧 DVLA — Great Britain Driving Licence
👤
1. Surname, 2. First names
MCALLISTER, James Patrick
3. Date of birth / Nationality
15.01.1978 / GBR
4a–4c. Validity / Issue
12.03.2021 15.01.2031
5. Licence number
MCALL781156JP9AB
8. Address
24 Malone Road, Belfast, BT9 5LT

Belfast City Council
Rates Notice
01 February 2026
Ref: BCC-2026-447821
Issued to:
James Patrick McAllister
24 Malone Road
Belfast BT9 5LT
Proof of address document dated within 3 months

DELIVERY NOTE
Norwood Parts Ltd
DN No: DN-2026-4421
Date: 17 March 2026
PO Ref: LV-PO-2891
Delivered to
Lagan Valley Industrial Supplies Ltd
Goods Inward — Bay 3
Sprucefield Industrial Estate
Lisburn BT27 5JH
Driver / Vehicle
Driver: T. Patterson
Vehicle: MXZ 4821
Time: 09:47
Item | Ordered | Rcvd | Status
M8 Hex Bolt S/Steel 316 | 500 | 500 | ✓ OK
Hydraulic Hose 1/2" BSP | 12 | 12 | ✓ OK
Cable Gland M20 Nylon IP68 | 100 | 100 | ✓ OK
BSP Elbow Fitting 3/8" | 50 | 45 | ⚠ SHORT
Received by
D. McAuley
D. McAuley — 09:47
Norwood Parts Ltd · VAT 123 456 789

02 — Extracted Data

Extracted Fields ✓ 11 fields
Python / pdfplumber
Field Value Conf.
Description | Qty | Total
M8 Hex Bolt S/Steel 316 | 500 | £87.50
Hydraulic Hose 1/2" BSP | 12 | £156.00
Cable Gland M20 Nylon IP68 | 100 | £43.20
BSP Elbow Fitting 3/8" | 50 | £67.80
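
A minimal sketch of what the script step could look like for this invoice, assuming the text has already been pulled out of a text-based PDF (for example with pdfplumber's page.extract_text()). The regex patterns, the assumed invoice-number format and the helper names are illustrative, not the production script. It extracts the line items and totals, then checks that the arithmetic on the invoice adds up.

```python
import re
from decimal import Decimal

# Text as it might come out of a text-based PDF (values from the sample invoice).
INVOICE_TEXT = """\
INVOICE NPL-2026-08847
Invoice Date: 14 March 2026
PO Reference: LV-PO-2891
M8 Hex Bolt S/Steel 316 500 £0.175 £87.50
Hydraulic Hose 1/2" BSP 12 £13.00 £156.00
Cable Gland M20 Nylon IP68 100 £0.432 £43.20
BSP Elbow Fitting 3/8" 50 £1.356 £67.80
Subtotal: £354.50
VAT (20%): £70.90
TOTAL: £425.40
"""

# Assumed layout: a line item ends with quantity, unit price, line total.
LINE_ITEM = re.compile(
    r"^(?P<desc>.+?)\s+(?P<qty>\d+)\s+£(?P<unit>[\d,.]+)\s+£(?P<total>[\d,.]+)$"
)
PENNY = Decimal("0.01")


def money(text: str) -> Decimal:
    """Parse '1,240.00' style amounts exactly (no float rounding)."""
    return Decimal(text.replace(",", ""))


def labelled_amount(label: str, text: str) -> Decimal:
    """Find a line such as 'Subtotal: £354.50' and return the amount."""
    match = re.search(rf"^{re.escape(label)}\s*£([\d,.]+)\s*$", text, re.MULTILINE)
    if match is None:
        raise ValueError(f"missing field: {label}")
    return money(match.group(1))


def parse_invoice(text: str) -> dict:
    """Pull the invoice number, line items and totals out of the raw text."""
    items = []
    for line in text.splitlines():
        match = LINE_ITEM.match(line.strip())
        if match:
            items.append({
                "description": match["desc"],
                "qty": int(match["qty"]),
                "unit": money(match["unit"]),
                "total": money(match["total"]),
            })
    number = re.search(r"\b[A-Z]{3}-\d{4}-\d{5}\b", text)  # assumed number format
    return {
        "invoice_no": number.group(0) if number else None,
        "items": items,
        "subtotal": labelled_amount("Subtotal:", text),
        "vat": labelled_amount("VAT (20%):", text),
        "total": labelled_amount("TOTAL:", text),
    }


def check_totals(invoice: dict, vat_rate: Decimal = Decimal("0.20")) -> list[str]:
    """Return a list of arithmetic problems; an empty list means all checks pass."""
    problems = []
    for item in invoice["items"]:
        if (item["qty"] * item["unit"]).quantize(PENNY) != item["total"]:
            problems.append(f"line total mismatch: {item['description']}")
    if sum(item["total"] for item in invoice["items"]) != invoice["subtotal"]:
        problems.append("subtotal mismatch")
    if (invoice["subtotal"] * vat_rate).quantize(PENNY) != invoice["vat"]:
        problems.append("VAT mismatch")
    if invoice["subtotal"] + invoice["vat"] != invoice["total"]:
        problems.append("grand total mismatch")
    return problems


invoice = parse_invoice(INVOICE_TEXT)
print(invoice["invoice_no"], invoice["total"], check_totals(invoice))
```

Amounts are kept as Decimal rather than float so that the checks compare pennies exactly.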


Extracted Fields ✓ 9 fields
OCR + regex
Field Value Conf.

AML Checks

✓ Photo ID provided
✓ ID not expired (valid until 2031)
✓ Proof of address within 3 months
✓ Name matches across documents
✓ Address consistent across documents

⚠ Manual review recommended

Postcode extracted with medium confidence — verify BT9 5LT
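
The checks listed above can be sketched in a few lines once the fields are extracted. This is an illustration under stated assumptions: the Document class, the helper names and the reference date of 20 March 2026 are hypothetical, and the "within 3 months" rule is interpreted as calendar months.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass
class Document:
    full_name: str
    address: str
    issued: Optional[date] = None
    expires: Optional[date] = None


def name_key(name: str) -> str:
    """Normalise 'SURNAME, First names' and 'First names Surname' to one form."""
    if "," in name:
        surname, first = name.split(",", 1)
        name = f"{first} {surname}"
    return " ".join(name.upper().split())


def address_key(address: str) -> str:
    """Ignore case, commas and spacing when comparing addresses."""
    return " ".join(address.replace(",", " ").upper().split())


def within_months(issued: date, today: date, months: int) -> bool:
    """True if `issued` is no more than `months` calendar months before `today`."""
    if issued > today:
        return False
    elapsed = (today.year - issued.year) * 12 + (today.month - issued.month)
    if today.day < issued.day:
        elapsed -= 1  # the most recent month is not complete yet
    return elapsed < months or (elapsed == months and today.day == issued.day)


def aml_checks(photo_id: Optional[Document], proof: Optional[Document], today: date) -> dict:
    """Run each check and report pass/fail by name."""
    have_both = photo_id is not None and proof is not None
    return {
        "Photo ID provided": photo_id is not None,
        "ID not expired": photo_id is not None and photo_id.expires > today,
        "Proof of address within 3 months": proof is not None
        and within_months(proof.issued, today, 3),
        "Name matches across documents": have_both
        and name_key(photo_id.full_name) == name_key(proof.full_name),
        "Address consistent across documents": have_both
        and address_key(photo_id.address) == address_key(proof.address),
    }


licence = Document(
    "MCALLISTER, James Patrick",
    "24 Malone Road, Belfast, BT9 5LT",
    issued=date(2021, 3, 12),
    expires=date(2031, 1, 15),
)
rates_notice = Document(
    "James Patrick McAllister",
    "24 Malone Road, Belfast BT9 5LT",
    issued=date(2026, 2, 1),
)

# Hypothetical processing date; the demo does not state one.
results = aml_checks(licence, rates_notice, today=date(2026, 3, 20))
for check, passed in results.items():
    print("✓" if passed else "✗", check)
```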

Extracted Fields ✓ 12 fields, 1 discrepancy flagged
Python / pdfplumber
Field Value Conf.

⚠ Short Delivery Flagged

Item: BSP Elbow Fitting 3/8"
Ordered: 50 units
Received: 45 units
Shortfall: 5 units (10%)
Action: Credit note / reorder required
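
The flag above comes down to comparing ordered and received quantities line by line. A small sketch, with a hypothetical DeliveryLine class and the quantities from the sample delivery note:

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DeliveryLine:
    item: str
    ordered: int
    received: int

    @property
    def shortfall(self) -> int:
        """Units missing from the delivery (never negative)."""
        return max(self.ordered - self.received, 0)

    @property
    def shortfall_pct(self) -> float:
        """Shortfall as a percentage of the ordered quantity."""
        return 100 * self.shortfall / self.ordered if self.ordered else 0.0


def short_deliveries(lines: list[DeliveryLine]) -> list[DeliveryLine]:
    """Return only the lines where fewer units arrived than were ordered."""
    return [line for line in lines if line.shortfall > 0]


delivery = [
    DeliveryLine("M8 Hex Bolt S/Steel 316", 500, 500),
    DeliveryLine('Hydraulic Hose 1/2" BSP', 12, 12),
    DeliveryLine("Cable Gland M20 Nylon IP68", 100, 100),
    DeliveryLine('BSP Elbow Fitting 3/8"', 50, 45),
]

for line in short_deliveries(delivery):
    print(f"{line.item}: {line.shortfall} units short ({line.shortfall_pct:.0f}%)")
```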

03 — Where It Went

Accounts Payable Sheet

Google Sheets — auto-updated

Supplier | Inv No | Date | Amount | Status
Belfast Steel | BSS-0841 | 10/03 | £892.00 | Paid
NI Hydraulics | NIH-2198 | 12/03 | £1,240.00 | Due
Norwood Parts | NPL-08847 | 14/03 | £425.40 | NEW
Row added to sheet
Accounts team notified
Filed to /Invoices/2026/March/

Processed in 0.8 seconds
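
As an illustration of the filing and sheet-row step, the sketch below builds the folder path and the row values from the extracted invoice fields. The function names are hypothetical, the invoice number is passed in as it appears in the sheet, and writing the row to Google Sheets itself is left out because it depends on the client library and credentials in use.

```python
from datetime import datetime
from decimal import Decimal

DATE_FORMAT = "%d %B %Y"  # e.g. "14 March 2026", as printed on the invoice


def filing_folder(invoice_date: str) -> str:
    """Folder for the invoice, organised by year and month name."""
    parsed = datetime.strptime(invoice_date, DATE_FORMAT)
    return f"/Invoices/{parsed:%Y}/{parsed:%B}/"


def sheet_row(supplier: str, inv_no: str, invoice_date: str,
              amount: Decimal, status: str = "NEW") -> list[str]:
    """Row values in the sheet's column order: supplier, number, date, amount, status."""
    parsed = datetime.strptime(invoice_date, DATE_FORMAT)
    return [supplier, inv_no, f"{parsed:%d/%m}", f"£{amount:,.2f}", status]


print(filing_folder("14 March 2026"))
print(sheet_row("Norwood Parts", "NPL-08847", "14 March 2026", Decimal("425.40")))
```

Month names are parsed using the process locale, so this assumes an English (or default C) locale.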

Case Management System

Client Onboarding — AML Record

Client: James Patrick McAllister
Matter: Residential Conveyancing
File Ref: BEL-2026-CV-1847
AML Status: ✓ VERIFIED
✓ Photo ID collected and verified
✓ Proof of address collected
✓ ID expiry checked (2031)
✓ Risk assessment: LOW RISK
□ Partner sign-off (pending)
Auto-filed to: /Clients/McAllister_JP/AML/2026/

Processed in 1.2 seconds

One field flagged for manual review — postcode confidence MEDIUM. Highlighted for fee earner to verify.
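
One way to decide which fields go to a person is to band each field's OCR confidence and queue anything below HIGH. The sketch below is an assumption throughout: the thresholds, the numeric scores and the simplified UK postcode pattern are hypothetical, since the demo only shows the resulting MEDIUM band.

```python
import re

# Simplified UK postcode shape (assumption; not a full validity check).
POSTCODE = re.compile(r"^[A-Z]{1,2}\d[A-Z\d]? ?\d[A-Z]{2}$")

HIGH_THRESHOLD = 0.90    # assumed cut-off for HIGH
MEDIUM_THRESHOLD = 0.70  # assumed cut-off for MEDIUM


def band(score: float) -> str:
    """Map a 0..1 confidence score to HIGH, MEDIUM or LOW."""
    if score >= HIGH_THRESHOLD:
        return "HIGH"
    if score >= MEDIUM_THRESHOLD:
        return "MEDIUM"
    return "LOW"


def review_queue(fields: dict[str, tuple[str, float]]) -> list[str]:
    """List fields a person should verify: not HIGH, or a malformed postcode."""
    flagged = []
    for name, (value, score) in fields.items():
        bad_format = name == "postcode" and not POSTCODE.match(value.upper())
        if band(score) != "HIGH" or bad_format:
            flagged.append(f"{name}: {value} ({band(score)})")
    return flagged


# Hypothetical scores for three of the extracted fields.
extracted = {
    "surname": ("MCALLISTER", 0.99),
    "licence_number": ("MCALL781156JP9AB", 0.97),
    "postcode": ("BT9 5LT", 0.82),
}
print(review_queue(extracted))
```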

Stock Management System

Goods Received — Auto Update

✓ M8 Hex Bolt S/Steel 316
+500 units added to stock
✓ Hydraulic Hose 1/2" BSP
+12 units added to stock
✓ Cable Gland M20 Nylon IP68
+100 units added to stock
⚠ BSP Elbow Fitting 3/8"
+45 units added (5 short)
Credit note request raised: CN-REQ-0291
Purchasing team notified

Processed in 0.6 seconds

PO LV-PO-2891 marked: RECEIVED WITH DISCREPANCY
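
A sketch of how the stock update and the purchase order status could follow from the delivery note lines. The function name, the empty opening stock and the plain "RECEIVED" status for a complete delivery are assumptions; only "RECEIVED WITH DISCREPANCY" appears in the demo.

```python
def receive_goods(stock: dict[str, int], lines: list[tuple[str, int, int]]) -> str:
    """Add received quantities to stock and return the resulting PO status.

    Each line is (item, ordered, received). `stock` is updated in place.
    """
    short_items = []
    for item, ordered, received in lines:
        stock[item] = stock.get(item, 0) + received
        if received < ordered:
            short_items.append(item)
    return "RECEIVED WITH DISCREPANCY" if short_items else "RECEIVED"


stock_levels: dict[str, int] = {}  # hypothetical: starts empty for the example
po_status = receive_goods(
    stock_levels,
    [
        ("M8 Hex Bolt S/Steel 316", 500, 500),
        ('Hydraulic Hose 1/2" BSP', 12, 12),
        ("Cable Gland M20 Nylon IP68", 100, 100),
        ('BSP Elbow Fitting 3/8"', 50, 45),
    ],
)
print(po_status, stock_levels)
```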

Manual vs Automated

Manual Process

Open document in email
Read each field manually
Type into spreadsheet
Check for errors yourself
File the document somewhere
~8 minutes per document

Automated

Script detects new document
Extracts all fields in about a second
Writes directly to your system
Flags discrepancies automatically
Auto-files by type and date
~1 second per document

If you process 50 documents/week:

Manual: ~6.7 hours
Automated: ~1 minute

That's 6+ hours every week that your team could spend on work that actually requires their judgement.
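
The arithmetic behind the weekly figures, using the per-document times quoted above (8 minutes manual, 1 second automated). The function name is just for illustration.

```python
from datetime import timedelta


def weekly_time(docs_per_week: int, seconds_per_doc: int) -> timedelta:
    """Total processing time per week at a fixed time per document."""
    return timedelta(seconds=docs_per_week * seconds_per_doc)


manual = weekly_time(50, 8 * 60)  # 8 minutes per document
automated = weekly_time(50, 1)    # 1 second per document

print(manual)              # 6:40:00
print(automated)           # 0:00:50
print(manual - automated)  # 6:39:10
```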

The right tool for the job

Not every document processing problem needs AI. Here's how we approach the choice:

📄

Python Scripts

For consistent, predictable document formats from the same source. Fast, cheap and deterministic: the same layout always yields the same output. Typical tools: pdfplumber, camelot, regex.

Best for: Invoices from a single supplier, structured exports

🔍

OCR + Pattern Matching

For scanned documents or PDFs without selectable text. Combines optical character recognition with pattern matching for known fields.

Best for: Scanned delivery notes, older document formats

🤖

AI Assistance

For genuinely variable documents where the format changes between senders. Used selectively, where it adds value over a script.

Best for: Mixed supplier formats, complex layouts, exception handling

We always use the simplest reliable approach first. AI is a tool, not a default.
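
The "simplest reliable approach first" rule can be written down as a short decision function. This is a simplified sketch: the two yes/no inputs and the names are assumptions, and a real assessment would look at sample documents rather than two flags.

```python
from enum import Enum


class Approach(Enum):
    PYTHON_SCRIPT = "Python script"
    OCR_PATTERN = "OCR + pattern matching"
    AI_ASSIST = "AI assistance"


def choose_approach(has_text_layer: bool, consistent_layout: bool) -> Approach:
    """Pick the simplest approach that fits the document.

    has_text_layer: the PDF contains selectable text (not just a scanned image).
    consistent_layout: documents from this source always use the same layout.
    """
    if consistent_layout and has_text_layer:
        return Approach.PYTHON_SCRIPT
    if consistent_layout:
        return Approach.OCR_PATTERN  # scanned, but the fields are in known places
    return Approach.AI_ASSIST        # layout varies between senders


print(choose_approach(has_text_layer=True, consistent_layout=True).value)
print(choose_approach(has_text_layer=False, consistent_layout=True).value)
print(choose_approach(has_text_layer=True, consistent_layout=False).value)
```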

How much time does your team spend processing documents manually?

Tell us which document types you receive, how many per week, and where the data needs to go. We'll tell you honestly whether it can be automated and what it would cost.

From £2,200
Get a quote →
See the full service →