Finn 💰 — A Personal Finance Assistant That Lives in WhatsApp

DEV Community

The problem Finn solves is a real one: most personal finance tools require you to change your habits (log into an app, categorize manually, remember later). Finn fits into a habit you already have — checking WhatsApp.

What it can do:

📝 Log expenses and income via text or voice note (Whisper transcription)
📄 Parse credit card PDF invoices — send the PDF, review the itemized list, confirm, saved
🏦 Parse bank account statement PDFs — transactions extracted via a saldo-diff algorithm that works around garbled PDF text
🔍 Query spending — by period, category, card, or cost center (cross-source: credit cards + bank account + manual entries combined)
✏️ Edit or delete any saved record by just describing it
🔒 Phone whitelist — only your number gets a response

Demo

All four scenarios below are real conversations with a running Finn instance.

🟢 Demo 1 — Log a Single Expense

The most common interaction: user mentions an amount and a description. Finn classifies the category and card automatically, shows a confirmation line, and only saves after approval.

You: 189 on lunch
Finn: 189ドル · Food · Mastercard · Me · 2026年04月22日 — confirm? ✅
You: yeah
Finn: ✅ Saved! Anything else?

If the description is ambiguous, Finn asks exactly one clarifying question:

You: spent 80 today
Finn: What was it for?
You: pharmacy
Finn: 80ドル · Pharmacy · Mastercard · Me · 2026年04月22日 — confirm? ✅
You: yes
Finn: ✅ Saved!

Voice notes work the same way — Finn echoes the transcription before acting:

You: 🎙️ [audio: "spent ninety at the grocery store today"]
Finn: 🎙️ _"spent ninety at the grocery store today"_
 90ドル · Groceries · Mastercard · Me · 2026年04月22日 — confirm? ✅
You: yes
Finn: ✅ Saved!

✏️ Demo 2 — Change Something

You can correct any saved record in plain language. Finn finds the most recent matching transaction, shows what will change, and waits for confirmation.

Change the amount:

You: the pharmacy one was actually 95, not 80
Finn: Update 80ドル → 95ドル · Pharmacy · Mastercard · Me · 2026年04月22日 — confirm? ✅
You: yes
Finn: ✅ Updated!

Change the category:

You: that lunch from yesterday, mark it as Entertainment
Finn: Update Food → Entertainment · 189ドル · Mastercard · Me · 2026年04月22日 — confirm? ✅
You: yes
Finn: ✅ Updated!

Delete a record:

You: delete the 90ドル grocery entry
Finn: Delete 90ドル · Groceries · Mastercard · Me · 2026年04月22日 — are you sure? 🗑️
You: yes
Finn: ✅ Deleted.

📄 Demo 3 — Import a Credit Card PDF Invoice

You forward the invoice PDF to the WhatsApp chat. Finn parses it, shows you every line, resolves unknown categories, and bulk-saves on one "yes".

You: [sends Mastercard invoice PDF]
Finn: 📄 Mastercard · Due 2026年04月10日 · 2,340ドル.00 · 18 items (16 new · 2 skipped)
 📊 Invoice Summary
 Purchases: 2,680ドル.00 · Financed balance (credit): -520ドル.00 · Interest: +180ドル.00 · Total due: 2,340ドル.00
 1. ✅ UBER*TRIP
 Mar 12 · 18ドル.50 · Transport · Me
 2. ✅ NETFLIX.COM
 Mar 13 · 15ドル.99 · Entertainment · Me
 3. ✅ AMAZON PRIME
 Mar 14 · 42ドル.30 · Shopping · Me (intl · USD 7.99)
 4. ❓ SQ*DOWNTOWN MKT
 Mar 15 · 34ドル.00 · ❓ · Me
 5. ✅ SHELL GAS STATION
 Mar 17 · 95ドル.00 · Transport · Me
 6. ✅ DELTA AIRLINES 01/03
 Mar 19 · 320ドル.00 · Travel · Me
 ...
 ❓ What category for: #4 SQ*DOWNTOWN MKT (34ドル.00)?
You: Groceries
Finn: Got it. Save all 16 items? (2 duplicates will be skipped)
You: yes
Finn: ✅ Saved 16 transactions · Total 2,340ドル.00
 Transport: 210ドル.50 · Shopping: 380ドル.00 · Food: 195ドル.00 · Entertainment: 95ドル.99 · Travel: 320ドル.00 · Other: 1,138ドル.51

The "Invoice Summary" line reconciles the invoice math: Purchases + Interest + Financed balance = Total due. Finn stores the interest and financed balance as signed rows so SELECT SUM(amount) always equals the invoice total.

For image-based (scanned) PDFs, Finn automatically falls back to GPT-4.1 Vision OCR — same output, no extra steps for the user.

🔍 Demo 4 — Query Spending

Finn always queries the database — it never answers from memory or inference.

Total this month:

You: how much did I spend this month?
Finn: 📊 All spending — last 60 days
 Total: 6,234ドル.18
 Shopping: 1,402ドル.10
 Food: 534ドル.90
 Transport: 487ドル.20
 Entertainment: 211ドル.70
 Travel: 420ドル.00
 Other: 3,178ドル.28

By category:

You: breakdown of Transport
Finn: 🚗 Transport — last 60 days
 487ドル.20 across 8 transactions
 Mar 15 · Uber 34ドル.90 (Mastercard)
 Mar 20 · Posto Ipiranga 180ドル.00 (Mastercard)
 Apr 01 · Uber 28ドル.50 (Mastercard)
 Apr 05 · Shell Gas 155ドル.00 (Nu)
 Apr 10 · Parking 22ドル.00 (Bradesco)
 ...

Cross-source (credit card + bank account):

You: did my card payment hit the bank account?
Finn: Yes — Mar 31 bank debit of 4,782ドル.24 matches your Mastercard invoice exactly. ✅

A specific period:

You: how much did I spend on Food in March?
Finn: 🍽️ Food — March 2026
 312ドル.40 across 9 transactions
 Highest: 89ドル.90 at a restaurant on Mar 22
 Daily average: 10ドル.08

How I Used OpenClaw

The entire agent is an OpenClaw plugin registered as a before_dispatch hook. Every WhatsApp message — text, voice, or PDF — passes through Finn before OpenClaw does anything else.

WhatsApp → OpenClaw gateway → before_dispatch hook → Finn plugin → OpenAI gpt-4.1 → Supabase

Plugin Registration

// openclaw.plugin.json
{
 "name": "finance-agent",
 "version": "1.0.0",
 "hooks": ["before_dispatch"]
}

// plugin/src/index.ts
api.on('before_dispatch', async (event, ctx) => {
 const phone = ctx.senderId ?? event.senderId
 // Phone whitelist — only the owner gets responses
 const allowedPhones = process.env.ALLOWED_PHONES?.split(',').map(p => p.trim()) ?? []
 if (allowedPhones.length > 0 && !allowedPhones.includes(phone)) {
 return { handled: true, text: '' } // silent ignore for unknown numbers
 }
 const result = await runAgent({ phone, message, mediaType })
 return { handled: true, text: result.reply }
})

The Agent Loop

The core is a tool-use loop over gpt-4.1 with six tools and a maximum of 5 iterations:

Tool	Purpose
`save_transaction`	Persist a confirmed expense or income entry
`query_spending`	Query totals, breakdowns, history from Supabase
`save_bulk_transactions`	Bulk-save confirmed invoice items from a PDF
`save_bank_statement`	Bulk-save confirmed bank statement rows
`update_transaction`	Edit a saved record after confirmation
`delete_transaction`	Delete a record after confirmation

The system prompt enforces a strict state machine: the LLM never calls save_transaction without an explicit user confirmation. The confirmation always uses a canonical format:

45ドル · Food · Mastercard · Me · 2026年04月22日 — confirm? ✅

Preventing LLM Hallucination on Queries

Even with a well-crafted prompt saying "always call query_spending", the model would sometimes answer "you spent $X on Transport" by inferring from a recently-parsed PDF in context — instead of querying the database. The fix: force tool_choice on the first iteration for any spending question:

const SPENDING_Q_RE = /how much|breakdown|what did I spend/i
let toolChoice = SPENDING_Q_RE.test(userText)
 ? { type: 'function', function: { name: 'query_spending' } }
 : 'auto'

PDF Invoice Pipeline

When a PDF arrives, the plugin routes it before the LLM ever sees it:

if (pdfText.trim().length < 100) {
 // Scanned/image-based PDF → GPT-4.1 Vision OCR
 invoice = await parseInvoiceOcr(pdfToImages(pdfBuffer))
} else if (/Extrato de:.*Agência/i.test(pdfText)) {
 // Bank statement
 stmt = parseStatementBradesco(pdfText)
} else {
 // Text-based credit card invoice
 invoice = parseInvoice(pdfText)
}

What I Learned

1. PDF parsing is harder than it looks. The text extraction from pdf-parse is reliable for prose but unreliable for table columns — numbers get concatenated with adjacent reference codes. The saldo-diff approach was a counterintuitive fix: instead of parsing the value I want, compute it from context.

2. "This month" is not a calendar concept for credit cards. A purchase on March 8 appears on an April invoice — so a filter of date >= April 1 would miss it. Finn uses a 60-day rolling window for "this month" queries to cover the billing cycle lag.

3. Forcing tool_choice prevents silent hallucination. The model reliably answers from database queries when forced, and sometimes "just knows" from context when not forced. Both answers look correct — the second one just isn't queryable later.

4. WhatsApp as an interface has a real adoption advantage. The friction of opening a dedicated finance app is the #1 reason people stop using them. A chat interface that's already open all day has zero switching cost.

5. Security in layers. OpenClaw's allowFrom whitelist blocks at the gateway level. ALLOWED_PHONES adds an application-level check. Supabase rows are scoped by phone with RLS. Each layer is independent — if one fails, the others still hold.

Tech Stack

Layer	Technology
Channel	WhatsApp via OpenClaw
Runtime	TypeScript, Node.js 20
Framework	OpenClaw (`before_dispatch` hook)
LLM	OpenAI gpt-4.1 (tool-use loop)
Database	Supabase (PostgreSQL + Row Level Security)
PDF parsing	`pdf-parse` + custom text parsers
Vision OCR	GPT-4.1 Vision (scanned PDFs)
Audio	OpenAI Whisper
Deployment	pm2 on a VPS

GitHub: github.com/vicente-r-junior/finn

Top comments (1)

emeersin profile image

Emerson Rodrigues

I am a Solutions Architect with over 13 years of experience in the technology field, specializing in ensuring that digital products are robust, scalable, and high-performing.

Location

São Paulo, Brasil
Work

Solutions Architect
Joined

Mar 29, 2026

• Apr 25

Niiice documentation, I'll fork it on git to test it! 👏
I was looking at the repo and saw the "migrations" careful when exposing them, if I know their columns I know where to carry out a possible attack.