You have 27 PDF invoices sitting in a folder. Your accounting system — QuickBooks, Xero, FreshBooks, or just the master Excel sheet your accountant asked for — wants the data as rows: vendor, date, items, totals, tax. Nothing complicated, but nothing it can read directly from a PDF either.
So what are your actual options?
There are five of them, realistically. I've used all five at different points — some still occasionally, some I gave up on for reasons I'll get to. Below, each one with a time estimate for that 27-invoice batch, an honest view of where it breaks down, and what it actually costs. No affiliate spin. Some of these methods will recommend tools I don't sell, because they're the right answer for certain batch sizes.
If you want the short version: under 3 invoices, type manually. 3-10, copy-paste with cleanup. 10+ and recurring, invest in a dedicated extractor. Read on for the reasoning.
First: check whether your PDFs are text or image
Before you pick a method, do this one-second test: open any of the invoices and try to select the total amount with your cursor. If you can select it (it highlights cleanly), the PDF is text-based and almost any method will work. If you can't select it — if the whole page acts like a picture — it's a scanned image PDF and you need real OCR, not just text extraction.
Most digital invoices (the ones vendors generate from billing software and email to you) are text-based. Most scanned physical invoices (customer handed you a paper receipt, you scanned it) are image PDFs. Mixing both is normal; just know which is which before you pick your method.
Method 1 · Type them in manually
Cost: Free (your time)
Privacy: Perfect
Best for: Under 3 invoices a month
Open the PDF. Read a value. Type it into Excel. Read the next value. Type that. Repeat.
I know this sounds stupid to suggest, but hear me out: if you process three invoices a month, this is the right answer. The setup overhead of any tool wipes out the savings. And every tool has some mis-extraction rate; manual typing has zero (as long as you type carefully). For very low volume, manual still wins on accuracy-per-minute.
Where it breaks down: anywhere above about five invoices at once. Attention dies, typos creep in, and you start mis-keying totals in ways you won't catch until reconciliation complains two weeks later. Also: for scanned image PDFs with tiny text, your eyes are doing the OCR and it hurts.
Method 2 · Copy-paste into Excel with cleanup
Cost: Free
Privacy: Perfect (nothing leaves your machine)
Best for: 3-10 invoices a month on text-based PDFs
Open the PDF. Select the invoice content. Copy. Paste into Excel. Realise it pasted as one mangled blob that loses the table structure. Spend 5 minutes splitting columns and cleaning whitespace. Repeat for the next invoice.
This works fine for PDFs where the invoice layout is simple — one table, clear columns, no merged cells. For anything more complex (multi-page invoices, pricing in one section and items in another, tax summaries at the bottom), the paste gets so mangled that cleanup takes longer than manual typing.
Two tricks that help: paste as plain text (Ctrl+Shift+V in most systems) to strip PDF formatting, and learn Excel's Data → Text to Columns feature for splitting mashed-together rows on a delimiter.
Where it breaks down: doesn't work at all on scanned image PDFs (nothing to copy). Line items in multi-invoice batches get confused with each other. No auto-categorisation — you're still typing vendor names and mapping to chart-of-accounts codes manually.
Method 3 · Adobe Acrobat Pro tabular export
Cost: $20-23/month subscription (or $240+/year)
Privacy: Mixed — local processing but Adobe's cloud features may sync unless disabled
Best for: People who already have Acrobat Pro for other reasons
Adobe Acrobat Pro has two features that help here. The first is the "Export To Spreadsheet" option under Tools → Export PDF, which does a reasonable job on text-based PDFs with clear tables. The second is Acrobat's built-in OCR (Tools → Recognize Text → In This File), which converts scanned image PDFs into text-selectable ones — after which you can use Method 2.
Quality-wise, Acrobat's table detection is decent but not great. It handles clean invoices from systems like Xero or FreshBooks well. On weirder layouts — invoices with a logo header spanning three rows, or tax summaries nested inside line items — it gets confused and produces output that needs 2-5 minutes of cleanup per file.
Where it breaks down: the subscription cost is the big one. If you don't already need Acrobat for other reasons (contract review, form filling, etc.), paying $240/year just for invoice extraction makes no sense — the dedicated tools are cheaper and better at this specific task. Also: Acrobat's cloud-sync defaults mean your invoices can end up on Adobe's servers if you haven't explicitly turned off Adobe Document Cloud. Worth checking.
Method 4 · Free online OCR services
Cost: Free (usually limited to 5-20 files per day before paywall kicks in)
Privacy: Poor — your invoices are uploaded to their servers
Best for: One-off personal invoices that contain nothing sensitive
Sites like Smallpdf, ILovePDF, PDF24, OnlineOCR, and half a dozen others offer free PDF-to-Excel conversion in a browser. Upload a PDF, download the extracted Excel. Pretty good for basic cases.
I'm going to be blunt about this one: don't use free online OCR services for client invoices, confidential business documents, or anything with personal data on it. These tools are free because you're uploading your documents to their servers. Most of the terms of service give them some kind of license to store, process, or analyse what you upload — sometimes for "service improvement," sometimes vaguer than that. For your personal restaurant receipts on a side project, fine. For your client's accounts-payable run, not fine.
Quality varies widely. I tested Smallpdf and ILovePDF on a batch of 20 invoices last year. Smallpdf got about 70% of the totals right; ILovePDF got about 60%. That's worse than typing manually, and you had to verify each one anyway. Plus every file got uploaded somewhere I couldn't fully audit.
Where it breaks down: privacy, accuracy, daily quotas, and paywall-for-anything-serious.
Method 5 · Dedicated invoice OCR that runs locally
Cost: $0-39 one-time (for the right tool) or $30-50/month (for subscription tools)
Privacy: Depends on tool — browser-based local is perfect; cloud subscription is their servers
Best for: Ongoing extraction work, 10+ invoices per month
This is where dedicated tools earn their keep. The category splits two ways:
Subscription cloud tools (you probably know the names — I covered them in last week's post). Upload invoice, tool extracts, syncs to QuickBooks or Xero, done. Excellent integration, ~$30-50/month per user, your data lives on their cloud. Best choice if automatic sync is worth the subscription to you.
Browser-based local extraction — this is what I built JuSenseSheet to do. Drop PDFs into a webpage, extraction happens entirely in your browser using PDF.js and a custom layout parser, extracted data goes into IndexedDB on your device (AES-256 encrypted), exports as Excel or CSV. $39 one-time, or free for up to 50 invoices a month.
Honest caveat about JuSenseSheet: it does not automatically sync to QuickBooks or Xero. You export a CSV and import it through those systems' standard import tools. For a 27-invoice batch, that adds about 40 seconds. For a daily workflow moving thousands of transactions, that friction is a dealbreaker and you want a subscription tool. Know which you are.
Where the local-tool category breaks down: it doesn't, really, for the niche it serves. The real limitation is scope — these tools don't do approval workflows, bank feed reconciliation, or team collaboration. If you need those, this category isn't for you regardless of which specific tool.
The 30-second claim, explained
The title of this post says "30 seconds." That's specifically the time it takes, using a dedicated tool like JuSenseSheet, to go from "drop 5 PDFs onto the page" to "Excel file downloaded with extracted data." Here's the actual breakdown I timed on my own machine:
- 0-2 seconds: Drop PDFs, extraction kicks off
- 2-20 seconds: Parsing (depends on PDF complexity — scanned PDFs take longer than text-based)
- 20-25 seconds: Review the extracted values in the spreadsheet view, fix any obvious mistakes inline
- 25-30 seconds: Click Export, file downloads
For the first few batches you'll spend longer reviewing because you don't yet trust the extraction. After a week, you'll scan the results in about 10 seconds and move on. That's where the 30-second figure comes from.
A decision matrix, for the impatient
| Your situation | Best method | Why |
|---|---|---|
| Under 3 invoices/month | Type manually | Tool overhead doesn't pay off |
| 3-10/month, text PDFs | Copy-paste + Excel | Free and fine for simple layouts |
| 3-10/month, scanned PDFs | Acrobat Pro OCR + copy-paste | Need OCR, don't need dedicated tool |
| 10+ invoices/month, solo | Local browser OCR (JuSenseSheet) | Scales without the monthly fee |
| 50+/month, team, QBO/Xero workflow | Dext or Hubdoc subscription | Integration pays for itself |
| AP-heavy with approvals | Bill.com | You need the whole workflow |
| Personal side-project receipts | Free online OCR | Privacy risk is acceptable, cost is zero |
One thing almost everyone gets wrong
Whatever method you pick, always check the totals. Every OCR tool, even the expensive ones, occasionally misreads a "1" as a "7" or drops a decimal place. If your extracted total doesn't match what's printed on the invoice, the line items behind it are probably wrong too. The 10 seconds you spend verifying each invoice's grand total is the cheapest insurance you'll ever buy.
The single most common bookkeeping error I see is trusting OCR output because it looks structured. Structured output that's wrong is worse than unstructured output that you'd have verified anyway. Treat every extracted invoice as "draft data until verified," not "data."
If you want to try the one-time option
JuSenseSheet is free for up to 50 invoices a month with no account, no card, and no time limit. If you're in the 10-to-50 invoices/month band where dedicated tools help but monthly subscriptions don't make sense, that's who I built it for. If you process more than 50 a month, the Pro tier is $39 one-time for unlimited use across all your devices.
And if none of these methods fit — if your workflow has some twist I haven't covered here — email me. I keep a running list of cases where the standard answers don't work, and occasionally it turns into a product feature.