Guide
How to Extract Table Data from Messy Copied Text
Copying a table from a PDF, email, or webpage often strips out the column structure, leaving you with a flat block of text. The Table Extractor tool helps recover the rows and columns so you can export the data as a usable CSV.
How table data gets mangled
Most PDF readers and web browsers copy table content as plain text. The rows are usually preserved as separate lines, but the column separators depend entirely on how the source document was formatted:
- PDFs often use multiple spaces to align columns visually
- Spreadsheets and CSV files typically use tabs or commas
- HTML tables may lose all structure and paste as a single block
- Pipe-delimited formats are common in database exports
Choosing the right delimiter
| Delimiter | Best for |
|---|---|
| Auto detect | Try this first — works in most cases |
| Tab | Spreadsheet copy-paste, TSV files |
| Multiple spaces | PDF column layouts, fixed-width reports |
| Comma | CSV files, some web table copies |
| Semicolon | European CSV exports, some database formats |
| Pipe | | Database exports, Markdown tables |
Step-by-step
1
Copy the table text
Select the table in your PDF, email, or document and copy it with Ctrl+C.
2
Paste into the Table Extractor
Open the Table Extractor and paste the text. Click "Load example" to see a sample first.
3
Try Auto detect
Click Extract table with Auto detect selected. Check the preview — if the columns look right, you're done.
4
Switch delimiter if needed
If the columns are wrong, try each delimiter option and re-run until the preview looks correct.
5
Export
Click Download CSV to save a .csv file, or Copy CSV to paste directly into a spreadsheet.
When it won't work well
- HTML tables that collapse into a single line when copied — the row structure is lost
- PDFs with merged cells — those cannot be recovered after copy-paste
- Tables where column widths vary wildly with the spaces delimiter
- Multi-level or nested headers — the first row is always treated as the header row
Related guides: Clean OCR text from a resume · Fix copy-paste formatting