Evidence · first-party tested · observation #53

Grouped column headers in scanned tables lose relationships during extraction; column semantics drift.

✗ Failed🧾 artifact-verifiedTested Jun 12, 2026LlamaParse
Input — the actual file we sent
Scanned multi-column research paper
An image-only scanned research paper created from an OCR-bearing original by converting pages to images and rebuilding the PDF without searchable text. It contains text, multi-column layouts, figures, charts, tables, captions, and references, and is designed to stress OCR and reconstruction quality.
Output — unretouched
Extracted treatment table with diameter measurements.
Extracted treatment table with diameter measurements.
Output — unretouched
Scanned bar chart of tree mortality by year and treatment.
Scanned bar chart of tree mortality by year and treatment.
The observation

Grouped column headers in scanned tables lose relationships during extraction; column semantics drift.

Criterion: nested header handling · Scenario: Scanned multi-column research paper (group: scanned-research-paper)
Query this
get_evidence({
  tool: "llamaparse",
  scenario: "scanned-research-paper"
})
MCP · admin.futuresmart.ai/api/mcp
Free with attribution.
Real outputs, no retouching · every cell queryable via API & MCP · aidemos.com