Parsing & Control-File Matching Lab
Paste or import any text/story/data record, choose delimiters and parsing dimensions, then optionally compare against a control/source document.
1. Input / Source Text
Import file (txt, html, docx, csv, etc.)
Load file → Text
Clear
Copy
Toggle HTML/Raw
Title / Main Focus
2. Delimiter & Dimensions
Primary delimiter for parsing
New line (\\n)
Blank line (paragraph)
| (pipe)
, (comma)
; (semicolon)
TAB
~ (tilde)
Custom (regex or string)
Select parsing dimensions (in selection order)
Date
Common Name (person)
Entity/Company Name
Event/Cause Name
$ / Money
Rank / Order
Assertion / Statement of Fact
Author
Recipient
Attachments
Human Generated
System Generated
Use Ctrl/Cmd-click to select up to 8; order of clicking becomes column order.
Lock Dimension Order
Stringency / Fuzzy Matching
Strict (score ≥ 0.80)
Right = exact-ish; left = fuzzier.
Run Parsing
Download Parsed CSV
3. Parsed Records
Quick tools on parsed table:
Copy table as TSV
Highlight money
Find
#
Raw Segment
No parsed data yet.
4. Control / Source Document
Import control/source file
Load file → Text
Clear
Copy
Toggle HTML/Raw
5. Matching Scenarios (Parsed ↔ Control)
Each scenario uses the same stringency slider and dimension order.
Scenario 1: Date + Name alignment
Scenario 2: Money + timeline / rank
Scenario 3: Assertion / fact overlap
Scenario 4: Author ↔ Recipient mapping
Scenario 5: Global fuzzy similarity
You can extend these scenarios later; they all output below.
6. Scenario Results
Copy results