Ai
Docling v2.97.0 Adds Email Parsing and HTML Backend Row-Section Support

Docling v2.97.0 Adds Email Parsing and HTML Backend Row-Section Support

Docling v2.97.0 Adds Email Parsing and HTML Backend Row-Section Support

Docling's latest update, v2.97.0, is out and it's a solid step forward for anyone dealing with document processing. The team focused on expanding input formats and polishing the HTML pipeline. Let's dive into what changed and why it matters.

What Changed

Email parsing is now part of the backends. That's a big deal for enterprise workflows where documents often arrive as email attachments. The feature landed in commit b741c4a, addressing a long-standing request. The HTML backend also got row-section support—meaning tables within HTML documents are now parsed more accurately. Finally, the CLI can fetch images from HTML pages, which means less manual pre-processing.

Why It Matters

Email parsing alone makes this release worthwhile. Plenty of industries rely on extracting data from email threads—think invoices, contracts, or support tickets. Row-section support? It's those annoying spanning headers that trip up parsers. Now Docling handles them. And the CLI image fetch? For developers automating data pipelines, that's one less scripting headache. These updates might not be flashy, but they hit real pain points.

There's a pattern here: Docling keeps chipping away at document complexity. Each version adds resilience to weird formats. That's the kind of progress that actually saves time in production.

Official Source: https://github.com/docling-project/docling/releases/tag/v2.97.0

Tags:

What's your reaction?

0
AWESOME!
AWESOME!
0
LOVED
LOVED
0
NICE
NICE
0
LOL
LOL
0
FUNNY
FUNNY
0
EW!
EW!
0
OMG!
OMG!
0
FAIL!
FAIL!