HTML to Text

Browsers Viewers HTML to Text Extras Wares & Sites

h2t151b

h2t151b.zip: lots of command-line options to fine tune the output (that can be reused in a batch file). Performed excellently on my test HTML page (Ars Technica).
Core links: to follow...




h2txt102

h2txt102.zip: failed. Displayed 100s of lines of internal html non-content from modern HTML page. Scrolled more than half-way through document in order to start seeing actual content. The content itself not extracted well: shortened sentences, line breaks - not very readible.
Core links: to follow...


htm2txt1

Included about 500 lines (80 characters/line) of non-content from modern HTML page. Does not process the content in a readable way: no linefeeds (ie, wall of text). Fail.
Core links: to follow...


htmlco20

htmlco20.zip: Fail. Same issues as above. Displays hundreds of lines of non-content HTML. Displays the actual content in a difficult to read manner.
Core links: to follow...


htmstrip

htms0208.zip: Fail. Very surprised by this. This had always been a great program for HTML conversion to text. It not only displayed the recurring issue with HTML non-content being shown (2021 Ars Technica Test Page) but it failed to produce any of the actual content itself.

It's still the most feature-rich converter for DOS with tons of INI options. I did not delve into the INI options and it's possible that some tweaks here might have produced better results.
Core links: to follow...


{-- 640kb ought to be enough for anyone --}

640kb.neocities.org | updated: 03 Nov 2025
All external links open in new tab