HTML to Text Converter (Tag Removal & Line Formatting)

Extract readable body text from HTML, format paragraphs/lists, and remove script/style blocks automatically.

Choose entity decoding and handling for links, images, and tables, then copy or download the result in one flow.

Your input is processed locally in this browser and is never sent to a server.

Remove HTML tags and extract only text

Input and output are shown side-by-side, so you can tune line-break rules and options with minimal trial and error.

What this tool does

  • Remove HTML tags
  • Format line breaks for paragraphs and lists
  • Always exclude script/style and optionally decode entities

How to use

  1. Paste HTML (or load a file).
  2. Adjust formatting options if needed.
  3. Convert, then copy the result or save as .txt.

Sample

Standard rule

Input (HTML)

<article><h2>Notice</h2><p>Welcome<br>to Finite Field</p><ul><li>Intro</li><li>Usage</li></ul></article>

Output (Text)

Notice

Welcome
to Finite Field

・Intro
・Usage

Minimal rule

Input (HTML)

<div><p>A</p><p>B</p></div>

Output (Text)

A
B

Detailed rule

Input (HTML)

<ol><li>Define requirements</li><li>Implement</li></ol>

Output (Text)

1. Define requirements
2. Implement

Formatting options

  • Switch line-break rule: Standard / Minimal / Detailed.
  • Choose whether to decode HTML entities.
  • Tune readability with space collapsing and max consecutive line breaks.
  • Control handling of comments, links, images, and tables.

Note: <script> and <style> are always excluded.

Conversion rules

Elements converted to line breaks (Standard)

  • <br>
  • <p>, headings (h1-h6), <li>
  • <div>, <section>, <article>, <header>, <footer>, <blockquote>

Always excluded elements

  • <script>...</script>
  • <style>...</style>
  • You can also exclude <noscript> when needed.

List formatting examples

  • <ul><li>Item</li></ul> -> ・Item
  • <ol><li>Item</li></ol> -> ・Item (Standard/Minimal)
  • <ol><li>Item</li></ol> -> 1. Item (Detailed)

FAQ

Line breaks look odd

This tool converts tags like <p>/<br>/<li> into line breaks. Adjust Line Break Rule and Max Consecutive Line Breaks to get closer to your target format.

Are script/style blocks kept?

No. <script> and <style> are always removed.

What about entities like &amp; ?

You can choose decode on/off. Turn decoding on when you want &amp; to become &.

How can I reduce extra line breaks and spaces?

Use Minimal rule, keep Collapse Spaces on, and set Max Consecutive Line Breaks to 1 or 2.

Can I keep URLs with link text?

Yes. Set Link Handling to Text + URL.

Related tools

Notes

  • The tool parses the input HTML source; it does not execute scripts or reproduce full browser rendering output.
  • Very large inputs may take longer to convert.
  • Input content is not stored in URL query parameters.