Skip to main content

What does this node do?

Extracts specific HTML tags and their contents from HTML documents.

Configuration

html
string
required
HTML content.
tags
array
required
Tags to extract (e.g., ["h1", "h2", "a"]).
include_attributes
boolean
default:"true"
Include tag attributes in output.

Output

{
  "h1": ["Main Title"],
  "h2": ["Section 1", "Section 2"],
  "a": [
    {"text": "Link", "href": "https://..."}
  ]
}