Node Inputs
Required Fields
HTML Content: The HTML content to parse and extract tags from.
Example:
Optional Fields
Tag Names: Specific HTML tags to extract. If empty, all tags are extracted.
Example:
["h1", "h2", "h3", "a", "img"]
Extract Attributes: Whether to extract tag attributes.
Example:
true
Default:
true
Extract Text Content: Whether to extract text content within tags.
Example:
true
Default:
true
Remove Duplicates: Whether to remove duplicate tag occurrences from the results.
Example:
true
Default:
false
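The inputs and defaults above can be summarized in a small Python sketch. The class name `TagExtractorInputs` and the snake_case field names are hypothetical, chosen only to mirror the field labels on this page; the node itself may expose these settings differently.

```python
from dataclasses import dataclass, field


@dataclass
class TagExtractorInputs:
    """Hypothetical container mirroring the node's documented input fields."""

    html_content: str                                    # required
    tag_names: list[str] = field(default_factory=list)   # empty list = extract all tags
    extract_attributes: bool = True                      # documented default: true
    extract_text_content: bool = True                    # documented default: true
    remove_duplicates: bool = False                      # documented default: false


# Only the required field and a tag filter are set; the rest use the defaults.
inputs = TagExtractorInputs(
    html_content="<h1>Title</h1><p>Body</p>",
    tag_names=["h1", "h2", "h3", "a", "img"],
)
```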
Node Output
Extracted Tags: A structured list of all extracted HTML tags with their attributes and content.
Example Output:
Node Functionality
The Tag Extractor node:
- Extracts HTML tags from HTML content.
- Captures tag attributes (class, id, href, src, etc.).
- Extracts text content within tags.
- Supports filtering by specific tag names.
- Provides tag occurrence statistics.
- Is useful for HTML analysis, SEO audits, and content extraction.
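The behavior listed above can be sketched in Python with the standard library's `html.parser` module. This is an illustration only, not the node's actual implementation: the function name `extract_tags`, its parameter names, the output keys (`tag`, `attributes`, `text`, `counts`), and the choice to count occurrences before duplicates are removed are all assumptions.

```python
import json
from collections import Counter
from html.parser import HTMLParser

# Elements that never have a closing tag, so they must not stay "open".
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class TagCollector(HTMLParser):
    """Collects tags, their attributes, and the text nested inside them."""

    def __init__(self, tag_names=None):
        super().__init__()
        # An empty or missing filter means "keep every tag".
        self.wanted = {name.lower() for name in tag_names or []}
        self.records = []  # one dict per matching tag, in document order
        self.open = []     # stack of records for tags that are still open

    def handle_starttag(self, tag, attrs):
        record = {"tag": tag, "attributes": dict(attrs), "text": ""}
        if not self.wanted or tag in self.wanted:
            self.records.append(record)
        if tag not in VOID_TAGS:
            self.open.append(record)

    def handle_endtag(self, tag):
        # Close the most recent matching open tag, tolerating sloppy HTML.
        for i in range(len(self.open) - 1, -1, -1):
            if self.open[i]["tag"] == tag:
                del self.open[i:]
                break

    def handle_data(self, data):
        # Text is credited to every enclosing tag that is still open.
        for record in self.open:
            record["text"] += data


def extract_tags(html, tag_names=None, extract_attributes=True,
                 extract_text=True, remove_duplicates=False):
    parser = TagCollector(tag_names)
    parser.feed(html)
    parser.close()

    tags, seen = [], set()
    for record in parser.records:
        item = {"tag": record["tag"]}
        if extract_attributes:
            item["attributes"] = record["attributes"]
        if extract_text:
            item["text"] = " ".join(record["text"].split())  # collapse whitespace
        key = json.dumps(item, sort_keys=True)  # hashable fingerprint of the item
        if remove_duplicates and key in seen:
            continue
        seen.add(key)
        tags.append(item)

    # Assumption: statistics count every matching occurrence, duplicates included.
    counts = dict(Counter(record["tag"] for record in parser.records))
    return {"tags": tags, "counts": counts}


if __name__ == "__main__":
    sample = ('<h1 class="title">Hello</h1>'
              '<a href="/x">Link</a><a href="/x">Link</a><img src="a.png">')
    print(extract_tags(sample, tag_names=["h1", "a"], remove_duplicates=True))
```

In this sketch, two tags are treated as duplicates only when their name and every extracted field match exactly. A real node could reasonably use a looser rule, for example comparing tag names alone.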