Web Scraper

Node Inputs

URL:
The web address to scrape.
Example: "https://www.example.com"

Default Content Type:
Specifies the type of content to extract based on predefined templates. Options:

XPath 1, XPath 2, XPath 3:
Custom XPath expressions for targeted data extraction.
Example: "//div[@class='content']"

Output:
Extracted content or HTML code based on the URL and settings provided.
Example Output:

<div class="content">This is the extracted content.</div>

The Web Scraper node:

Connects to the provided URL and retrieves HTML content.
Supports structured extraction using predefined templates like Article or Product.
Allows customization using up to three custom XPath expressions for precise data targeting.
Outputs raw HTML or parsed content for further analysis or integration.

On this page