Actions (14)
- Marketplace Dataset Actions
- Web Scraper Actions
- Web Unlocker Actions
Overview
The node interacts with the Bright Data platform to perform web scraping tasks and manage datasets. Specifically, the "Monitor Progress Snapshot" operation under the "Web Scraper" resource allows users to track the progress of a previously initiated snapshot. This is useful for monitoring long-running scraping jobs to check their current status, completion percentage, or any issues encountered during execution.
Practical examples include:
- Monitoring a snapshot that scrapes product prices from an e-commerce site to know when data collection is complete.
- Tracking the progress of a snapshot gathering news articles to trigger downstream processing once finished.
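Both examples come down to polling until the snapshot reaches a final state. The following is a minimal sketch of that loop, not the node's own implementation. The `wait_for_snapshot` helper and the `fetch_progress` callable are hypothetical, and the `status` field with the values `completed` and `failed` is assumed from the statuses this page describes; the real API may use different names.

```python
import time
from typing import Any, Callable, Dict

# Assumed terminal status values, taken from the statuses named in this page.
TERMINAL_STATUSES = {"completed", "failed"}


def wait_for_snapshot(
    fetch_progress: Callable[[str], Dict[str, Any]],
    snapshot_id: str,
    interval_s: float = 10.0,
    max_checks: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Poll fetch_progress(snapshot_id) until a terminal status is reported.

    fetch_progress is a hypothetical callable that returns the same JSON
    object the "Monitor Progress Snapshot" operation outputs.
    """
    for attempt in range(max_checks):
        progress = fetch_progress(snapshot_id)
        status = str(progress.get("status", "")).lower()
        if status in TERMINAL_STATUSES:
            return progress
        # Wait before the next check, but not after the final attempt.
        if attempt < max_checks - 1:
            sleep(interval_s)
    raise TimeoutError(
        f"Snapshot {snapshot_id} did not finish after {max_checks} checks"
    )
```

In an n8n workflow the same pattern is usually built from a Wait node and an IF node looping back to this operation; the function above only illustrates the control flow.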
Properties
| Name | Meaning |
|---|---|
| Snapshot ID | The unique identifier of the snapshot whose progress you want to monitor. This is required. |
Output
The node outputs JSON data representing the current state of the specified snapshot. This typically includes fields such as progress percentage, status (e.g., running, completed, failed), timestamps, and possibly error messages if the snapshot encountered issues.
If the snapshot produces binary data (e.g., downloaded files), this node does not directly output it but rather provides metadata about the snapshot's progress.
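As an illustration of how a downstream step might read this output, the sketch below turns one output item into a short summary. The field names `status`, `progress`, and `error` are assumptions based on the description above, not a confirmed schema, and `summarize_progress` is a hypothetical helper.

```python
from typing import Any, Dict


def summarize_progress(item: Dict[str, Any]) -> str:
    """Return a one-line summary of a snapshot progress item.

    Assumed fields (not a confirmed schema): "status", "progress", "error".
    """
    status = str(item.get("status", "unknown")).lower()
    percent = item.get("progress")
    error = item.get("error")

    if status == "failed":
        return f"failed: {error or 'no error message provided'}"
    if status == "completed":
        return "completed"
    if percent is not None:
        return f"{status} ({percent}%)"
    return status
```

Missing fields fall back to neutral values rather than raising, since the exact set of fields returned for a given snapshot is not guaranteed here.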
Dependencies
- Requires an API key credential for authenticating with the Bright Data platform.
- The node makes HTTP requests to the Bright Data API endpoint at https://api.brightdata.com.
- Proper configuration of the API key credential in n8n is necessary for successful operation.
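For orientation, an equivalent request made outside n8n might be constructed as in the sketch below. Only the base URL comes from this page. The path `/datasets/v3/progress/` and the Bearer authorization scheme are assumptions to verify against the Bright Data API reference, and `build_progress_request` is a hypothetical helper. The function only builds the request; it does not send it.

```python
import urllib.request
from urllib.parse import quote

BASE_URL = "https://api.brightdata.com"
# Assumed path for the progress endpoint; confirm against the API reference.
PROGRESS_PATH = "/datasets/v3/progress/"


def build_progress_request(api_key: str, snapshot_id: str) -> urllib.request.Request:
    """Build (but do not send) a GET request for a snapshot's progress."""
    snapshot_id = snapshot_id.strip()
    if not snapshot_id:
        raise ValueError("Snapshot ID is required")
    # Percent-encode the ID so unexpected characters cannot alter the path.
    url = f"{BASE_URL}{PROGRESS_PATH}{quote(snapshot_id, safe='')}"
    return urllib.request.Request(
        url,
        headers={"Authorization": f"Bearer {api_key}"},  # assumed auth scheme
        method="GET",
    )
```

Passing the result to `urllib.request.urlopen` would perform the call; inside n8n, the node and its credential handle this step.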
Troubleshooting
Common Issues:
- Invalid or missing Snapshot ID: Ensure the Snapshot ID provided exists and is correctly typed.
- Authentication errors: Verify that the API key credential is valid and has sufficient permissions.
- Network or API downtime: Check connectivity and Bright Data service status.
Error Messages:
- Unauthorized or 401 errors indicate invalid credentials; reconfigure the API key.
- 404 errors suggest the Snapshot ID does not exist or is inaccessible.
- Timeout or network errors may require retrying the request or checking the internet connection.
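The guidance above can be expressed as a small lookup, sketched here with a hypothetical `explain_error` helper. Treating 5xx responses as service downtime and using `None` to represent a timeout or network failure are assumptions made for this example.

```python
from typing import Optional


def explain_error(status_code: Optional[int]) -> str:
    """Map an HTTP status code (or None for no response) to a suggested action."""
    if status_code is None:
        # No HTTP response at all: timeout or network failure.
        return "Network or timeout error: check connectivity and retry."
    if status_code == 401:
        return "Unauthorized: reconfigure the API key credential."
    if status_code == 404:
        return "Not found: check that the Snapshot ID exists and is accessible."
    if status_code >= 500:
        # Assumption: server-side errors are treated as service downtime.
        return "Server error: check Bright Data service status and retry later."
    return f"Unexpected HTTP {status_code}: inspect the response body."


def should_retry(status_code: Optional[int]) -> bool:
    """Retry only failures that may be transient (no response or 5xx)."""
    return status_code is None or status_code >= 500
```

Retrying a 401 or 404 is unlikely to help, since both point to a configuration problem rather than a transient fault.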