Actions80
- Add Attachment To PDF
- Add Barcode To PDF
- Add Form Fields To PDF
- Add HTML Header Footer
- Add Image Stamp To PDF
- Add Image Watermark To Image
- Add Margin To PDF
- Add Page Number To PDF
- Add Text Stamp To PDF
- Add Text Watermark To Image
- AI-Invoice Parser
- AI-Process Contract
- AI-Process HealthCard
- Classify Document
- Compress Image
- Compress PDF
- Convert HTML To PDF
- Convert Image Format
- Convert JSON To Excel
- Convert Markdown To PDF
- Convert PDF To Editable PDF Using OCR
- Convert PDF To Excel
- Convert PDF To PowerPoint
- Convert PDF To Word
- Convert To PDF
- Convert URL to PDF
- Convert VISIO
- Convert Word to PDF Form
- Create Images From PDF
- Create PDF/A
- Create Swiss QR Bill
- Crop Image
- Delete Blank Pages From PDF
- Delete Unwanted Pages From PDF
- Split PDF By Barcode
- Disable Tracking Changes In Word
- Enable Tracking Changes In Word
- Extract Attachment From PDF
- Extract Form Data From PDF
- Extract Pages From PDF
- Extract Resources
- Extract Table From PDF
- Extract Text By Expression
- Extract Text From Word
- Fill PDF Form
- Find And Replace Text
- Flip Image
- Flatten PDF
- Generate Barcode
- Generate Document Single
- Generate Documents Multiple
- Get Document From Pdf4me
- Get Image Metadata
- Get PDF Metadata
- Split PDF By Swiss QR
- Get Tracking Changes In Word
- Image Extract Text
- Linearize PDF
- Merge Multiple PDFs
- Overlay PDFs
- Parse Document
- Protect PDF
- Read Barcode From Image
- Read Barcode From PDF
- Read SwissQR Code
- Remove EXIF Tags From Image
- Repair PDF Document
- Replace Text With Image
- Replace Text With Image In Word
- Resize Image
- Rotate Document
- Rotate Image
- Rotate Image By EXIF Data
- Rotate PDF Page
- Sign PDF
- Split PDF By Text
- Split PDF Regular
- Unlock PDF
- Update Hyperlinks Annotation
- Upload File To PDF4me
Overview
The node extracts form data from a PDF document. It is useful when you need to programmatically retrieve filled-in form fields from PDFs, such as surveys, applications, contracts, or any interactive PDF forms. This can automate data collection workflows by converting PDF form inputs into structured JSON data for further processing or integration.
Practical examples:
- Extracting user-submitted data from PDF forms uploaded via a web portal.
- Processing scanned or digitally filled contract forms to capture key information.
- Automating extraction of survey responses collected in PDF format.
Properties
| Name | Meaning |
|---|---|
| Input Data Type | Choose how to provide the PDF file: • Binary Data (from previous node) • Base64 String (direct base64 encoded content) • URL (link to the PDF file) |
| Input Binary Field | Name of the binary property containing the PDF file (usually "data") — used only if Input Data Type is Binary Data |
| Base64 PDF Content | Base64 encoded string of the PDF document content — used only if Input Data Type is Base64 |
| PDF URL | URL pointing to the PDF file to extract form data from — used only if Input Data Type is URL |
| Document Name | Name assigned to the document during processing (default "document.pdf") |
| Advanced Options | Custom profiles in JSON format to adjust extra API call properties or specific options for the extraction process. For example, setting output data format or other API-specific parameters. See https://dev.pdf4me.com/apiv2/documentation/ |
Output
The node outputs JSON data representing the extracted form fields and their values from the PDF. The structure typically includes field names and corresponding user-entered or default values, enabling easy access and manipulation in subsequent workflow steps.
If the PDF contains binary data (e.g., attachments), those are not the focus here; the main output is structured form data in JSON format.
Dependencies
- Requires an external PDF processing API service capable of extracting form data from PDFs.
- Needs proper API authentication configured in n8n (such as an API key credential).
- Internet access may be required if providing PDF via URL.
Troubleshooting
Common issues:
- Incorrect input data type or missing binary property name can cause failures.
- Invalid or inaccessible PDF URL will result in errors fetching the document.
- Malformed base64 content will prevent successful decoding and extraction.
- API authentication errors if credentials are missing or invalid.
Error messages and resolutions:
- "Failed to fetch PDF from URL": Check URL correctness and network accessibility.
- "Invalid base64 content": Verify that the base64 string is complete and properly encoded.
- "Missing binary data": Ensure the binary property name matches the actual input binary field.
- API authentication errors: Confirm API key or token is correctly set up in n8n credentials.
Links and References
- PDF4me API Documentation
- General info on PDF form data extraction: https://en.wikipedia.org/wiki/PDF_form