Extract Text from PDFs
Pull raw text from any PDF URL so you can search, quote, or process its contents programmatically.
Retrieve metadata from any PDF URL including page count, author, creation date, and file size.
ToolPDFBefore processing a PDF — extracting its text, merging it, or sending it to a client — you often need basic facts: how many pages is it, when was it created, who authored it, and is it a scanned image or contains real text? Downloading the file just to check these details is wasteful.
The get_info skill fetches metadata from any public PDF URL without downloading the full file. You get page count, file size, creation and modification dates, author, title, and producer information immediately. This lets you validate documents before processing them and surface basic attribution for cataloguing.
Document management teams validating submissions, developers building PDF processing pipelines, and researchers cataloguing document collections use this to triage files before committing to heavier processing.