A serverless pipeline that automatically extracts structured data from PDF documents using Amazon S3, AWS Lambda, and Amazon Textract. Upload a PDF, get back structured JSON — form fields, tables, and ...