I used Gemini 2.5 Flash to parse receipts at scale. Here’s what I learned about multimodal OCR in production
For my startup, I needed to extract structured data (item name, price, quantity, unit cost) from photos of receipts and from product images on the shelf; faded thermal paper, crumpled, bad lighting, the works. Key findings after thousands of test…