Mistral adds a new API that turns any PDF document into an AI-ready Markdown file

On Thursday French giant language mannequin (LLM) developer Mistral launched a brand new API for builders who deal with advanced PDF paperwork. Mistral OCR is an optical character recognition (OCR) API that may flip any PDF right into a textual content file to make it simpler for AI fashions to ingest.

LLMs, which underpin widespread GenAI instruments like OpenAI’s ChatGPT, work notably effectively with uncooked textual content. So firms that wish to create their very own AI workflow know that it has develop into extraordinarily vital to retailer and index knowledge in a clear format in order that this knowledge will be reused for AI processing.

In contrast to most OCR APIs, Mistral OCR is a multimodal API, which means that it might probably detect when there are illustrations and photographs intertwined with blocks of textual content. The OCR API creates bounding packing containers round these graphical components and consists of them within the output.

Mistral OCR additionally doesn’t simply output an enormous wall of textual content; the output is formatted in Markdown, a formatting syntax that builders use so as to add hyperlinks, headers, and different formatting components to a plain textual content file.