Unstructured screenshot
Code GenerationOpen_source
Unstructured logo

Unstructured

Document ingestion and parsing library for converting PDFs, images, and HTML into structured data for RAG

Convert PDFs, images, documents to structured data and text easily. Document parsing and extraction with open-source and cloud APIs for AI data preparation.

13,536 GitHub Stars
1,119 Forks
Data from: GitHubWebsiteUpdated: Jan 4, 2026

About Unstructured

Tags

document-processingdata-ingestionragparsingopen-source

Similar Tools