The ETL+ Platform for GenAI

Welcome to Unstructured! We're trusted by 82% of the Fortune 1000 and used by over 60,000 organizations globally.

We automatically transform complex, unstructured data into clean, structured data for GenAI applications. Data is routed through dynamic transformation and enrichment pipelines to deliver the highest quality output to your LLM. Continuously. Effortlessly. Automatically.

To get started, check out our open-source offerings:

Ready for a more performant and reliable experience? Try Unstructured for free today and experience the next evolution of ETL for GenAI applications.

Learn more:

  • Company Website - Transform complex, unstructured data into clean, structured data. Securely. Continuously. Effortlessly.
  • Extensive Documentation - Our comprehensive docs cover everything from getting started guides to in-depth API references, ensuring you have the resources you need to succeed.
  • Developer Community on Slack - Connect with fellow developers, share knowledge, and get support through our vibrant community Slack channel.

Popular repositories

  1. unstructured Public

    Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …

    HTML 14.3k 1.2k

  2. unstructured-api Public

    Python 896 187

  3. unstructured-inference Public

    Python 207 75

  4. pipeline-sec-filings Public archive

    Preprocessing pipeline notebooks and API supporting text extraction from SEC documents

    Jupyter Notebook 149 35

  5. unstructured-python-client Public

    A Python client for the Unstructured Platform API

    Python 115 20

  6. unstructured-ingest Public

    HTML 104 57

Repositories

Showing 10 of 40 repositories
  • docs Public

    Documentation for all Unstructured products and libraries

    MDX 8 26 0 16 Updated Mar 25, 2026
  • unstructured Public

    Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise-grade Platform product for production-grade workflows, partitioning, enrichments, chunking, and embedding.

    HTML 14,326 Apache-2.0 1,206 180 (1 issue needs help) 69 Updated Mar 25, 2026
  • unstructured-js-client Public

    A JavaScript/TypeScript client for the Unstructured Platform API

    TypeScript 59 MIT 15 6 1 Updated Mar 25, 2026
  • unstructured-api Public
    Python 896 Apache-2.0 187 36 17 Updated Mar 25, 2026
  • unstructured-python-client Public

    A Python client for the Unstructured Platform API

    Python 115 MIT 20 14 1 Updated Mar 25, 2026
  • unstructured-inference Public
    Python 207 Apache-2.0 75 24 35 Updated Mar 24, 2026
  • unstructured-ingest Public
    HTML 104 Apache-2.0 57 60 39 Updated Mar 23, 2026
  • UNS-MCP Public
    Jupyter Notebook 42 21 2 7 Updated Mar 23, 2026
  • unstructured-platform-plugins Public
    Python 6 Apache-2.0 3 0 2 Updated Mar 3, 2026
  • notebooks Public
    Jupyter Notebook 2 0 0 0 Updated Jan 29, 2026
