Repositories
Select a repository to view its commits, contributors, and more.
public
unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
HTML
9806
820
184
Updated 8 hours ago
public
unstructured-api
Python
617
134
35
Updated 4 days ago
public
unstructured-inference
Python
167
54
34
Updated 9 hours ago
public
pipeline-sec-filings
Preprocessing pipeline notebooks and API supporting text extraction from SEC documents
Jupyter Notebook
142
31
12
Updated 3 days ago
public
unstructured-python-client
A Python client for the Unstructured hosted API
Python
87
17
14
Updated 2 days ago
public
unstructured-js-client
A TypeScript client for the Unstructured hosted API
TypeScript
45
12
7
Updated 3 days ago
public
unstructured-ingest
HTML
37
22
67
Updated 15 hours ago
public
unstructured-api-tools
Python
28
11
3
Updated a year ago
public
community
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
26
8
0
Updated 5 days ago
public
pipeline-paddleocr
Pipeline for converting PDFs to raw text with PaddleOCR
Jupyter Notebook
21
6
6
Updated a month ago
public
irs-manual-demo
Python
14
7
1
Updated a year ago
public
pipeline-template
Python
8
8
4
Updated a year ago