GitXplorerGitXplorer
m

UDOP

public
242 stars
6 forks
3 issues

Commits

List of commits on branch main.
Verified
08d7aab346524eda997551ef88dc26833785fbfd

Update README.md

zzinengtang committed 2 years ago
Verified
13d498c6344f4bd7e91caa7b4933233f56cc93f7

Update README.md

zzinengtang committed 2 years ago
Verified
b4f60536cacca522380b82879fc2c31d3c723b08

Update README.md

zzinengtang committed 2 years ago
Verified
0c3013585c4d7546ace23d1747985e9a1f788ff4

Update README.md

zzinengtang committed 2 years ago
Verified
66ece84a54dbfb89a24a3d0b188018882a2a0649

Merge pull request #2 from EIFY/patch-1

zzinengtang committed 2 years ago
Verified
05c10b57fdb7c7c6f97f9ceecfd39ea3be5481d5

s/Comming/Coming/

EEIFY committed 2 years ago

README

The README file for this repository.

Zineng Tang, Ziyi Yang, Guoxin Wang, Yuwei Fang, Yang Liu, Chenguang Zhu, Michael Zeng, Cha Zhang, Mohit Bansal

Code Release Here

Code is rehosted at part of the i-code project

Open Source Checklist:

  • [x] Release Model (Encoder + Text decoder)
  • [ ] Release Most Scripts
  • [ ] Vision Decoder / Weights (Due to fake document generation ethical consideration, we plan to release this functionality as an Azure API)
  • [ ] Demos

Introduction

UDOP unifies vision, text, and layout through vision-text-layout Transformer and unified generative pretraining tasks including vision task, text task, layout task, and mixed task. We show the task prompts (left) and task targets (right) for all self-supervised objectives (joint text-layout reconstruction, visual text recognition, layout modeling, and masked autoencoding) and two example supervised objectives (question answering and layout analysis).