Skip to content

Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision"

License

Notifications You must be signed in to change notification settings

TIGER-AI-Lab/OmniEdit

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 

Repository files navigation

OmniEdit

Official Repo for OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision

In this paper, we present OMNI-EDIT, which is an omnipotent editor to handle seven different image editing tasks with any aspect ratio seamlessly. Our contribution is in four folds: (1) OMNI-EDIT is trained by utilizing the supervision from seven different specialist models to ensure task coverage. (2) we utilize importance sampling based on the scores provided by large multimodal models (like GPT-4o) instead of CLIP-score to improve the data quality.

📃Paper | 🌐Website | 💻Github | 📚Dataset

Data Pipeline

We synthesize the large scale dataset through specialist distillation. Our synthesis pipeline is depicted in

Our released version contains 1.2M pairs covering seven different skills like addition, swaping, removal, attribute modification, background change, environment change and sytle transfer. The dataset has been filtered with VIEScore.

Comparison with Others

Our dataset has the most diverse, highest-quality image editing pairs of any resolution.

Citation

If you find our paper useful, please cite us with

@article{wei2024omniedit,
  title={OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision},
  author={Wei, Cong and Xiong, Zheyang and Ren, Weiming and Du, Xinrun and Zhang, Ge and Chen, Wenhu},
  journal={arXiv preprint arXiv:2411.07199},
  year={2024}
}

About

Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision"

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published