OmniEdit

Official Repo for OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision

In this paper, we present OMNI-EDIT, which is an omnipotent editor to handle seven different image editing tasks with any aspect ratio seamlessly. Our contribution is in four folds: (1) OMNI-EDIT is trained by utilizing the supervision from seven different specialist models to ensure task coverage. (2) we utilize importance sampling based on the scores provided by large multimodal models (like GPT-4o) instead of CLIP-score to improve the data quality.

📃Paper | 🌐Website | 💻Github | 📚Dataset

Data Pipeline

We synthesize the large scale dataset through specialist distillation. Our synthesis pipeline is depicted in

Our released version contains 1.2M pairs covering seven different skills like addition, swaping, removal, attribute modification, background change, environment change and sytle transfer. The dataset has been filtered with VIEScore.

Comparison with Others

Our dataset has the most diverse, highest-quality image editing pairs of any resolution.

Citation

If you find our paper useful, please cite us with

@article{wei2024omniedit,
  title={OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision},
  author={Wei, Cong and Xiong, Zheyang and Ren, Weiming and Du, Xinrun and Zhang, Ge and Chen, Wenhu},
  journal={arXiv preprint arXiv:2411.07199},
  year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OmniEdit

Data Pipeline

Comparison with Others

Citation

About

Releases

Packages

Contributors 2

License

TIGER-AI-Lab/OmniEdit

Folders and files

Latest commit

History

Repository files navigation

OmniEdit

Data Pipeline

Comparison with Others

Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Packages