Skip to content

🔥[Information Fusion 2024, Official Code] for paper "Prompt-guided image color aesthetics assessment: Models, datasets and benchmarks". Official Weights and Demos provided. 首个多因素色彩美学评估数据集、算法和benchmark.

Notifications You must be signed in to change notification settings

woshidandan/Prompt-DeT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

37 Commits
 
 
 
 
 
 
 
 

Repository files navigation

License Framework

Rethinking Image Color Aesthetics Assessment: Models, Datasets and Benchmarks

Shuai He, Yi Xiao, Anlong Ming, Huadong Ma

Beijing University of Posts and Telecommunications

This repo contains the official implementation and the new dataset ICAA20K of the Information Fusion 2024 paper.

Introduction

  • Brief Version: Impressive zero-shot and fine-tuning capabilities on sub-attribute ICAA tasks, and even supports user-customized scenarios.

  • DR Version: Image color aesthetics assessment (ICAA) aims to assess color aesthetics based on human perception, which is crucial for various applications such as imaging measurement and image analysis. The ceiling of previous methods is constrained to a holistic evaluation approach, which hinders their ability to offer explainability from multiple perspectives. Moreover, existing ICAA datasets often lack multi-attribute annotations beyond holistic scores, which are necessary to provide effective supervision for training or validating models' multi-perspective assessment capabilities, thereby hindering their capacity for effective generalization.To advance ICAA research:

    • We propose an ``all-in-one'' model called the Prompt-Guided Delegate Transformer (Prompt-DeT). Prompt-DeT utilizes dedicated prompt strategies and an Aesthetic Adapter (Aes-Adapter), to exploit the rich visual language prior embedded in large pre-trained vision-language models. It enhances the model's perception of multiple attributes, enabling impressive zero-shot and fine-tuning capabilities on sub-attribute tasks, and even supports user-customized scenarios.
    • We elaborately construct a color-oriented dataset, ICAA20K, containing 20K images and 6 annotated dimensions to support both holistic and sub-attribute ICAA tasks.
    • We develop a comprehensive benchmark comprising of 17 methods, which is the most extensive to date, based on four datasets (ICAA20K, ICAA17K, SPAQ, and PARA) for evaluating the holistic and sub-attribute performance of ICAA methods. Our work, not only achieves state-of-the-art (SOTA) performance, but also offers the community a roadmap to explore solutions for ICAA.
Image text

ICAA20K Dataset

To enhance the ICAA17K+ dataset, we have incorporated 2,000 detailed labels concerning color attributes including colorfulness, harmony, and temperature annotations. The dataset can be downloaded from: ICAA20K Dataset.

Image text

Prompt-DeT

we propose the Prompt-Guided Delegate Transformer (Prompt-DeT). Firstly, it employs the proposed Aes-Adapter and contrastive learning to align attribute-related features with CLIP’s vast knowledge. Guided by our explainable attribute-aware prompts, it extracts attribute-related features to adapt to downstream tasks, and enhances the understanding of diverse aesthetic attributes. Secondly, Prompt-DeT simulates human behavior in color space segmentation and adaptively assigns different attention weights based on color importance. This approach further extracts explainable features to express the color information. The weight can be downloaded from: Prompt-DeT_Weights.

Image text

Performance and Benchmark

We develop a comprehensive benchmark comprising of 17 methods, which is the most extensive to date, based on three datasets (ICAA20K, ICAA17K, SPAQ, and PARA) for evaluating the holistic and sub-attribute performance of ICAA methods. Our work achieves state-of-the-art (SOTA) performance on all benchmarks.

Image text Image text Image text Image text

Requirement

einops==0.8.0
ftfy==6.3.1
matplotlib==3.8.0
numpy==2.1.3
pandas==2.2.3
Pillow==9.0.1
Pillow==11.0.0
pytorch_model_summary==0.1.2
PyYAML==6.0.1
PyYAML==6.0.2
regex==2023.8.8
Requests==2.32.3
scikit_learn==1.3.1
scipy==1.14.1
seaborn==0.13.2
setuptools==68.0.0
setuptools==74.1.2
tensorboardX==2.6.2.2
tensorboardX==2.6.2.2
thop==0.1.1.post2209072238
timm==0.9.7
torch==2.0.1
torchvision==0.15.2
tqdm==4.65.2
yacs==0.1.8

If you find our work is useful, pleaes cite our paper:

@article{he2025prompt,
  title={Prompt-guided image color aesthetics assessment: Models, datasets and benchmarks},
  author={He, Shuai and Xiao, Yi and Ming, Anlong and Ma, Huadong},
  journal={Information Fusion},
  volume={114},
  pages={102706},
  year={2025},
  publisher={Elsevier}
}

Related Work from Our Group

🎁 Projects 📚 Publication 🌈 Content ⭐ Stars
Pixel-level image exposure assessment【首个像素级曝光评估】 NIPS 2024 Code, Dataset Stars
Long-tail solution for image aesthetics assessment【美学评估数据不平衡解决方案】 ICML 2024 Code Stars
CLIP-based image aesthetics assessment【基于CLIP多因素色彩美学评估】 Information Fusion 2024 Code, Dataset Stars
Compare-based image aesthetics assessment【基于对比学习的多因素美学评估】 ACMMM 2024 Code Stars
Image color aesthetics assessment【首个色彩美学评估】 ICCV 2023 Code, Dataset Stars
Image aesthetics assessment【通用美学评估】 ACMMM 2023 Code Stars
Theme-oriented image aesthetics assessment【首个多主题美学评估】 IJCAI 2022 Code, Dataset Stars
Select prompt based on image aesthetics assessment【基于美学评估的提示词筛选】 IJCAI 2024 Code Stars
Motion rhythm synchronization with beats【动作与韵律对齐】 IJCAI 2024 Code, Dataset Stars
Champion Solution for AIGC Image Quality Assessment【NTIRE AIGC图像质量评估赛道冠军】 CVPRW NTIRE 2024 Code Stars

About

🔥[Information Fusion 2024, Official Code] for paper "Prompt-guided image color aesthetics assessment: Models, datasets and benchmarks". Official Weights and Demos provided. 首个多因素色彩美学评估数据集、算法和benchmark.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages