Reference Additive Scene in Unity

Measuring Image-Relation Alignment: Reference-Free Evaluation of VLMs and Synthetic Pre-Training for Open-Vocabulary Scene Graph Generation

Abstract: Scene Graph Generation (SGG) encodes visual relationships between objects in images as graph structures. Thanks to advances in Vision-Language Models (VLMs), the task of Open-Vocabulary ...
