Visual Diary Examples Model

[NeurIPS 2025] ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models

ChartMuseum is a chart question answering benchmark designed to evaluate reasoning capabilities of large vision-language models (LVLMs) over real-world chart images. The benchmark consists of 1162 ...

GitHub

NASA-IMPACT/Prithvi-EO-2.0

Prithvi-EO-2.0 is based on the ViT architecture, pretrained using a masked autoencoder (MAE) approach, with two major modifications as shown in the figure below. Second, we considered geolocation ...

IEEE

Multiscale Feature-Guided Adversarial Examples Quality Assessment via Hierarchical Perception of Human Visual System

Abstract: Deep neural networks (DNNs) reveal significant robustness deficiencies due to their susceptibility to being misled by small and imperceptible adversarial examples, thus it is crucial to ...

IEEE

Idea Visual: Intent-Driven View Synthesis for Smart Mobile Devices via Retrieval-Augmented Diffusion Models

Abstract: Although the generative novel view synthesis frameworks have already achieved the generation of target views from specific viewpoints, they still rely on either direct or indirect input of ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results