Zhang Han
Zhang124
AI & ML interests
None yet
Organizations
None yet
Multimodal Image Classification
What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models
Paper • 2405.15668 • Published
On Large Multimodal Models as Open-World Image Classifiers
Paper • 2503.21851 • Published • 8
Benchmarking Large Language Models for Image Classification of Marine Mammals
Paper • 2410.19848 • Published
Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding
Paper • 2501.07783 • Published • 8
image Transformer
models 0
None public yet
datasets 0
None public yet