分类 - multi-modal
2023
Contrastive Learning based Vision-Language Pre-Training
Contrastive Learning based Vision-Language Pre-Training
Image2Poem
Image2Poem