(48): MAF: Multimodal Alignment Framework for Weakly-Supervised Phrase Grounding

Reposted from blog.csdn.net/qq_37486501/article/details/119750622