版权说明 操作指南
首页 > 成果 > 详情

Cross-modal alignment with synthetic caption for text-based person search

认领
导出
Link by DOI
反馈
分享
QQ微信 微博
成果类型:
期刊论文
作者:
Weichen Zhao;Yuxing Lu;Zhiyuan Liu;Yuan Yang;Ge Jiao*
通讯作者:
Ge Jiao
作者机构:
[Yuan Yang; Ge Jiao] College of Computer Science and Technology, Hengyang Normal University, Hengyang, China
[Zhiyuan Liu] School of Computer Science and Technology, Soochow University, Suzhou, China
[Yuxing Lu] College of Future Technology, Peking University, Beijing, China
[Weichen Zhao] College of Computer Science and Technology, Hengyang Normal University, Hengyang, China<&wdkj&>School of Computer Science and Technology, Soochow University, Suzhou, China
通讯机构:
[Ge Jiao] C
College of Computer Science and Technology, Hengyang Normal University, Hengyang, China
语种:
英文
关键词:
Text-based person search;Cross-modal retrieval;Cross-modal alignment;Synthetic caption
期刊:
International Journal of Multimedia Information Retrieval
ISSN:
2192-6611
年:
2025
卷:
14
期:
2
页码:
1-13
机构署名:
本校为第一且通讯机构
院系归属:
计算机科学与技术学院
摘要:
Text-based person search aims to retrieve target person from a large gallery based on natural language description. Existing methods take it as one-to-one embedding or many-to-many embedding matching problem. The former approach relies on the assumption of the existence of strong alignment between text and images, while the latter inevitably leads to issues of intra-class variation. Rather than being confined to these two approaches, we propose a new strategy that achieves cross-modal alignment with synthetic caption for joint image-text-caption optimization, named CASC. The core of this strat...

反馈

验证码:
看不清楚,换一个
确定
取消

成果认领

标题:
用户 作者 通讯作者
请选择
请选择
确定
取消

提示

该栏目需要登录且有访问权限才可以访问

如果您有访问权限,请直接 登录访问

如果您没有访问权限,请联系管理员申请开通

管理员联系邮箱:yun@hnwdkj.com