Cross-modal alignment with synthetic caption for text-based person search

Publication type:

Journal article

Authors:

Weichen Zhao;Yuxing Lu;Zhiyuan Liu;Yuan Yang;Ge Jiao*

Corresponding author:

Ge Jiao

Author affiliations:

[Yuan Yang; Ge Jiao] College of Computer Science and Technology, Hengyang Normal University, Hengyang, China

[Zhiyuan Liu] School of Computer Science and Technology, Soochow University, Suzhou, China

[Yuxing Lu] College of Future Technology, Peking University, Beijing, China

[Weichen Zhao] College of Computer Science and Technology, Hengyang Normal University, Hengyang, China; School of Computer Science and Technology, Soochow University, Suzhou, China

Corresponding author affiliation:

[Ge Jiao] College of Computer Science and Technology, Hengyang Normal University, Hengyang, China

Language:

English

Keywords:

Text-based person search;Cross-modal retrieval;Cross-modal alignment;Synthetic caption

Journal:

International Journal of Multimedia Information Retrieval

ISSN:

2192-6611

Year:

2025

Volume:

Issue:

Pages:

1-13

DOI:

10.1007/s13735-025-00356-w

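Institutional attribution:

This institution is both the first and the corresponding affiliation

Department:

College of Computer Science and Technology

Abstract: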

Text-based person search aims to retrieve a target person from a large gallery based on a natural language description. Existing methods treat it as either a one-to-one or a many-to-many embedding matching problem. The former relies on the assumption that text and images are strongly aligned, while the latter inevitably leads to issues of intra-class variation. Rather than being confined to these two approaches, we propose a new strategy, named CASC, that achieves cross-modal alignment with synthetic captions for joint image-text-caption optimization. The core of this strat...
