
Concentrated Reasoning and Unified Reconstruction for Multi-Modal Media Manipulation

Type:
Journal article, conference paper
Authors:
Zhao, Weichen;Lu, Yuxing;Jiao, Ge;Yang, Yuan
Corresponding author:
Jiao, G
Author affiliations:
[Jiao, Ge; Yang, Yuan; Zhao, Weichen] Hengyang Normal Univ, Hengyang, Peoples R China.
[Lu, Yuxing] Peking Univ, Beijing, Peoples R China.
Corresponding institution:
[Jiao, G] Hengyang Normal Univ, Hengyang, Peoples R China.
Language:
English
Keywords:
Multi-Modal Media Manipulation;Deep Fake;Detection;Mask Signal Modeling
Journal:
IEEE International Conference on Acoustics, Speech and Signal Processing. Proceedings
ISSN:
1520-6149
Year:
2024
Pages:
8190-8194
Conference name:
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Proceedings title:
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Conference date:
14 April 2024
Conference location:
Seoul, Korea, Republic of
Place of publication:
345 E 47TH ST, NEW YORK, NY 10017 USA
Publisher:
IEEE
ISBN:
979-8-3503-4486-8
Institutional attribution:
This institution is both the first and the corresponding affiliation.
Abstract:
Detecting and Grounding Multi-Modal Media Manipulation (DGM⁴) is an emerging task that aims to identify and locate manipulated elements in both textual and visual media. Given the complexity of this task, the model requires more sophisticated reasoning capabilities to align multi-modal features and capture forgery traces. To this end, we propose a Concentrated reasoning and Unified reconstruction framework (CrUr) for DGM⁴. Instead of adhering to traditional hierarchical reasoning paradigms, we directly carry out all inference tasks using integrated multi-modal features. Specifically, we ex...
