降重: 下面结合附图对本申请作进一步描述。以下实施例仅用于更加清楚地说明本发明 的技术方案而不能以此来限制本申请的保护范围。一般来说模态是指事物发生或存在的方式多模态是指两个或者两个以上的模 态的各种形式的组合。对每一种信息的来源或者形式都可以称为一种模态目前研究领域 中主要是对图像文本语音三种模态的处理。之所以要对模态进行融合是因为不同模态 的表现方式不一样看待事物的角度也会不一样如果能合理的处
Translation:
In general, modality refers to the way in which things happen or exist, and multimodality refers to the combination of various forms of two or more modalities. Each source or form of information can be called a modality, and currently, the main research fields focus on the processing of three modalities: images, text, and speech. The reason for modal fusion is that different modalities have different ways of expression, and the perspective of viewing things is also different. If multi-modal information can be processed reasonably, rich feature information can be obtained. For this application, obtaining multi-modal fusion information is more helpful in improving the accuracy of postoperative cancer prediction, without the need for patients to undergo painful live tissue collection and pathological slicing. The innovation of this application is that, unlike existing technologies that use images, text, and speech as inputs, this application uses CT images, clinical information, and genetic data as inputs for multimodal fusion. A memory fusion network, a network structure that can simultaneously capture interactions in time and between modalities, is used to obtain better multi-view fusion. This application improves the accuracy of postoperative cancer prediction.
原文地址: https://www.cveoy.top/t/topic/b2vQ 著作权归作者所有。请勿转载和采集!