Here, V represents the training set images in the target domain, where N is the total number of images included in the target domain training set. For each image, its visual feature vector is generated by the initialized visual model and denoted as v. The affinity between any pair of images is calculated as follows:

帮我把下面这段话按照行人重识别领域的论文的风格翻译成英文:在这里V表示目标域中的训练集图像其中N是目标域的训练集中包含的图像的总数。对于每个图像其视觉特征向量由初始化的视觉模型生成记为v。任意一对图像之间的亲和性计算如下:

原文地址: https://www.cveoy.top/t/topic/craB 著作权归作者所有。请勿转载和采集!

免费AI点我,无需注册和登录