ACM Multimedia 2020 - Paper List (2023)

38 - ShapeCaptioner: Generative Caption Network for 3D Shapes by Learning a Mapping from Parts Detected in Multiple Views to Sentences

Zhizhong Han (University of Maryland, College Park); Chao Chen (Tsinghua University); Yu-Shen Liu (Tsinghua University)*; Matthias Zwicker (University of Maryland)

53 - Image Inpainting Based on Multi-frequency Probabilistic Inference Model

Jin Wang (Beijing University of Technology)*; Chen Wang (Beijing University of Technology); Qingming Huang (University of Chinese Academy of Sciences); Yunhui Shi (Beijing University of Technology); Jian-Feng Cai (The Hong Kong University of Science and Technology); Qing Zhu (Beijing University of Technology); Baocai Yin (Beijing University of Technology)

60 - Learning from the Past: Meta-Continual Learning with Knowledge Embedding for Jointly Sketch, Cartoon, and Caricature Face Recognition

Wenbo Zheng (School of Software Engineering, Xi'an Jiaotong University); Lan Yan (The State Key Laboratory for Management and Control of Complex Systems, Institute of Automation, Chinese Academy of Sciences); Chao Gou (School of Intelligent Systems Engineering, Sun Yat-sen University)*; Fei-Yue Wang (The State Key Laboratory for Management and Control of Complex Systems, Institute of Automation, Chinese Academy of Sciences)

63 - Dual Adversarial Network for Unsupervised Ground/Satellite-to-Aerial Scene Adaptation

jianzhe peter lin (University of British Columbia)*; Lichao Mou (DLR&TUM); tianze yu (University of British Columbia); Xiaoxiang Zhu (Technical University of Munich (TUM); German Aerospace Center (DLR)); Z. Jane Wang (University of British Columbia)

68 - Scene-Aware Background Music Synthesis

Yujia Wang (Beijing Institute of Technology)*; Wei Liang (Beijing Institute of Technology); Wanwan Li (George Mason University); Dingzeyu Li (Adobe Research); Lap-Fai Yu (George Mason University)

75 - Adversarial Bipartite Graph Learning for Video Domain Adaptation

Yadan Luo (University of Queensland)*; Zi Huang (University of Queensland); Zijian Wang (University of Queensland); Zheng Zhang (Harbin Institute of Technology, Shenzhen); Mahsa Baktashmotlagh (University of Queensland)

111 - Domain Adaptive Person Re-Identification via Coupling Optimization

Xiaobin Liu (Peking University); Shiliang Zhang (Peking University)*

118 - Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge

Peng Wang (Northwestern Polytechnical University); Dongyang Liu (Northwestern Polytechnical University); Hui Li (the University of Adelaide)*; Qi Wu (University of Adelaide)

142 - Controllable Video Captioning with an Exemplar Sentence

Yitian Yuan (Tsinghua University)*; Lin Ma (Tencent AI Lab); Jingwen Wang (Tencent AI Lab); Wenwu Zhu (Tsinghua University)

143 - MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos

Guangyao Shen (Tsinghua University)*; Xin Wang (Tsinghua University); Xuguang Duan (Tsinghua University); Hongzhi Li (Microsoft Research); Wenwu Zhu (Tsinghua University)

162 - Single Image De-noising via Staged Memory Network

Weijiang Yu (SUN YAT-SEN UNIVERSITY)*; Jian Liang (Nanchang University); Lu Li (Zhejiang University); Nong Xiao (Sun Yat-sen University)

193 - Dual-Structure Disentangling Variational Generation for Data-Limited Face Parsing

Peipei Li ( Institute of Automation Chinese Academy of Sciences)*; Yinglu Liu (JD AI); Hailin Shi (JD AI); Xiang Wu (Reconova); Yibo Hu (Institute of Automation, Chinese Academy of Sciences); Ran He (Institute of Automation, Chinese Academy of Sciences); Zhenan Sun (Chinese of Academy of Sciences)

197 - A Human-Computer Duet System for Music Performance

Yuen-Jen Lin (Academia Sinica)*; Hsuan-Kai Kao (Academia Sinica); Yih-Chih Tseng (Academia Sinica); Ming Tsai (KoKo Lab); Li Su (Academia Sinica)

202 - Invisible: Federated Learning over Non-Informative Intermediate Updates against Multimedia Privacy Leakages

Qiushi Li (Tsinghua University)*; Wenwu Zhu (Tsinghua University); Chao Wu (Tsinghua University); xinglin pan (University of Electronic Science and Technology of China); Fan Yang (Tsinghua University); Yuezhi Zhou (Tsinghua University); Yaoxue Zhang (Tsinghua University)

221 - Every Moment Matters: Detail-Aware Networks to Bring a Blurry Image Alive

Kaihao Zhang (Australian National University)*; Wenhan Luo (Tencent AI Lab); Bjorn Stenger (Rakuten Institute of Technology); Wenqi Ren (Institute of Information Engineering, Chinese Academy of Sciences); Lin Ma (Tencent AI Lab); HONGDONG LI (Australian National University, Australia)

230 - Self-supervised Dance Video Synthesis Conditioned on Music

Xuanchi Ren (HKUST); Haoran Li (The Hong Kong University of Science and Technology); Zijian HUANG (the Hong Kong University of Science and Technology); Qifeng Chen (HKUST)*

232 - Co-Attentive Lifting for Infrared-Visible Person Re-Identification

Xing Wei (Xi'an Jiaotong University)*; Diangang Li (Xi'an Jiaotong University); Xiaopeng Hong (Xi'an Jiaotong University); Wei Ke (Xi'an Jiaotong University); Yihong Gong (Xi'an Jiaotong University)

291 - Dynamic GCN: Context-enriched Topology Learning for Skeleton-based Action Recognition

Fanfan Ye ( Hikvision Research Institute); Shiliang Pu (Hikvision Research Institute); Qiaoyong Zhong (Hikvision Research Institute)*; Chao Li (Hikvision Research Institute); Di Xie (Hikvision Research Institute); Huiming Tang (Zhejiang University)

304 - Boosting Visual Question Answering with Context-aware Knowledge Aggregation

Guohao Li (Tsinghua University)*; Xin Wang (Tsinghua University); Wenwu Zhu (Tsinghua University)

306 - Meta Parsing Networks: Towards Generalized Few-shot Scene Parsing with Adaptive Metric Learning

Peike Li (UTS)*; Yunchao Wei (University of Technology Sydney); Yi Yang (UTS)

312 - CODAN: Counting-driven Attention Network for Vehicle Detection in Congested Scenes

Wei Li (Southwest Jiaotong University); Zhenting Wang (Southwest Jiaotong University); Xiao Wu (Southwest Jiaotong University)*; Ji Zhang (Southwest Jiaotong University); Qiang Peng (Southwest Jiaotong University); Hongliang Li (University of Electronic Science and Technology of China)

352 - Modeling both Intra- and Inter-modal Influence for Real-Time Emotion Detection in Conversations

Dong Zhang (Soochow University)*; Weisheng Zhang (Soochow University); Shoushan Li (Soochow University); Zhu Qiaoming (Soochow University); Zhou Guodong (Soochow University)

355 - WIKI Food-500: A dataset for Large-Scale Food Recognition via Stacked Global-Local Attention Network

Weiqing Min (Institute of Computing Technology, Chinese Academy of Sciences)*; Linhu Liu (ICT); Zhiling Wang (Institute of Computing Technology, Chinese Academy of Sciences); Zhengdong Luo (University of Chinese Academy of Sciences); Xiaoming Wei (MeituanDianping group ); Xiaolin Wei (MeituanDianping group ); Shuqiang Jiang (ICT, China Academy of Science)

358 - Learning Image Classifier from Only Web Labels and Metadata: Automatic Label Correction through Graph

Jingkang Yang (Sensetime Research)*; Weirong Chen (SenseTime Research); Litong Feng (Sensetime Research); Xiaopeng Yan (SenseTime Research); Huabin Zheng (SenseTime Research); Wayne Zhang (SenseTime Research)

373 - Photo Stand-Out: Photography with Virtual Character

Yujia Wang (Beijing Institute of Technology)*; Sifan Hou (Beijing Institute of Technology); Wei Liang (Beijing Institute of Technology); Bing Ning (Beijing Institute of Fashion Technology)

378 - Accurate UAV Tracking with Distance-Injected Overlap Maximization

Chunhui Zhang (Chinese Academy of Sciences); Shiming Ge (Chinese Academy of Sciences)*; Kangkai Zhang (Chinese Academy of Sciences); Dan Zeng (Shanghai University)

383 - Context-Aware Multi-View Summarization Network for Image-Text Matching

Leigang Qu (Shandong University); Meng Liu (Shandong Jianzhu University); Da Cao (Hunan University); Liqiang Nie (Shandong University )*; Qi Tian (Huawei Cloud & AI)

391 - PiRhDy: Learning Pitch-, Rhythm-, and Dynamics-aware Embeddings for Symbolic Music

Hongru Liang (Nankai University); Wenqiang Lei (National University of Singapore)*; Paul Yaozhu Chan (A∗STAR); Zhenglu Yang (Nankai University); Maosong Sun (Tsinghua University); Tat-Seng Chua (National Univ. of Singapore)

395 - An Egocentric Action Anticipation Framework via Fusing Intuition and Analysis

Tianyu Zhang (ICT)*; Weiqing Min (Institute of Computing Technology, Chinese Academy of Sciences); Ying Zhu (University of Chinese Academy of Sciences); Yong Rui (Lenovo); Shuqiang Jiang (ICT, China Academy of Science)

436 - Label Embedding Online Hashing for Cross-Modal Retrieval

Yongxin Wang (Shandong University); Xin Luo (Shandong University); Xin-Shun Xu (Shandong University)*

444 - Cloze Test Helps: Effective Video Anomaly Detection via Learning to Complete Video Events

Guang Yu (National University of Defense Technology)*; Siqi Wang (National University of Defense Technology); Zhiping Cai (NUDT); En Zhu (National University of Defense Technology); Chuanfu Xu (National University of Defense Technology); Jianping Yin (National University of Defense Technology); Marius Kloft (TU Kaiserslautern)

484 - CRSSC: Salvage Reusable Samples from Noisy Data for Robust Learning

Zeren Sun (Nanjing University of Science and Technology ); Xian-Sheng Hua (Alibaba Group); Yazhou Yao (Nanjing University of Science and Technology)*; Xiu-Shen Wei (Nanjing University of Science and Technology); Guosheng Hu (AnyVision); Jian Zhang (UTS)

519 - MMFL: Multimodal Fusion Learning for Text-Guided Image Inpainting

Qing Lin (Fudan University); Bo Yan (Fudan University)*; Jichun Li (Fudan University); Weimin Tan (Fudan University)

531 - Vision Meets Wireless Positioning: Effective Person Re-identification with Recurrent Context Propagation

Yiheng Liu (University of Science and Technology of China)*; Wengang Zhou (University of Science and Technology of China); Mao Xi (University of Science and Technology of China); Sanjing Shen (University of Science and Technology of China); Houqiang Li (University of Science and Technology of China)

541 - Learning From Music to Visual Storytelling of Shots: A Deep Interactive Learning Mechanism

Jen-Chun Lin (Academia Sinica)*; Wen-Li Wei (Academia Sinica); Yen-Yu Lin (National Chiao Tung University); Tyng-Luh Liu (Academia Sinica); Hong-Yuan Mark Liao (Institute of Information Science, Academia Sinica, Taiwan)

553 - Asymmetric Deep Hashing for Efficient Hash Code Compression

Shu Zhao (Institute of Information Engineering, Chinese Academy of Sciences); Dayan Wu (Institute of Information Engineering, Chinese Academy of Sciences)*; Wanqian Zhang (Institute of Information Engineering, Chinese Academy of Sciences); Yu Zhou (Institute of Information Engineering, CAS); Bo Li ( Institute of Information Engineering, Chinese Academy of Sciences); Weiping Wang (Institute of Information Engineering, CAS, China)

588 - Quaternion-Based Knowledge Graph Network for Recommendation

Zhaopeng Li (State Key Laboratory of Information Security, Institute of Information Engineering, Chinese Academy of Sciences; University of Chinese Academy of Sciences)*; Qianqian Xu (Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences); Yangbangyan Jiang (Institute of Information Engineering, Chinese Academy of Sciences; University of Chinese Academy of Sciences); Xiaochun Cao (Chinese Academy of Sciences); Qingming Huang (University of Chinese Academy of Sciences)

601 - Multi-Person Action Recognition in Microwave Sensors

Diangang Li (Xi'an Jiaotong University); Jianquan Liu (NEC Corporation)*; Shoji Nishimura (NEC Corporation); Yuka Hayashi (NEC Corporation); Jun Suzuki (NEC Corporation); Yihong Gong (Xi'an Jiaotong University)

612 - Norm-in-Norm Loss with Faster Convergence and Better Performance for Image Quality Assessment

Dingquan Li (Peking University); Tingting Jiang (Peking University)*; Ming Jiang (Peking University)

639 - Coupling deep textural and shape features for sketch retrieval

Qi Jia (Dalian University of Technology); Xin Fan (Dalian University of Technology)*; Meiyu Yu (Didi Chuxing); Yuqing Liu (Dalian University of Technology); Dingrong Wang (Dalian University of Technology); Longin Jan Latecki (Temple University)

647 - Memory-Augmented Relation Network for Few-Shot Learning

He Jun (Hefei University of Technology)*; Richang Hong (Hefei University of Technology); Xueliang Liu (Hefei University of Technology); Mingliang Xu (Zhengzhou University); Zheng-Jun Zha (University of Science and Technology of China); Meng Wang (Hefei University of Technology)

668 - Performance Optimization of Federated Person Re-identification via Benchmark Analysis

Weiming Zhuang (Nanyang Technological University)*; Yonggang Wen (Nanyang Technological University); Xuesen Zhang (SenseTime); Xin Gan (SenseTime); Daiying Yin (SenseTime); Dongzhan Zhou (The University of Sydney); shuai zhang (Sensetime Ltd); Shuai Yi (SenseTime Group Limited)

691 - Guided Attention Network for Object Detection and Counting on Drones

CAI YuanQiang (UCAS); Dawei Du (University of Chinese Academy of Sciences); Libo Zhang (Institute of Software Chinese Academy of Sciences)*; Longyin Wen (JD Digit); Weiqiang Wang (University of Chinese Academy of Sciences); Yanjun Wu (Institute of Software Chinese Academy of Sciences ); Siwei Lyu (University at Albany)

696 - K-armed Bandit based Multi-Modal Network Architecture Search for Visual Question Answering

Yiyi Zhou (Xiamen University); Rongrong Ji (Xiamen University, China)*; Xiaoshuai Sun ( Xiamen University); Gen Luo (Xiamen University); Xiaopeng Hong (Xi'an Jiaotong University); Jinsong Su (Xiamen University); Xinghao Ding (Xiamen University); Ling Shao (Inception Institute of Artificial Intelligence)

701 - TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection

Fangfang Wang (Zhejiang University)*; Yifeng Chen (Zhejiang University); Fei Wu (Zhejiang University, China); Xi Li (Zhejiang University)

704 - Class-Aware Modality Mix and Center-Guided Metric Learning for Visible-Thermal Person Re-Identification

Yongguo Ling (Xiamen University)*; Zhun Zhong (University of Trento); Zhiming Luo (Xiamen University); Paolo Rota (University of Trento); Shaozi Li (Xiamen University, China); Nicu Sebe (University of Trento)

707 - Adversarial Graph Representation Adaptation for Cross-Domain Facial Expression Recognition

Yuan Xie (DarkMatter AI); Tianshui Chen (DarkMatter AI)*; Tao Pu (Sun Yat-sen University); Hefeng Wu (Sun Yat-sen University); Liang Lin (DarkMatter AI)

710 - Weakly Supervised Real-time Image Cropping based on Aesthetic Distributions

Peng Lu (Beijing University of Posts and Telecommunications)*; Jiahui Liu (Beijing University of Posts and Telecommunications); Xujun Peng (Information Sciences Institute, University of Southern California); Xiaojie Wang (Beijing University of Posts and Telecommunications)

732 - Towards Unsupervised Crowd Counting via Regression-Detection Bi-knowledge Transfer

Yuting Liu (Sichuan University)*; Zheng Wang (National Institute of Informatics); Miaojing Shi (King's College London); Shin'ichi Satoh (National Institute of Informatics); Qijun Zhao (Sichuan University); hongyu yang (sichuan university)

734 - KBGN: Knowledge-Bridge Graph Network for Adaptive Vision-Text Reasoning in Visual Dialogue

Xiaoze Jiang (Intelligent Computing & Machine Learning Lab, School of ASEE, Beihang University); Siyi Du (Intelligent Computing & Machine Learning Lab, School of ASEE, Beihang University); Zengchang Qin (Intelligent Computing & Machine Learning Lab, School of ASEE, Beihang University)*; Yajing Sun (Institute of Information Engineering,Chinese Academy of Sciences); Jing Yu ( Institute of Information Engineering,Chinese Academy of Sciences)

737 - Occluded Prohibited Items Detection: An X-ray Security Inspection Benchmark and De-occlusion Attention Module

Yanlu Wei (Beihang University); Renshuai Tao (Beihang University)*; Zhangjie Wu (Beihang University); Yuqing Ma (Beihang University); Libo Zhang (Institute of Software Chinese Academy of Sciences); Xianglong Liu (BUAA)

765 - Context-aware Attention Network for Predicting Image Aesthetic Subjectivity

Munan Xu (Shenzhen Graduate School, Peking University); Jia-Xing Zhong (School of Electronic and Computer Engineering, Peking University); Yurui Ren (Shenzhen Graduate School, Peking University); Shan Liu (Tencent America); Ge Li (SECE, Shenzhen Graduate School, Peking University)*

783 - PIDNet: An Efficient Network for Dynamic Pedestrian Intrusion Detection

Jingchen Sun (Zhejiang University); Jiming Chen (Zhejiang University); Tao Chen (Fudan University); jiayuan fan (Fudan University); Shibo He (Zhejiang University)*

787 - ChoreoNet: Torwards Music to Dance Synthesis with Choreographic Action Unit

Zijie Ye (Tsinghua University)*; Haozhe Wu (Tsinghua University); Jia Jia (Tsinghua University); Yaohua Bu (Tsinghua University); Wei Chen (Beijing Sougou Science and Technology Development Co., Ltd); Fanbo Meng (Sogou Corporation, Beijing, China); Yanfeng Wang ( Beijing Sougou Science and Technology Development Co., Ltd)

794 - Adversarial Video Moment Retrieval by Jointly Modeling Ranking and Localization

Da Cao (Hunan University)*; Yawen Zeng (Hunan University); Xiaochi Wei (Baidu Inc.); Liqiang Nie (Shandong University ); Richang Hong (Hefei University of Technology); Zheng Qin (Hunan University)

795 - Pose-native Network Architecture Search for Multi-person Human Pose Estimation

Qian Bao (AI Research of JD.com); Wu Liu (AI Research of JD.com)*; Jun Hong (AI Research of JD.com); Lingyu Duan (Peking University); Tao Mei (AI Research of JD.com)

814 - Cascade Grouped Attention Network for Referring Expression Segmentation

Gen Luo (Xiamen University); Rongrong Ji (Xiamen University, China)*; Yiyi Zhou (Xiamen University); Xiaoshuai Sun ( Xiamen University); Jinsong Su (Xiamen University); Chia-Wen Lin (National Tsing Hua University); Qi Tian (Huawei Cloud & AI)

816 - Temporally Guided Music-to-Body-Movement Generation

Hsuan-Kai Kao (Academia Sinica); Li Su (Academia Sinica)*

818 - Compositional Few-Shot Recognition with Primitive Discovery and Enhancing

Yixiong Zou (Peking University)*; Shanghang Zhang (UC Berkeley); Ke Chen (South China University of Technology); José M. F. Moura (Carnegie Mellon University); Yaowei Wang (PengCheng Laboratory); Yonghong Tian (Peking University)

830 - InteractGAN: Learning to Generate Human-Object Interaction

Chen Gao (Institute of Information Engineering, CAS)*; si liu (Beihang University); Defa Zhu (Institute of Information Engineering, CAS); Quan Liu (Beihang University); Jie Cao (Institute of Automation, Chinese Academy of Sciences); Haoqian He (Beihang University); Ran He (Institute of Automation, Chinese Academy of Sciences); Shuicheng Yan (YITU Tech)

867 - Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos

Jie Wu (Sun Yat-sen University)*; Guanbin Li (Sun Yat-sen University); Xiaoguang Han (Shenzhen Research Institute of Big Data, the Chinese University of Hong Kong (Shenzhen)); Liang Lin (DarkMatter AI)

876 - Traffic-Aware Multi-Camera Tracking of Vehicles Based on ReID and Camera Link Model

Hung-Min Hsu (UW)*; Yizhou Wang (University of Washington); Jenq-Neng Hwang (University of WA�)

893 - VONAS: Network Design in Visual Odometry using Neural Architecture Search

Xing Cai (Peking University); Lanqing Zhang (Peking University); Chengyuan Li (Peking University); Ge Li (SECE, Shenzhen Graduate School, Peking University); Thomas H Li (Advanced Institute of Information Technology, Peking University)*

921 - Category-specific Semantic Coherency Learning for Fine-grained Image Recognition

Shijie Wang (Dalian University of Technology); zhihui wang (Dalian University of Technology); Haojie Li (Dalian University of Technology)*; Wanli Ouyang (The University of Sydney)

961 - Poet: Product-oriented Video Captioner for E-commerce

Shengyu Zhang (Zhejiang University)*; Ziqi Tan (Zhejiang University); Jin Yu (Alibaba Group); Zhou Zhao (Zhejiang University); Kun Kuang (Zhejiang University); jie liu (Alibaba); Jingren Zhou (Alibaba Group); Hongxia Yang (Alibaba Group); Fei Wu (Zhejiang University, China)

976 - Beyond the Attention: Distinguish the Discriminative and Confusable Features For Fine-grained Image Classification

Xiruo Shi (Beijing University of Posts and Telecommunications ); Liutong Xu (Beijing University of Posts and Telecommunications); Pengfei Wang (School of Computer Science, Beijing University of Posts and Telecommunications); Yuanyuan Gao (Beihang Univeristy); Haifang Jian (Institute of Semiconductors, Chinese Academy of Sciences); Wu Liu (AI Research of JD.com)*

977 - BlockMix: Meta Regularization and Self-Calibrated Inference for Metric-Based Meta-Learning

Hao Tang (Nanjing University of Science and Technology); Zechao Li (Nanjing University of Science and Technology)*; Zhimao Peng (Nanjing University of Science and Technology); Jinhui Tang (Nanjing University of Science and Technology)

980 - Structural Semantic Adversarial Active Learning for Image Captioning

Beichen Zhang (University of Chinese Academy of Sciences)*; liang li (Institute of Computing Technology, Chinese Academy of Sciences); Li Su (University of Chinese Academy of Sciences); Shuhui Wang (VIPL,ICT,Chinese academic of science); Jincan Deng (Institute of Computing Technology, Chinese Academy of Sciences); Zheng-Jun Zha (University of Science and Technology of China); Qingming Huang (University of Chinese Academy of Sciences)

988 - Scene-Aware Context Reasoning for Unsupervised Abnormal Event Detection in Videos

Che Sun (Beijing Institute of Technology); Yunde Jia (Beijing Institute of Technology); Yao Hu (Alibaba Youku Cognitive and Intelligent Lab); Yuwei WU (Beijing Institute of Technology (BIT), China)*

1002 - Active Object Search

Jie Wu (Sun Yat-sen University)*; Tianshui Chen (DarkMatter AI); Lishan Huang (Sun Yat-Sen University); Hefeng Wu (Sun Yat-sen University); Guanbin Li (Sun Yat-sen University); Ling Tian (University of Electronic Science and Technology of China); Liang Lin (DarkMatter AI)

1009 - Deep-Modal: Real-Time Impact Sound Synthesis for Arbitrary Shapes

Xutong Jin (Peking University); Sheng Li (Peking University)*; Tianshu Qu (Peking University); Dinesh Manocha (UMD); Guoping Wang (Peking University)

1011 - Fine-grained Feature Alignment with Part Perspective Transformation for Vehicle ReID

Dechao Meng (vipl,ict,Chinese academic of science)*; Liang Li (Chinese Academy of Sciences); Shuhui Wang (VIPL,ICT,Chinese academic of science); Xingyu Gao (Chinese Academy of Sciences); Zheng-Jun Zha (University of Science and Technology of China); Qingming Huang (University of Chinese Academy of Sciences)

1035 - Transformer-based Label Set Generation for Multi-modal Multi-label Emotion Detection

xincheng Ju (Soochow University)*; Dong Zhang (Soochow University); Junhui Li (Soochow University); Zhou Guodong (Soochow University)

1038 - Beyond the Parts: Learning Multi-view Cross-part Correlation for Vehicle Re-identification

Xinchen Liu (AI Research of JD.com); Wu Liu (AI Research of JD.com)*; Jinkai Zheng (Hangzhou Dianzi University); Chenggang Yan (Hangzhou Dianzi University); Tao Mei (AI Research of JD.com)

1064 - Look, Read and Feel: Benchmarking Ads Understanding with Multimodal Multitask Learning

Huaizheng Zhang (Nanyang Technological University)*; YONG LUO (Nanyang Technological University); Qiming Ai (Nanyang Technological University); Han Hu (Beijing Institute of Technology, China); Yonggang Wen (Nanyang Technological University)

1075 - Light Field Super-resolution via Attention-Guided Fusion of Hybrid Lenses

Jing Jin (City University of Hong Kong); Junhui Hou (City University of Hong Kong, Hong Kong)*; Jie Chen (Hong Kong Baptist University); Sam Kwong (City Univeristy of Hong Kong); Jingyi Yu (Shanghai Tech University)

1147 - Compact Bilinear Augmented Query Structured Attention for Sport Highlights Classification

Yanbin Hao (City University of Hong Kong); Hao Zhang (City University of Hong Kong)*; Chong-Wah Ngo (City University of Hong Kong); Qiang Liu (DeepAIT (Hong Kong) Limited); Xiaojun Hu (DeepAIT (Hong Kong) Limited)

1195 - Semantic Image Analogy with a Conditional Single-Image GAN

Jiacheng Li (University of Science and Technology of China); Zhiwei Xiong (University of Science and Technology of China)*; Dong Liu (University of Science and Technology of China); Xuejin Chen (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China)

1196 - Trajectory Prediction in Heterogeneous Environment via Attended Ecology Embedding

Wei-Cheng Lai (National Chiao Tung University); Zi-Xiang Xia (National Chiao Tung University); Hao-Siang Lin (National Chiao Tung University); Lien-Feng Hsu (National Chiao Tung University); Hong-Han Shuai (National Chiao Tung University); I-Hong Jhuo (IBM); Wen-Huang Cheng (EE, NCTU)*

1214 - A Structured Graph Attention Network for Vehicle Re-Identification

Yangchun Zhu (University of Science and Technology of China)*; Zheng-Jun Zha (University of Science and Technology of China); Tianzhu Zhang (University of Science and Technology of China); Jiawei Liu (University of Science and Technology of China); Jiebo Luo (U. Rochester)

1224 - Scoring High: Analysis and Prediction of Viewer Behavior and Engagement in the Context of 2018 FIFA WC Live Streaming

Nikolas Wehner (University of Würzburg)*; Michael Seufert (University of Würzburg); Sebastian Egger-Lampl (AIT Austrian Institute of Technology GmbH); Bruno Gardlo (AIT Austrian Institute of Technology GmbH); Pedro Casas (AIT Austrian Institute of Technology GmbH); Raimund Schatz (AIT)

1275 - Text-Guided Neural Image Inpainting

Lisai Zhang (Harbin Institute of Technology, Shenzhen)*; Qingcai Chen ( Harbin Institute of Technology, Shenzhen); Baotian Hu (Harbin Institute of Technology, Shenzhen); Shuoran Jiang (Harbin Institute of Technology, Shenzhen)

1319 - Weakly-supervised Image Hashing through Masked Visual Semantic Graph Reasoning

Lu Jin (Nanjing University of Science and Technology ); Zechao Li (Nanjing University of Science and Technology)*; Yonghua Pan (Nanjing University of Science and Technology); Jinhui Tang (Nanjing University of Science and Technology)

1344 - Semantic Consistency Guided Instance Feature Alignment for 2D Image-Based 3D Shape Retrieval

Heyu Zhou (Tianjin University, China); Weizhi Nie (Tianjin University)*; Dan Song (Tianjin University); Nian Hu (Tianjin University); Xuanya Li (Baidu); An-An Liu (Tianjin University)

1347 - Performance over Random: A robust evaluation protocol for video summarization methods

Evlampios Apostolidis (QMUL & CERTH-ITI)*; Eleni Adamantidou (CERTH); Alexandros I Metsai (CERTH-ITI); Vasileios Mezaris (Information Technologies Institute, Centre for Research and Technology Hellas, Greece); Ioannis Patras (Queen Mary University of London)

1355 - ARSketch: Sketch-Based User Interface for Augmented Reality Glasses

Zhaohui Zhang (Rokid); Haichao Zhu (The Chinese University of Hong Kong)*; Qian Zhang (California University, Los Angeles)

1367 - Text-Embedded Bilinear Model for Fine-Grained Visual Recognition

Liang Sun (University of Electronic Science and Technology of China); Xiang Guan (University of Electronic Science and Technology of China); Yang Yang (University of Electronic Science and Technology of China)*; Lei Zhang (Chongqing University)

1384 - Learning Scales from Points: A Scale-aware Probabilistic Model for Crowd Counting

Zhiheng Ma (Xi'an Jiaotong University)*; Xing Wei (Xi'an Jiaotong University); Xiaopeng Hong (Xi'an Jiaotong University); Yihong Gong (Xi'an Jiaotong University)

1394 - Learning Global Structure Consistency for Robust Object Tracking

Bi Li (Huazhong University of Science and Technology); Chengquan Zhang (Baidu Inc); Zhibin Hong (Baidu Inc.); Xu Tang (Baidu); jingtuo liu (baidu); Junyu Han (Baidu Inc.); Errui Ding (Baidu Inc.); Wenyu Liu (Huazhong University of Science and Technology)*

1399 - RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization

Niluthpol c Mithun (SRI International)*; Karan Sikka (SRI International); Han-Pang Chiu (SRI International); Supun Samarasekera (SRI International); Rakesh Kumar (SRI International)

1418 - Multimodal Representation with Embedded Visual Guiding Objects for Named Entity Recognition in Social Media Posts

Zhiwei Wu (School of Software Engineering, South China University of Technology); Changmeng Zheng (South China University of Technology); Yi Cai (School of Software Engineering, South China University of Technology)*; Junying Chen (South China University of Technology); Ho-fung Leung (The Chinese University of Hong Kong); Qing Li (The Hong Kong Polytechnic University)

1453 - Contextual Multi-Scale Feature Learning for Person Re-Identification

Baoyu Fan (Inspur Electronic Information Industry Co.,Ltd.); Li Wang (inspur)*; Runze Zhang (Inspur Electronic Information Industry Co.,Ltd.); Zhenhua Guo (Inspur Electronic Information Industry Co.,Ltd.); Yaqian Zhao (Inspur); Rengang Li (Inspur); Weifeng Gong ( Inspur Electronic Information Industry Co.,Ltd.)

1456 - Campus3D: A Photogrammetry Point Cloud Benchmark for Hierarchical Understanding of Outdoor Scene

Xinke Li (National University of Singapore); Chongshou Li (National University of Singapore)*; Zekun Tong (National University of Singapore); Andrew Lim (National University of Singapore); Junsong Yuan ("State University of New York at Buffalo, USA"); Yuwei Wu (National University of Singapore); Jing Tang (National University of Singapore); Raymond Huang (National University of Singapore)

1473 - Space-Time Video Super-Resolution using Temporal Profiles

Zeyu Xiao (University of Science and Technology of China); Zhiwei Xiong (University of Science and Technology of China)*; Xueyang Fu (University of Science and Technology of China); Dong Liu (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China)

1493 - Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions

Yu-Siang Huang (Academia Sinica)*; Yi-Hsuan Yang (Academia Sinica)

1541 - MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis

Devamanyu Hazarika (NUS, Singapore)*; Roger Zimmermann (NUS); Soujanya Poria (Singapore University of Technology and Design)

1549 - Instability of Successive Deep Image Compression

Jun-Hyuk Kim (Yonsei University); Soobeom Jang (Yonsei University); Jun-Ho Choi (Yonsei University); Jong-Seok Lee ("Yonsei University, Korea")*

1570 - DeepFacePencil: Creating Face Images from Freehand Sketches

Yuhang Li (University of Science and Technology of China); Xuejin Chen (University of Science and Technology of China)*; Binxin Yang (University of Science and Technology of China); Zihan Chen (University of Science and Technology of China); Zhihua Cheng (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China)

1576 - ALANET: Adaptive Latent Attention Network for Joint Video Deblurring and Interpolation

Akash Gupta (University of California, Riverside)*; Abhishek Aich (University of California, Riverside); Amit K. Roy-Chowdhury (University of California, Riverside)

1595 - CM-BERT: Cross-Modal BERT for Text-Audio Sentiment Analysis

Kaicheng Yang (Hebei University Of Science and Technology); Hua Xu (State Key Laboratory of Intelligent Technology and Systems, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China)*; kai gao (Hebei University Of Science and Technology)

1598 - Single-Shot Two-Pronged Detector with Rectified IoU Loss

Keyang Wang (chongqing university); Lei Zhang (Chongqing University)*

1612 - Object-level Attention for Aesthetic Rating Distribution Prediction

Jingwen Hou (Nanyang Technological University)*; Sheng Yang (Nanyang Technological University); Weisi Lin (Nanyang Technological University, Singapore)

1633 - Not made for each other - Audio-Visual Dissonance-based Deepfake Detection and Localization

Komal Chugh (Indian Institute of Technology Ropar); Parul Gupta (Indian Institute of Technology Ropar); Abhinav Dhall (Monash University)*; Ramanathan Subramanian (Indian Institute of Technology Ropar)

1656 - Make your favorite music curative: music style transfer for anxiety reduction

Zhejing Hu (The Hong Kong Polytechnic University); Yan Liu (The Hong Kong Polytechnic University)*; Gong Chen (The Hong Kong Polytechnic University); Sheng-hua Zhong (Shenzhen University); Aiwei Zhang (St. Paul’s Co-educational College)

1685 - Hearing like Seeing: Improving Voice-Face Interactions and Associations via Adversarial Deep Semantic Matching Network

Kai Cheng (Huaqiao University); Xin Liu (Huaqiao University)*; Yiu-ming CHEUNG (Hong Kong Baptist University); Rui Wang (Huaqiao University); Xing Xu (University of Electronic Science and Technology of China); Bineng Zhong (Huaqiao University)

1702 - Concept Drift Detection for Multivariate Data Streams and Temporal Segmentation of Daylong Egocentric Videos

Pravin Nagar (IIIT Delhi)*; Mansi Khemka (Columbia University); Chetan Arora (Indian Institute of Technology Delhi)

1708 - Dynamic Context-guided Capsule Network for Multimodal Machine Translation

Huan Lin (Xiamen University)*; Fandong Meng (Tencent WeChat AI - Pattern Recognition Center Tencent Inc.); Jinsong Su (Xiamen University); Yongjing Yin (Xiamen University); Zhengyuan Yang (University of Rochester); Yubin Ge (University of Illinois at Urbana-Champaign); Jie Zhou (Tencent); Jiebo Luo (U. Rochester)

1710 - DeepSonar: Towards Effective and Robust Detection of AI-Synthesized Fake Voices

Run Wang (Nanyang Technological University)*; Felix Juefei-Xu (Alibaba Group); Yihao Huang (East China Normal University); Qing Guo (Nanyang Technological University); Xiaofei Xie (Nanyang Technological University); Lei Ma (Kyushu University); Yang Liu (Nanyang Technology University, Singapore)

1717 - RIRNet: Recurrent-In-Recurrent Network for Video Quality Assessment

Pengfei Chen (Xidian University / China University of Mining and Technology); Leida Li (Xidian University)*; Lei Ma (Hangzhou Multi-Color Optoelctronics Co., Ltd.); Jinjian Wu (Xidian University); Guangming Shi (Xidian University)

1719 - Black Re-ID: A Head-shoulder Descriptor for the Challenging Problem of Person Re-Identification

BOQIANG XU (University of Chinese Academy of Sciences;Institute of Automation,Chinese Academy of Sciences)*; Lingxiao He (AI Research of JD.com); Xingyu Liao (AI Research of JD.com); Wu Liu (AI Research of JD.com); Zhenan Sun (Chinese of Academy of Sciences); Tao Mei (AI Research of JD.com)

1722 - PopMAG: Pop Music Accompaniment Generation

Yi Ren (Zhejiang University)*; Jinzheng He (Zhejiang University); Xu Tan (Microsoft Research Asia); Tao Qin (Microsoft Research Asia); Zhou Zhao (Zhejiang University); Tie-Yan Liu (Microsoft)

1729 - PCPL: Predicate-Correlation Perception Learning for Unbiased Scene Graph Generation

Shaotian Yan (Zhejiang University)*; Chen Shen (Alibaba Group); Zhongming Jin (Alibaba Group); Jianqiang Huang (Alibaba Group); Rongxin Jiang (Zhejiang University); Yaowu Chen (Zhejiang University); Xian-Sheng Hua (Alibaba Group)

1761 - Differentiable Manifold Reconstruction for Point Cloud Denoising

Shitong Luo (Peking University)*; Wei Hu (Peking University)

1775 - Discriminative Spatial Feature Learning for Person Re-Identification

Peixi Peng (Peking University)*; Yonghong Tian (Peking University); Yangru Huang (Beijing University); Xiangqian Wang (Huawei); Huilong An (AI Application Research Center)

1781 - FakePolisher: Making DeepFakes More Detection-Evasive by Shallow Reconstruction

Yihao Huang (East China Normal University)*; Felix Juefei-Xu (Alibaba Group); Run Wang (Nanyang Technological University); Qing Guo (Nanyang Technological University); Lei Ma (Kyushu University); Xiaofei Xie (Nanyang Technological University); Jianwen Li (East China Normal University); Weikai Miao (East China Normal University); Yang Liu (Nanyang Technology University, Singapore); Geguang Pu (East China Normal University)

1784 - SalGCN: Saliency Prediction for 360-Degree Images Based on Spherical Graph Convolutional Networks

Haoran Lv (Shanghai Jiao Tong University)*; Qin Yang (Shanghai Jiao Tong University); Chenglin Li (Shanghai Jiao Tong University); Wenrui Dai (Shanghai Jiao Tong University); Junni Zou (Shanghai Jiao Tong University); Hongkai Xiong (Shanghai Jiao Tong University)

1800 - AdaHGNN: Adaptive Hypergraph Neural Networks for Multi-Label Image Classification

Xiangping Wu (Harbin Institute of Technology, Shenzhen); Qingcai Chen ( Harbin Institute of Technology, Shenzhen)*; Wei Li (Harbin Institute of Technology, Shenzhen); Yulun Xiao (Harbin Institute of Technology, Shenzhen); Baotian Hu (University of Massachusetts)

1828 - Reinforced Similarity Learning: Siamese Relation Networks for Robust Object Tracking

Dawei Zhang (Zhejiang Normal University)*; Zhonglong Zheng (Zhejiang Normal University); Minglu Li (Zhejiang Normal University); Xiaowei He (Zhejiang Normal University); Tianxiang Wang (Zhejiang Normal University); Liyuan Chen (Zhejiang Normal University); Riheng Jia (Zhejiang Normal University); Feilong Lin (Zhejiang Normal University)

1832 - AffectI: A Game for Diverse, Reliable, and Efficient Affective Image Annotation

xingkun zuo (University of Yamanashi); Jiyi Li (University of Yamanashi / RIKEN AIP); qili zhou (hangzhou dianzi university); jianjun li (HangZhou Dianzi University); Xiaoyang mao (University of Yamanashi)*

1837 - Cognitive Representation Learning of Self-Media Online Article Quality

Yiru Wang (Tencent Inc.; Tsinghua University)*; Shen Huang (Tencent Inc.); Gongfu Li (Tencent Inc.); Qiang Deng (Tencent Inc.); Dongliang Liao (Data Quality Team, WeChat, Tencent Inc., China); Pengda Si (Tsinghua University); Yujiu Yang (Tsinghua University); Jin Xu (Tencent Inc.)

1852 - Describing Subjective Experiment Consistency by p-value qq-plot

Jakub Nawała (AGH University of Science and Technology)*; Lucjan Janowski (AGH University of Science and Technology); Bogdan Ćmiel (); Krzysztof Rusek (AGH University of Science and Technology)

1859 - Deep Structural Contour Detection

Ruoxi Deng (Central South University)*; Shengjun Liu (Central South University)

1874 - Multimodal Multi-Task Financial Risk Forecasting

Ramit Sawhney (Netaji Subhas Institute of Technology)*; Puneet Mathur (University of Maryland, College Park); Ayush Mangal (IIT Roorkee); Piyush Khanna (Delhi Technological University); Rajiv Ratn Shah ("Indraprastha Institute of Information Technology, Delhi"); Roger Zimmermann (NUS)

1893 - Cross-modal Non-linear Guided Attention and TemporalCoherence in Multi-modal Deep Video Models

Saurabh Sahu (); Palash Goyal (Samsung Research); Shalini Ghosh (Samsung Research)*; Chul Lee (Samsung Research America)

1946 - Multi-modal Cooking Workflow Construction for Food Recipes

Liang-Ming Pan (National University of Singapore)*; Jingjing Chen (Fudan University); Jianlong Wu (Fudan University); Shaoteng Liu (Xi'an Jiaotong University); Chong-Wah Ngo (City University of Hong Kong); Min-Yen Kan (National University of Singapore); Yu-Gang Jiang (Fudan University); Tat-Seng Chua (National university of Singapore)

(Video) ACM Multimedia 2020 Tutorial-part1-New trends of person re-ID system - Zheng Wang

1950 - Distributed Multi-agent Video Fast-forwarding

Shuyue Lan (Northwestern University)*; Zhilu Wang (Northwestern University); Amit K. Roy-Chowdhury (University of California, Riverside); Ermin Wei (); Zhu Qi (Northwestern University)

1988 - IR-GAN: Image Manipulation with Linguistic Instruction by Increment Reasoning

Zhenhuan Liu (Institute of Computing Technology, Chinese Academy of Sciences); liang li (Institute of Computing Technology, Chinese Academy of Sciences)*; Shaofei Cai (Institute of Computing Technology, Chinese Academy of Sciences); Jincan Deng (Institute of Computing Technology, Chinese Academy of Sciences); Qianqian Xu (Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences); Shuhui Wang (VIPL,ICT,Chinese academic of science); Qingming Huang (University of Chinese Academy of Sciences)

1994 - LIGHTEN: Learning Interactions with Graph and Heirarchical TEmporal Networks for HOI in videos

Sai Praneeth Reddy Sunkesula (Indian Institute of Technology, Bombay)*; Rishabh Dabral (IIT Bombay); Ganesh Ramakrishnan (IIT Bombay)

2014 - BS-MCVR: Binary-sensing based Mobile-cloud Visual Recognition

Hongyi Zheng (The Hong Kong Polytechnic University); Lei Zhang ("Hong Kong Polytechnic University, Hong Kong, China")*

2017 - Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition

Yuqian Fu (Fudan University)*; Yanwei Fu (Fudan University); junke wang (Fudan University); Li Zhang (University of Oxford); Xing Zhang (Fudan University); Yu-Gang Jiang (Fudan University)

2030 - Learning Modality-Invariant Latent Representations for Generalized Zero-shot Learning

Jingjing Li (University of Electronic Science and Technology of China)*; Mengmeng Jing (University of Electronic Science and Technology of China); Lei Zhu (Shandong Normal Unversity); Zhengming Ding (Indiana University-Purdue University Indianapolis); Ke Lu (University of Electronic Science and Technology of China); Yang Yang (University of Electronic Science and Technology of China)

2032 - When Bitstream Prior Meets Deep Prior: Compressed Video Super-resolution with Learning from Decoding

Peilin Chen (City University of Hong Kong)*; Wenhan Yang (City University of Hong Kong); Long Sun (Huawei); Shiqi Wang (CityU)

2035 - Describe What to Change: A Text-guided Unsupervised Image-to-image Translation Approach

Yahui Liu (University of Trento); Marco De Nadai (Fondazione Bruno Kessler)*; Deng Cai (The Chinese University of Hong Kong); Huayang Li (Tencent AI Lab); Xavier Alameda-Pineda (INRIA); Nicu Sebe (University of Trento); Bruno Lepri (FBK, Trento, Italy)

2052 - Increasing Video Perceptual Quality with GANs and Semantic Coding

Leonardo Galteri (University of Florence); Marco Bertini (University of Florence)*; Lorenzo Seidenari (University of Florence); Tiberio Uricchio (University of Florence); Alberto Del Bimbo (University of Florence)

2053 - Attentive One-Dimensional Heatmap Regression for Facial Landmark Detection and Tracking

Shi Yin (University of Science and Technology of China); Shangfei Wang (University of Science and Technology of China)*; Xiaoping Chen (University of Science and Technology of China); Enhong Chen (University of Science and Technology of China); Cong Liang (University of Science and Technology of China)

2071 - Fine-Grained Similarity Measurement between Educational Videos and Exercises

Xin Wang (University of Science and Technology of China); Wei Huang (University of Science and Technology of China); Qi Liu (" University of Science and Technology of China, China")*; Yu Yin (University of Science and Technology of China); Zhenya Huang (University of Science and Technology of China ); Le Wu (Hefei University of Technology); Jianhui Ma (University of Science and Technology of China); Xue Wang (Nankai University)

2073 - One-shot Text Field labeling using Attention and Belief Propagation for Structure information extraction

Mengli Cheng (Alibaba Group)*; Minghui Qiu (Alibaba)

2081 - GRAD: Learning for Overhead-aware Adaptive Video Streaming with Scalable Video Coding

Yunzhuo Liu (Shanghai Jiao Tong University); Bo Jiang (Shanghai Jiao Tong University)*; Tian Guo (Worcester Polytechnic Institute); Ramesh K. Sitaraman (UMass Amherst & Akamai Technologies); Don Towsley (University of Massachusetts Amherst); Xinbing Wang (Shanghai Jiao Tong University)

2088 - Down to the Last Detail: Virtual Try-on with Fine-grained Details

Jiahang Wang (Huazhong University of Science and Technology)*; Tong Sha (Beihang University); Wei Zhang (JD AI Research); Zhoujun Li (Beihang University); Tao Mei (AI Research of JD.com)

2151 - Reduce the Influence of Stability in Content Delivery Network via Learning-Based Caching Algorithm

Gang Yan (Binghamton University-SUNY); Jian Li (Binghamton University-SUNY )*

2158 - Temporal Denoising Mask Synthesis Network for Learning Blind Video Temporal Consistency

Yifeng Zhou (University of Electronic Science and Technology of China); Xing Xu (University of Electronic Science and Technology of China)*; Fumin Shen (UESTC); Lianli Gao (The University of Electronic Science and Technology of China); Huimin Lu (Kyushu Institute of Technology); Heng Tao Shen (University of Electronic Science and Technology of China (UESTC))

2174 - INCLUDE: A Large Scale Dataset for Indian Sign Language Recognition

Advaith Sridhar (IIT Madras)*; Rohith Gandhi G (IIT Madras); Pratyush Kumar (IIT Madras); Mitesh Khapra (IIT Madras)

2205 - A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild

Prajwal K R (International Institute of Information Technology, Hyderabad)*; Rudrabha Mukhopadhyay (IIIT Hyderabad); Vinay Namboodiri (University of Bath); C.V. Jawahar (IIIT-Hyderabad)

2237 - Efficient adaptation of neural network filter for video compression

Yat-Hong Lam (Nokia Technologies)*; Alireza Zare (Nokia Technologies); Francesco Cricri (Nokia Technologies); Jani Lainema (Nokia); Miska Hannuksela (Nokia Technologies)

2246 - An Analysis of Delay in Live 360° Video Streaming Systems

Jun Yi (Georgia State University)*; Md Reazul Islam (Georgia State University); Shivang Aggarwal (University at Buffalo, The State University of New York); Dimitrios Koutsonikolas (SUNY Buffalo); Y. Charlie Hu (Purdue University); Zhisheng Yan (Georgia State University)

2249 - Adaptive Temporal Triplet-loss for Cross-modal Embedding Learning

David Semedo (Universidade NOVA de Lisboa)*; Joao Magalhaes (Universidade NOVA Lisboa)

2257 - SonoSpace: Visual Feedback of Timbre with Unsupervised Learning

Naoki Kimura (The University of Tokyo)*; Keisuke Shiro (The University of Tokyo); Yota Takakura (Innoqua Inc.); Hiromi Nakamura (The University of Tokyo); Jun Rekimoto (The Univertsity of Tokyo)

2264 - Amora: Black-box Adversarial Morphing Attack

Run Wang (Nanyang Technological University)*; Felix Juefei-Xu (Alibaba Group); Qing Guo (Nanyang Technological University); Yihao Huang (East China Normal University); Xiaofei Xie (Nanyang Technological University); Lei Ma (Kyushu University); Yang Liu (Nanyang Technology University, Singapore)

2323 - Single Image Deraining via Scale-space Invariant Attention Neural Network

Bo Pang (Harbin Institute of Technology); Deming Zhai (Harbin Institute of Technolgy); Junjun Jiang (Harbin Institute of Technology); Xianming Liu (Harbin Institute of Technology)*

2342 - Concept-based Explanation for Fine-grained Images and Its Application in Infectious Keratitis Classification

Zhengqing Fang (Zhejiang University)*; Kun Kuang (Zhejiang University); Yuxiao Lin (Zhejiang University); Fei Wu (Zhejiang University); Yufeng Yao (Zhejiang University)

2448 - Visual Relation of Interest Detection

Fan Yu (Nanjing University); Haonan Wang (Nanjing University); Tongwei Ren (Nanjing University)*; Jinhui Tang (Nanjing University of Science and Technology); Gangshan Wu (Nanjing University)

38 - ShapeCaptioner: Generative Caption Network for 3D Shapes by Learning a Mapping from Parts Detected in Multiple Views to Sentences

"Zhizhong Han (University of Maryland, College Park); Chao Chen (Tsinghua University); Yu-Shen Liu (Tsinghua University)*; Matthias Zwicker (University of Maryland)"

46 - VideoIC: A Video Interactive Comments Dataset and Multimodal Multitask Learning for Comments Generation

Weiying Wang (Renmin University of China)*; Jieting Chen (Renmin University of China); Qin Jin (Renmin University of China)

53 - Image Inpainting Based on Multi-frequency Probabilistic Inference Model

Jin Wang (Beijing University of Technology)*; Chen Wang (Beijing University of Technology); Qingming Huang (University of Chinese Academy of Sciences); Yunhui Shi (Beijing University of Technology); Jian-Feng Cai (The Hong Kong University of Science and Technology); Qing Zhu (Beijing University of Technology); Baocai Yin (Beijing University of Technology)

60 - "Learning from the Past: Meta-Continual Learning with Knowledge Embedding for Jointly Sketch, Cartoon, and Caricature Face Recognition"

"Wenbo Zheng (School of Software Engineering, Xi'an Jiaotong University); Lan Yan (The State Key Laboratory for Management and Control of Complex Systems, Institute of Automation, Chinese Academy of Sciences); Chao Gou (School of Intelligent Systems Engineering, Sun Yat-sen University)*; Fei-Yue Wang (The State Key Laboratory for Management and Control of Complex Systems, Institute of Automation, Chinese Academy of Sciences)"

63 - Dual Adversarial Network for Unsupervised Ground/Satellite-to-Aerial Scene Adaptation

jianzhe peter lin (University of British Columbia)*; Lichao Mou (DLR&TUM); tianze yu (University of British Columbia); Xiaoxiang Zhu (Technical University of Munich (TUM); German Aerospace Center (DLR)); Z. Jane Wang (University of British Columbia)

68 - Scene-Aware Background Music Synthesis

Yujia Wang (Beijing Institute of Technology)*; Wei Liang (Beijing Institute of Technology); Wanwan Li (George Mason University); Dingzeyu Li (Adobe Research); Lap-Fai Yu (George Mason University)

70 - Textual Dependency Embedding for Person Search by Language

"Kai Niu (National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences; University of Chinese Academy of Sciences;)*; Yan Huang (Institute of Automation, Chinese Academy of Sciences); Liang Wang (NLPR, China)"

73 - University-1652: A Multi-view Multi-source Benchmark for Drone-based Geo-localization

Zhedong Zheng (University of Technology Sydney)*; Yunchao Wei (UTS); Yi Yang (UTS)

75 - Adversarial Bipartite Graph Learning for Video Domain Adaptation

"Yadan Luo (University of Queensland)*; Zi Huang (University of Queensland); Zijian Wang (University of Queensland); Zheng Zhang (Harbin Institute of Technology, Shenzhen); Mahsa Baktashmotlagh (University of Queensland)"

76 - DIPDefend: Deep Image Prior Driven Defense against Adversarial Examples

Tao Dai (Tsinghua University)*; Yan Feng (Tsinghua University); Dongxian Wu (Tsinghua University); Bin Chen (Tsinghua University); Jian Lu (Shenzhen University); Yong Jiang (Tsinghua University); Shutao Xia (Tsinghua University)

77 - MRS-Net: Multi-Scale Recurrent Scalable Network for Face Quality Enhancement of Compressed Videos

Tie Liu (BUAA)*; Mai Xu (BUAA); Shengxi Li (Imperial College London); Rui Ding (Beihang University); Huaida Liu (Momo Inc.)

78 - TRIE: End-to-End Text Reading and Information Extraction for Document Understanding

PENG ZHANG (Hikvision Research Institute); Yunlu Xu (Hikvision Research Institute); Zhanzhan Cheng (Hikvision Research Institute)*; Shiliang Pu (Hikvision Research Institute); Jing Lu (Hikvision Research Institute); Liang Qiao (Hikvision Research Institute); Yi Niu (Hikvision Research Institute); Fei Wu (Zhejiang University)

86 - Iterative Back Modification for Faster Image Captioning

"Zhengcong Fei (Chinese Academy of Sciences, Institute of Computing Technology)*"

87 - Visual-Semantic Graph Matching for Visual Grounding

"Chenchen Jing (Beijing Institute of Technology); Mingtao Pei (Beijing Institute of Technology); Yuwei WU (Beijing Institute of Technology (BIT), China)*; Yao Hu (Alibaba Youku Cognitive and Intelligent Lab); Yunde Jia (Beijing Institute of Technology); Qi Wu (University of Adelaide)"

99 - Human Identification and Interaction Detection in Cross-View Multi-Person Videos with Wearable Cameras

"Jiewen Zhao (College of Intelligence and Computing, Tianjin University); Ruize Han (College of Intelligence and Computing, Tianjin University); Yiyang Gan (College of Intelligence and Computing, Tianjin University); Liang Wan (College of Intelligence and Computing, Tianjin University)*; Wei Feng (College of Intelligence and Computing, Tianjin University, China); Song Wang (University of South Carolina)"

111 - Domain Adaptive Person Re-Identification via Coupling Optimization

Xiaobin Liu (Peking University); Shiliang Zhang (Peking University)*

118 - Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge

Peng Wang (Northwestern Polytechnical University); Dongyang Liu (Northwestern Polytechnical University); Hui Li (the University of Adelaide)*; Qi Wu (University of Adelaide)

123 - Adversarial Privacy-preserving Filter

"Jiaming Zhang (Beijing Jiaotong University, China)*; Jitao Sang (Beijing Jiaotong University, China); Xian Zhao (Beijing Jiaotong University, China); Xiaowen Huang (Beijing Jiaotong University, China); Yanfeng Sun (Beijing University of Technology); Yongli Hu (Beijing University of Technology)"

135 - Deep Disturbance-disentangled Learning for Facial Expression Recognition

Delian Ruan (Xiamen University); Yan Yan (Xiamen University)*; Si Chen (Xiamen University of Technology); Jing-Hao Xue (University College London); Hanzi Wang (Xiamen University)

142 - Controllable Video Captioning with an Exemplar Sentence

Yitian Yuan (Tsinghua University)*; Lin Ma (Tencent AI Lab); Jingwen Wang (Tencent AI Lab); Wenwu Zhu (Tsinghua University)

143 - MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos

Guangyao Shen (Tsinghua University)*; Xin Wang (Tsinghua University); Xuguang Duan (Tsinghua University); Hongzhi Li (Microsoft Research); Wenwu Zhu (Tsinghua University)

146 - Mix Dimension in Poincar\`e Geometry for 3D Skeleton-based Action Recognition

"Wei Peng (CMVS, University of Oulu)*; Jingang Shi (University of Oulu); Zhaoqiang Xia (Northwestern Polytechnical University); Guoying Zhao (University of Oulu)"

151 - Online Multi-view Subspace Learning with Mixed Noise

"Jinxing Li (The Chinese University of Hong Kong (Shenzhen))*; Hongwei Yong (The Hong Kong Polytechnic University); Feng Wu (University of Science and Technology of China); Mu Li (The Chinese University of Hong Kong, Shenzhen)"

160 - Few-Shot Ensemble Learning for Video Classification with SlowFast Memory Networks

"Mengshi Qi (Ecole polytechnique f¨¦d¨¦rale de Lausanne (EPFL))*; Jie Qin (Inception Institute of Artificial Intelligence); Xiantong Zhen (University of Amsterdam); Di Huang (Beihang University, China); Yi Yang (UTS); Jiebo Luo (U. Rochester)"

162 - Single Image De-noising via Staged Memory Network

Weijiang Yu (SUN YAT-SEN UNIVERSITY)*; Jian Liang (Nanchang University); Lu Li (Zhejiang University); Nong Xiao (Sun Yat-sen University)

166 - LAL: Linguistically Aware Learning for Scene Text Recognition

Yi Zheng (Bostion University)*; Wenda Qin (Bostion University); Derry Wijaya (Boston University); Margrit Betke (Boston University)

174 - Emerging Topic Detection on the Meta-data of Images from Fashion Social Media

"Kunihiro Miyazaki (The University of Tokyo)*; Scarlett Young (Neural Pocket Inc.); Yuichi Sasaki (Neural Pocket); Takayuki Uchiba (Sugakubunka Co., Ltd.); Kenji Tanaka (the University of Tokyo)"

180 - Dynamic Extension Nets for Few-shot Semantic Segmentation

Lizhao Liu (South China University of Technology); Junyi Cao (South China University of Technology); Minqian Liu (South China University of Technology); Yong Guo (South China University of Technology); Qi Chen (South China University of Technology); Mingkui Tan (South China University of Technology)*

187 - Interpretable Embedding for Ad-Hoc Video Search

Jiaxin Wu (City University of Hong Kong)*; Chong-Wah Ngo (City University of Hong Kong)

191 - Joint Attribute Manipulation and Modality Alignment Learning for Composing Text and Image to Image Retrieval

"Feifei Zhang (Institute of Automation, Chinese Academy of Sciences); Mingliang Xu (Zhengzhou University); Qirong Mao (Jiangsu University)*; Changsheng Xu (CASIA)"

192 - Leveraging QoE Heterogenity for Large-Scale Livecaset Scheduling

"Ruixiao Zhang (Tsinghua University)*; Ming Ma (Beijing Kuaishou Technology Co., Ltd); Tianchi Huang (Tsinghua University); Hanyu Li (Tsinghua University); Jiangchuan Liu (Simon Fraser University); Lifeng Sun (Tsinghua University)"

193 - Dual-Structure Disentangling Variational Generation for Data-Limited Face Parsing

"Peipei Li ( Institute of Automation Chinese Academy of Sciences)*; Yinglu Liu (JD AI); Hailin Shi (JD AI); Xiang Wu (Reconova); Yibo Hu (Institute of Automation, Chinese Academy of Sciences); Ran He (Institute of Automation, Chinese Academy of Sciences); Zhenan Sun (Chinese of Academy of Sciences)"

194 - Surface Reconstruction with Unconnected Normal Maps: An Efficient Mesh-based Approach

Miaohui Wang (Shenzhen University); Wuyuan Xie (Shenzhen University)*

197 - A Human-Computer Duet System for Music Performance

Yuen-Jen Lin (Academia Sinica)*; Hsuan-Kai Kao (Academia Sinica); Yih-Chih Tseng (Academia Sinica); Ming Tsai (KoKo Lab); Li Su (Academia Sinica)

199 - LSOTB-TIR: A Large-Scale High-Diversity Thermal Infrared Object Tracking Benchmark

"Qiao Liu (Harbin Institute of Technology, Shenzhen); Xin Li (Harbin Institute of Technology, Shenzhen); Zhenyu He (Harbin Institute of Technology (Shenzhen); Peng Cheng Laboratory)*; Chenglong Li (Anhui University); Jun Li (HARBIN INSTITUTE OF TECHNOLOGY, SHENZHEN); Zikun Zhou (Harbin Institute of Technology, Shenzhen); Di Yuan (Harbin Institute of Technology, Shenzhen; Monash University); Jing Li (Harbin Institute of Technology, Shenzhen); kai yang (Harbin Institute of Technology, Shenzhen); Nana Fan (Harbin Institute of Technology, Shenzhen); Feng Zheng (SUSTech)"

202 - Invisible: Federated Learning over Non-Informative Intermediate Updates against Multimedia Privacy Leakages

Qiushi Li (Tsinghua University)*; Wenwu Zhu (Tsinghua University); Chao Wu (Tsinghua University); xinglin pan (University of Electronic Science and Technology of China); Fan Yang (Tsinghua University); Yuezhi Zhou (Tsinghua University); Yaoxue Zhang (Tsinghua University)

204 - Cascade Reasoning Network For Text-based Visual Question Answering

Fen Liu (South China University of Technology); Guanghui Xu (South China University of Technology); Qi Wu (University of Adelaide); Qing Du (South China Univercity of Technology); Wei Jia (CVTE Research); Mingkui Tan (South China University of Technology)*

208 - Fast Enhancement for Non-Uniform Illumination Images using Light-weight CNNs

Feifan Lv (Beihang University); Bo Liu (Beihang University); Feng Lu (Beihang University)*

209 - Animating Through Warping: an Efficient Method for High-Quality Facial Expression Animation

Zili Yi (Huawei Canada)*; Qiang Tang (Huawei Canada); Vishnu Sanjay Ramiya Srinivasan (Huawei); Zhan Xu (Huawei Canada)

211 - Exploiting Better Feature Aggregation for Video Object Detection

Liang Han (Stony Brook University); Pichao Wang (Alibaba Group (U.S.) Inc.)*; Zhaozheng Yin (Stony Brook University); Fan Wang (Alibaba Group); Hao Li (Alibaba Group)

220 - NuI-Go: Recursive Non-local Encoder-Decoder Network for Retinal Image Non-uniform Illumination Removal

"Chongyi Li ( Nanyang Technological University)*; Huazhu Fu (Inception Institute of Artificial Intelligence); Runmin Cong (Beijing Jiaotong University); Zechao Li (Nanjing University of Science and Technology); Qianqian Xu (Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences)"

221 - Every Moment Matters: Detail-Aware Networks to Bring a Blurry Image Alive

"Kaihao Zhang (Australian National University)*; Wenhan Luo (Tencent AI Lab); Bjorn Stenger (Rakuten Institute of Technology); Wenqi Ren (Institute of Information Engineering, Chinese Academy of Sciences); Lin Ma (Tencent AI Lab); HONGDONG LI (Australian National University, Australia)"

227 - Online Filtering Training Samples for Robust Visual Tracking

Jie Zhao (Dalian University of Technology); Kenan Dai (Dalian University of Technology); Dong Wang (Dalian University of Technology)*; Huchuan Lu (Dalian University of Technology)

228 - Boosting Continuous Sign Language Recognition via Cross Modality Augmentation

Junfu Pu (University of Science and Technology of China)*; Hezhen Hu (University of Science and Technology of China); Wengang Zhou (University of Science and Technology of China); Houqiang Li (University of Science and Technology of China)

230 - Self-supervised Dance Video Synthesis Conditioned on Music

Xuanchi Ren (HKUST); Haoran Li (The Hong Kong University of Science and Technology); Zijian HUANG (the Hong Kong University of Science and Technology); Qifeng Chen (HKUST)*

232 - Co-Attentive Lifting for Infrared-Visible Person Re-Identification

Xing Wei (Xi'an Jiaotong University)*; Diangang Li (Xi'an Jiaotong University); Xiaopeng Hong (Xi'an Jiaotong University); Wei Ke (Xi'an Jiaotong University); Yihong Gong (Xi'an Jiaotong University)

235 - MOR-UAV: A Benchmark Dataset and Baselines for Moving Object Recognition in UAV Videos

Murari Mandal (Malaviya National Institute of Technology Jaipur)*; Lav Kush Kumar (Malaviya National Institute of Technology Jaipur); Santosh Kumar vipparthi (MNIT)

241 - Jointly Cross- and Self-Modal Graph Attention Network for Query-Based Moment Localization

Daizong Liu (Huazhong University of Science and Technology)*; Xiaoye Qu (Huazhong University of Science and Technology); Xiao-Yang Liu (Columbia University); Jianfeng Dong (Zhejiang Gongshang University); Pan Zhou ( Huazhong University of Science and Technology); Zichuan Xu (Dalian University of Technology)

251 - Learning Tuple Compatibility for Conditional Outfit Recommendation

"Xuewen Yang (Stony Brook University)*; Jiangbo Yuan (eBay Inc.,); Wanying Ding (JPMorgan Chase & Co ); Pengyun Yan (Vipshop Inc); Dongliang Xie (Beijing University of Posts and Telecommunications); Xin Wang (Stony Brook University)"

265 - ThumbNet: One Thumbnail Image Contains All You Need for Recognition

Chen Zhao (KAUST)*; Bernard Ghanem (KAUST)

267 - Efficient Crowd Counting via Structured Knowledge Transfer

Lingbo Liu (Sun Yat-sen University)*; Jiaqi Chen (Sun Yat-sen University); Hefeng Wu (Sun Yat-sen University); Tianshui Chen (DarkMatter AI); Guanbin Li (Sun Yat-sen University); Liang Lin (DarkMatter AI)

274 - Text-guided Image Inpainting

"Zijian Zhang (Zhejiang University)*; Zhou Zhao (Zhejiang University); Zhu Zhang (Zhejiang University); Baoxing Huai (HUAWEI TECHNOLOGIES CO., LTD.); Jing Yuan (Huawei Cloud BU)"

282 - Lab2Pix: Label-Adaptive Generative Adversarial Network for Unsupervised Image Synthesis

Lianli Gao (The University of Electronic Science andTechnology of China); junchen zhu (University of Electronic Science and Technology of China); Jingkuan Song (UESTC)*; Feng Zheng (SUSTech); Heng Tao Shen (University of Electronic Science and Technology of China (UESTC))

291 - Dynamic GCN: Context-enriched Topology Learning for Skeleton-based Action Recognition

Fanfan Ye ( Hikvision Research Institute); Shiliang Pu (Hikvision Research Institute); Qiaoyong Zhong (Hikvision Research Institute)*; Chao Li (Hikvision Research Institute); Di Xie (Hikvision Research Institute); Huiming Tang (Zhejiang University)

298 - Dual Temporal Memory Network for Efficient Video Object Segmentation

Kaihua Zhang (NUIST)*; Long Wang (Nanjing University of Information Science & Technology); Dong Liu (Netflix Inc); Bo Liu (JD.com); Qingshan Liu (Nanjing University of Information Science & Technology); Zhu Li (university of missouri-kansas city)

304 - Boosting Visual Question Answering with Context-aware Knowledge Aggregation

Guohao Li (Tsinghua University)*; Xin Wang (Tsinghua University); Wenwu Zhu (Tsinghua University)

306 - Meta Parsing Networks: Towards Generalized Few-shot Scene Parsing with Adaptive Metric Learning

Peike Li (UTS)*; Yunchao Wei (University of Technology Sydney); Yi Yang (UTS)

312 - CODAN: Counting-driven Attention Network for Vehicle Detection in Congested Scenes

Wei Li (Southwest Jiaotong University); Zhenting Wang (Southwest Jiaotong University); Xiao Wu (Southwest Jiaotong University)*; Ji Zhang (Southwest Jiaotong University); Qiang Peng (Southwest Jiaotong University); Hongliang Li (University of Electronic Science and Technology of China)

318 - Coorperative Bi-path Metric for Few-shot Learning

Zeyuan Wang (Beihang University); Yifan Zhao (Beihang University); Jia Li (Beihang University)*; Yonghong Tian (Peking University)

325 - Deep Unsupervised Hybrid-similarity Hadamard Hashing

"Wanqian Zhang (Institute of Information Engineering, Chinese Academy of Sciences); Dayan Wu (Institute of Information Engineering, Chinese Academy of Sciences)*; Yu Zhou (Institute of Information Engineering, CAS); Bo Li ( Institute of Information Engineering, Chinese Academy of Sciences); Weiping Wang (Institute of Information Engineering, CAS, China); Dan Meng (Institute of Information Engineering, CAS)"

330 - Semi-supervised Online Multi-Task Metric Learning for Visual Recognition and Retrieval

"Yangxi Li (National Computer network Emergency Response technical Team/Coordination Center of China)*; Han Hu (Beijing Institute of Technology, China); Jin Li (Beijing University of Posts and Telecommunications); Yong Luo (Nanyang Technological University); Yonggang Wen (Nanyang Technological University)"

352 - Modeling both Intra- and Inter-modal Influence for Real-Time Emotion Detection in Conversations

Dong Zhang (Soochow University)*; Weisheng Zhang (Soochow University); Shoushan Li (Soochow University); Zhu Qiaoming (Soochow University); Zhou Guodong (Soochow University)

355 - WIKI Food-500: A dataset for Large-Scale Food Recognition via Stacked Global-Local Attention Network

"Weiqing Min (Institute of Computing Technology, Chinese Academy of Sciences)*; Linhu Liu (ICT); Zhiling Wang (Institute of Computing Technology, Chinese Academy of Sciences); Zhengdong Luo (University of Chinese Academy of Sciences); Xiaoming Wei (MeituanDianping group ); Xiaolin Wei (MeituanDianping group ); Shuqiang Jiang (ICT, China Academy of Science)"

356 - RT-VENet: A Convolutional Network for Real-time Video Enhancement

Mohan Zhang (Zhejiang university); Qiqi Gao (Microsoft Research Asia); Jinglu Wang (Microsoft Research Asia); Henrik Turbell (Microsoft); David Zhao (Microsoft); Jinhui Yu (Zhejiang Unviersity); Yan Lu (Microsoft Research Asia)*

358 - Learning Image Classifier from Only Web Labels and Metadata: Automatic Label Correction through Graph

Jingkang Yang (Sensetime Research)*; Weirong Chen (SenseTime Research); Litong Feng (Sensetime Research); Xiaopeng Yan (SenseTime Research); Huabin Zheng (SenseTime Research); Wayne Zhang (SenseTime Research)

359 - From Design Draft to Real Attire: Unaligned Fashion Image Translation

Yu Han (Peking University)*; Shuai Yang (Peking University); Wenjing Wang (Peking University); Jiaying Liu (Peking University)

362 - Towards More Explainability: Concept Knowledge Mining Network for Event Recognition

"Zhaobo Qi (University of Chinese Academy of Sciences)*; Shuhui Wang (VIPL,ICT,Chinese academic of science); Chi Su (Kingsoft Cloud); Li Su (University of Chinese Academy of Sciences); Qingming Huang (University of Chinese Academy of Sciences); Qi Tian (Huawei Cloud & AI)"

365 - Multi-task Regression for Facial Action Unit Intensity Estimation via Differentiable Renderer

Xinhui Song (Netease Fuxi AI Lab)*; Tianyang Shi (NetEase Fuxi AI Lab); Zunlei Feng (Zhejiang University); Mingli Song (Zhejiang University); Jackie Lin (Netease Fuxi AI Lab); Chuanjie Lin (Netease Fuxi AI Lab); Yi Yuan (NetEase Fuxi AI Lab); Changjie Fan (NetEase Fuxi AI Lab)

370 - Siamese Attentive Graph Tracking

"Fei zhao (Alibaba Group & Institute of Automation,Chinese Academy of Sciences)*; Ting Zhang (CEIEC); Chao Ma (Shanghai Jiao Tong University); Ming Tang (Chinese Academy of Sciences, China); Jinqiao Wang (Institute of Automation, Chinese Academy of Sciences); Xiaobo Wang (Alibaba Group)"

373 - Photo Stand-Out: Photography with Virtual Character

Yujia Wang (Beijing Institute of Technology)*; Sifan Hou (Beijing Institute of Technology); Wei Liang (Beijing Institute of Technology); Bing Ning (Beijing Institute of Fashion Technology)

376 - DeSmoothGAN: Recovering Details of Smoothed Images via Spatial Feature-wise Transformation and Full Attention

Yifei Huang (East China Normal University)*; Chenhui Li (East China Normal University); Xiaohu Guo (The University of Texas at Dallas); Jing Liao (City University of Hong Kong); Chenxu Zhang (The University of Texas at Dallas); Changbo Wang (East China Normal University)

378 - Accurate UAV Tracking with Distance-Injected Overlap Maximization

Chunhui Zhang (Chinese Academy of Sciences); Shiming Ge (Chinese Academy of Sciences)*; Kangkai Zhang (Chinese Academy of Sciences); Dan Zeng (Shanghai University)

380 - Look Through Masks: Towards Occluded Face Recognition with Amodal Completion

Chenyu Li (Chinese Academy of Sciences); Shiming Ge (Chinese Academy of Sciences)*; Daichi Zhang (Chinese Academy of Sciences); Jia Li (Beihang University)

383 - Context-Aware Multi-View Summarization Network for Image-Text Matching

Leigang Qu (Shandong University); Meng Liu (Shandong Jianzhu University); Da Cao (Hunan University); Liqiang Nie (Shandong University )*; Qi Tian (Huawei Cloud & AI)

389 - Supervised Hierarchical Deep Hashing for Cross-Modal Retrieval

Yu-Wei Zhan (Shandong University); Xin Luo (Shandong University)*; Yongxin Wang (Shandong University); Xin-Shun Xu (Shandong University)

391 - "PiRhDy: LearningPitch-,Rhythm-,and Dynamics-aware Embeddings for Symbolic Music"

Hongru Liang (Nankai University); Wenqiang Lei (National University of Singapore)*; Paul Yaozhu Chan (A_STAR); Zhenglu Yang (Nankai University); Maosong Sun (Tsinghua University); Tat-Seng Chua (National Univ. of Singapore)

395 - An Egocentric Action Anticipation Framework via Fusing Intuition and Analysis

"Tianyu Zhang (ICT)*; Weiqing Min (Institute of Computing Technology, Chinese Academy of Sciences); Ying Zhu (University of Chinese Academy of Sciences); Yong Rui (Lenovo); Shuqiang Jiang (ICT, China Academy of Science)"

396 - HiFaceGAN: Face Renovation via Collaborative Suppression and Replenishment

"Lingbo Yang (Peking University)*; Chang Liu (University of Chinese Academy of Sciences); Pan Wang (Alibaba Group); Shanshe Wang (Peking University); Peiran Ren (Alibaba ); Siwei Ma (Peking University, China); Wen Gao (PKU)"

397 - PatchMatch based Multiview Stereo with Local Quadric Window

"Hyewon Song (Yonsei university); Jaeseong Park (Yonsei University); Suwoong Heo (Yonsei University); Jiwoo Kang (Yonsei University); Sanghoon Lee (Yonsei University, Korea)*"

403 - Regularized Two-Branch Proposal Networks for Weakly-Supervised Moment Retrieval in Videos

Zhu Zhang (Zhejiang University)*; Zhijie Lin (Zhejiang University); Zhou Zhao (Zhejiang University); jieming zhu (Huawei Noah''s Ark Lab); Xiuqiang He (Huawei Noah's Ark Lab)

411 - Discernible Image Compression

Zhaohui Yang (Peking University)*; Yunhe Wang (Huawei Technologies); Chang Xu (University of Sydney); Peng Du (Hangzhou Dianzi University); Chao Xu (Peking University); Chunjing Xu (Huawei Noah's Ark Lab); Qi Tian (Huawei Cloud & AI)

419 - Feature Reintegration over Differential Treatment: A Top-down and Adaptive Fusion Network for RGB-D Salient Object Detection

Miao Zhang (Dalian University of Technology); Yu Zhang (Dalian University of Technology); Yongri Piao (Dalian University of Technology)*; Beiqi Hu (Dalian University of Technology); Huchuan Lu (Dalian University of Technology)

434 - Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation

"Jialian Wu (State University of New York at Buffalo)*; Liangchen Song (University at Buffalo); Tiancai Wang (Tianjin University); Qian Zhang (Horizon Robotics); Junsong Yuan (""State University of New York at Buffalo, USA"")"

436 - Label Embedding Online Hashing for Cross-Modal Retrieval

Yongxin Wang (Shandong University); Xin Luo (Shandong University); Xin-Shun Xu (Shandong University)*

442 - Privacy-sensitive Objects Pixelation for Live Video Streaming

Jizhe Zhou (University of Macau); Chi-Man Pun (University of Macau)*; Yu Tong (University of Macau)

444 - Cloze Test Helps: Effective Video Anomaly Detection via Learning to Complete Video Events

Guang Yu (National University of Defense Technology)*; Siqi Wang (National University of Defense Technology); Zhiping Cai (NUDT); En Zhu (National University of Defense Technology); Chuanfu Xu (National University of Defense Technology); Jianping Yin (National University of Defense Technology); Marius Kloft (TU Kaiserslautern)

446 - All-in-depth via Cross-baseline Light Field Camera

"Dingjian Jin (Tsinghua university)*; Anke Zhang (Tsinghua University); Jiamin Wu (Tsinghua University); Gaochang Wu (Northeastern University); haoqian wang (Graduate School at Shenzhen, Tsinghua University); Lu Fang (Tsinghua University)"

458 - Dual Path Interaction Network for Video Moment Localization

Hao Wang (University of Science and Technology of China)*; Zheng-Jun Zha (University of Science and Technology of China); Xuejin Chen (University of Science and Technology of China); Zhiwei Xiong (University of Science and Technology of China); Jiebo Luo (U. Rochester)

464 - Adv-watermark: A Novel Watermark Perturbation for Adversarial Examples

"Xiaojun Jia (Institute of Information Engineering£¬Chinese Academy of Sciences); Xingxing Wei (Beihang University); Xiaochun Cao (Chinese Academy of Sciences)*; Xiaoguang Han (Shenzhen Research Institute of Big Data, the Chinese University of Hong Kong (Shenzhen))"

480 - Deep Multimodal Neural Architecture Search

Zhou Yu (Hangzhou Dianzi University); Yuhao Cui (Hangzhou Dianzi University); Jun Yu (HDU)*; Meng Wang (Hefei University of Technology); Dacheng Tao (The University of Sydney); Qi Tian (Huawei Cloud & AI)

484 - CRSSC: Salvage Reusable Samples from Noisy Data for Robust Learning

Zeren Sun (Nanjing University of Science and Technology ); Xian-Sheng Hua (Alibaba Group); Yazhou Yao (Nanjing University of Science and Technology)*; Xiu-Shen Wei (Nanjing University of Science and Technology); Guosheng Hu (AnyVision); Jian Zhang (UTS)

488 - Deep Local Binary Coding for Person Re-Identification by Delving into the Details

Jiaxin Chen (Inception Institute of Artificial Intelligence)*; Jie Qin (Inception Institute of Artificial Intelligence); Yichao Yan (inception institute of artificial intelligence); Lei Huang (Inception Institute of Artificial Intelligence); Li Liu (the inception institute of artificial intelligence); Fan Zhu (Inception Institute of Artificial Intelligence); Ling Shao (Inception Institute of Artificial Intelligence)

497 - Expert Performance in the Examination of Interior Surfaces in an Automobile: Virtual Reality vs. Reality

Alexander Tesch (Volkswagen AG)*; Ralf Doerner (HS Rhein-Main)

508 - A Dual In-painting Model for Unsupervised Gaze Correction and Animation in the Wild

Jichao Zhang (University of Trento)*; Jingjing Chen (Shandong University); Hao Tang (University of Trento); Wei Wang (EPFL); Yan Yan (Texas State University); Enver Sangineto (University of Trento); Nicu Sebe (University of Trento)

519 - MMFL: Multimodal Fusion Learning for Text-Guided Image Inpainting

Qing Lin (Fudan University); Bo Yan (Fudan University)*; Jichun Li (Fudan University); Weimin Tan (Fudan University)

527 - Learning Hierarchical Graph for Occluded Pedestrian Detection

Gang LI (Nanjing University of Science and Technology); Jian Li (Tencent Youtu); Shanshan Zhang (Max Planck Institute for Informatics)*; Jian Yang (Nanjing University of Science and Technology)

531 - Vision Meets Wireless Positioning: Effective Person Re-identification with Recurrent Context Propagation

Yiheng Liu (University of Science and Technology of China)*; Wengang Zhou (University of Science and Technology of China); Mao Xi (University of Science and Technology of China); Sanjing Shen (University of Science and Technology of China); Houqiang Li (University of Science and Technology of China)

541 - Learning From Music to Visual Storytelling of Shots: A Deep Interactive Learning Mechanism

"Jen-Chun Lin (Academia Sinica)*; Wen-Li Wei (Academia Sinica); Yen-Yu Lin (National Chiao Tung University); Tyng-Luh Liu (Academia Sinica); Hong-Yuan Mark Liao (Institute of Information Science, Academia Sinica, Taiwan)"

543 - Adaptively-Accumulated Knowledge Transfer for Partial Domain Adaptation

(Video) ACM Multimedia 2020 Tutorial-part2-Vehicle re-ID: past, present and future - Wu Liu

Taotao Jing (Indiana University-Purdue University Indianapolis); Haifeng Xia (Indiana University-Purdue University Indianapolis); Zhengming Ding (Indiana University-Purdue University Indianapolis)*

549 - Multi-graph convolutional network for unsupervised 3D shape retrieval

"Weizhi Nie (Tianjin University); Yue Zhao (Tianjin University); An-An Liu (Tianjin University)*; Zan Gao (Qilu University of Technology (Shandong Academy of Sciences), Shandong Computer Science Center (National Supercomputer Center in Jinan), Shandong Artificial Intelligence Institute, China); Yu-ting Su (Tianjin University)"

553 - Asymmetric Deep Hashing for Efficient Hash Code Compression

"Shu Zhao (Institute of Information Engineering, Chinese Academy of Sciences); Dayan Wu (Institute of Information Engineering, Chinese Academy of Sciences)*; Wanqian Zhang (Institute of Information Engineering, Chinese Academy of Sciences); Yu Zhou (Institute of Information Engineering, CAS); Bo Li ( Institute of Information Engineering, Chinese Academy of Sciences); Weiping Wang (Institute of Information Engineering, CAS, China)"

554 - Box Guided Convolution for Pedestrian Detection

Jinpeng Li (Inception Institute of Artificial Intelligence); Shengcai Liao (Inception Institute of Artificial Intelligence)*; Hangzhi Jiang (CASIA); Ling Shao (Inception Institute of Artificial Intelligence)

580 - Cap2Seg: Inferring Semantic and Spatial Context from Captions for Zero-Shot Image Segmentation

Guiyu Tian (Peking University); Shuai Wang (BOE); Jie Feng (BOE); Li Zhou (BOE); Yadong Mu (Peking University)*

585 - Bottom-Up Foreground-Aware Feature Fusion for Person Search

"Wenjie Yang (Institute of Automation, Chinese Academy of Sciences)*; Dangwei Li (Institute of Automation, Chinese Academy of Sciences); Xiaotang Chen (Institute of Automation, Chinese Academy of Sciences); Kaiqi Huang (Institute of Automation, Chinese Academy of Sciences)"

588 - Quaternion-Based Knowledge Graph Network for Recommendation

"Zhaopeng Li (State Key Laboratory of Information Security, Institute of Information Engineering, Chinese Academy of Sciences; University of Chinese Academy of Sciences)*; Qianqian Xu (Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences); Yangbangyan Jiang (Institute of Information Engineering, Chinese Academy of Sciences; University of Chinese Academy of Sciences); Xiaochun Cao (Chinese Academy of Sciences); Qingming Huang (University of Chinese Academy of Sciences)"

601 - Multi-Person Action Recognition in Microwave Sensors

Diangang Li (Xi'an Jiaotong University); Jianquan Liu (NEC Corporation)*; Shoji Nishimura (NEC Corporation); Yuka Hayashi (NEC Corporation); Jun Suzuki (NEC Corporation); Yihong Gong (Xi'an Jiaotong University)

605 - "Stronger, Faster and More Explainable: A Graph Convolutional Baseline for Skeleton-based Action Recognition"

"Yi-Fan Song (University of Chinese Academy of Sciences)*; Zhang Zhang (Institute of Automation, Chinese Academy of Sciences); Caifeng Shan (CAS-AIR); Liang Wang (NLPR, China)"

606 - Spatial-Temporal Knowledge Integration: Robust Self-Supervised Facial Landmarks Tracking

Congcong Zhu (Shanghai University); Xiaoqiang Li (Shanghai University)*; Jide Li ( Shanghai University); Guangtai Ding (Shanghai University); Weiqin Tong (Shanghai University)

612 - Norm-in-Norm Loss with Faster Convergence and Better Performance for Image Quality Assessment

Dingquan Li (Peking University); Tingting Jiang (Peking University)*; Ming Jiang (Peking University)

615 - Weakly Supervised 3D Object Detection from Point Clouds

Zengyi Qin (MIT); Jinglu Wang (Microsoft Research Asia)*; Yan Lu (Microsoft Research Asia)

619 - Neutral Face Game Character Auto-Creation via Poker-GAN

Tianyang Shi (NetEase Fuxi AI Lab)*; Zhengxia Zou (University of Michigan); Xinhui Song (Netease Fuxi AI Lab); Zheng Song (NetEase Fuxi AI Lab); Changjian Gu (NetEase Fuxi AI Lab); Yi Yuan (NetEase Fuxi AI Lab); Changjie Fan (NetEase Fuxi AI Lab)

621 - DIMC-net: Deep Incomplete Multi-view Clustering Network

"Jie Wen (Harbin Institute of Technology, Shenzhen)*; Zheng Zhang (Harbin Institute of Technology, Shenzhen); Zhihao Wu (Harbin Institute of Technology, Shenzhen); Lunke Fei (Guangdong University of Technology); Zhao Zhang (Hefei University of Technology); Yong Xu (Harbin Institute of Technology Shenzhen Graduate School); Bob Zhang (Univerisity of Macau)"

630 - Adversarial Image Attacks Using Multi-Sample and Most-Likely Ensemble Methods

Xia Du (University of Macau); Chi-Man Pun (University of Macau)*

632 - Cross-domain Cross-modal Food Transfer

Bin Zhu (City University of Hong Kong)*; Chong-Wah Ngo (City University of Hong Kong); Jingjing Chen (Fudan University)

639 - Coupling deep textural and shape features for sketch retrieval

Qi Jia (Dalian University of Technology); Xin Fan (Dalian University of Technology)*; Meiyu Yu (Didi Chuxing); Yuqing Liu (Dalian University of Technology); Dingrong Wang (Dalian University of Technology); Longin Jan Latecki (Temple University)

647 - Memory-Augmented Relation Network for Few-Shot Learning

He Jun (Hefei University of Technology)*; Richang Hong (Hefei University of Technology); Xueliang Liu (Hefei University of Technology); Mingliang Xu (Zhengzhou University); Zheng-Jun Zha (University of Science and Technology of China); Meng Wang (Hefei University of Technology)

654 - Panoptic Image Annotation with a CollaborativeAssistant

Jasper Uijlings (Google Research)*; Misha Andriluka (Google); Vittorio Ferrari (Google Research)

663 - Rethinking Generative Zero-Shot Learning: An Ensemble Learning Perspective for Recognising Visual Patches

Zhi Chen (The University of Queensland)*; Sen Wang (The University of Queensland); Jingjing Li (University of Electronic Science and Technology of China); Zi Huang (University of Queensland)

668 - Performance Optimization of Federated Person Re-identification via Benchmark Analysis

Weiming Zhuang (Nanyang Technological University)*; Yonggang Wen (Nanyang Technological University); Xuesen Zhang (SenseTime); Xin Gan (SenseTime); Daiying Yin (SenseTime); Dongzhan Zhou (The University of Sydney); shuai zhang (Sensetime Ltd); Shuai Yi (SenseTime Group Limited)

688 - Surpassing Real-World Source Training Data: Random 3D Characters for Generalizable Person Re-Identification

Yanan Wang (Inception Institute of Artificial Intelligence)*; Shengcai Liao (Inception Institute of Artificial Intelligence); Ling Shao (Inception Institute of Artificial Intelligence)

691 - Guided Attention Network for Object Detection and Counting on Drones

CAI YuanQiang (UCAS); Dawei Du (University of Chinese Academy of Sciences); Libo Zhang (Institute of Software Chinese Academy of Sciences)*; Longyin Wen (JD Digit); Weiqiang Wang (University of Chinese Academy of Sciences); Yanjun Wu (Institute of Software Chinese Academy of Sciences ); Siwei Lyu (University at Albany)

696 - K-armed Bandit based Multi-Modal Network Architecture Search for Visual Question Answering

"Yiyi Zhou (Xiamen University); Rongrong Ji (Xiamen University, China)*; Xiaoshuai Sun ( Xiamen University); Gen Luo (Xiamen University); Xiaopeng Hong (Xi'an Jiaotong University); Jinsong Su (Xiamen University); Xinghao Ding (Xiamen University); Ling Shao (Inception Institute of Artificial Intelligence)"

700 - Simultaneous Semantic Alignment Network for Heterogeneous Domain Adaptation

Shuang Li (Beijing Institute of Technology); Binhui Xie (Beijing Institute of Technology ); Jiashu Wu (University of Melbourne); Ying Zhao (Beijing Institute of Technology); Chi Harold Liu (Beijing Institute of Technology)*; Zhengming Ding (Indiana University-Purdue University Indianapolis)

701 - TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection

"Fangfang Wang (Zhejiang University)*; Yifeng Chen (Zhejiang University); Fei Wu (Zhejiang University, China); Xi Li (Zhejiang University)"

703 - Deep Cross-scale Fusion Network for Single Image Rain Removal

Cong Wang (Dalian University of Technology)*; Xiaoying Xing (Tsinghua University); Zhixun Su (Dalian University of Technology); junyang chen (University of Macau)

704 - Class-Aware Modality Mix and Center-Guided Metric Learning for Visible-Thermal Person Re-Identification

"Yongguo Ling (Xiamen University)*; Zhun Zhong (University of Trento); Zhiming Luo (Xiamen University); Paolo Rota (University of Trento); Shaozi Li (Xiamen University, China); Nicu Sebe (University of Trento)"

707 - Adversarial Graph Representation Adaptation for Cross-Domain Facial Expression Recognition

Yuan Xie (DarkMatter AI); Tianshui Chen (DarkMatter AI)*; Tao Pu (Sun Yat-sen University); Hefeng Wu (Sun Yat-sen University); Liang Lin (DarkMatter AI)

708 - Self-Paced Video Data Augmentation by Generative Adversarial Networks with Insufficient Samples

Yumeng Zhang (Tsinghua University); GaoGuo Jia (Tsinghua University ); Li Chen (Tsinghua University)*; MingRui Zhang (Beijing University of Posts and Telecommunications); JunHai Yong (Tsinghua University)

710 - Weakly Supervised Real-time Image Cropping based on Aesthetic Distributions

"Peng Lu (Beijing University of Posts and Telecommunications)*; Jiahui Liu (Beijing University of Posts and Telecommunications); Xujun Peng (Information Sciences Institute, University of Southern California); Xiaojie Wang (Beijing University of Posts and Telecommunications)"

732 - Towards Unsupervised Crowd Counting via Regression-Detection Bi-knowledge Transfer

Yuting Liu (Sichuan University)*; Zheng Wang (National Institute of Informatics); Miaojing Shi (King's College London); Shin'ichi Satoh (National Institute of Informatics); Qijun Zhao (Sichuan University); hongyu yang (sichuan university)

734 - KBGN: Knowledge-Bridge Graph Network for Adaptive Vision-Text Reasoning in Visual Dialogue

"Xiaoze Jiang (Intelligent Computing & Machine Learning Lab, School of ASEE, Beihang University); Siyi Du (Intelligent Computing & Machine Learning Lab, School of ASEE, Beihang University); Zengchang Qin (Intelligent Computing & Machine Learning Lab, School of ASEE, Beihang University)*; Yajing Sun (Institute of Information Engineering,Chinese Academy of Sciences); JING YU (Institute of Information Engineering, Chinese Academy of Sciences)"

736 - Uncertainty-based Traffic Accident Anticipation with Spatio-Temporal Relational Learning

Wentao Bao (Rochester Institute of Technology)*; Qi Yu (Rochester Institute of Technology); Yu Kong (Rochester Institute of Technology)

737 - Occluded Prohibited Items Detection: An X-ray Security Inspection Benchmark and De-occlusion Attention Module

Yanlu Wei (Beihang University); Renshuai Tao (Beihang University)*; Zhangjie Wu (Beihang University); Yuqing Ma (Beihang University); Libo Zhang (Institute of Software Chinese Academy of Sciences); Xianglong Liu (BUAA)

738 - CF-SIS: Semantic-Instance Segmentation of 3D Point Clouds by Context Fusion with Self-Attention

"Xin Wen (Tsinghua University); Zhizhong Han (University of Maryland, College Park); Geunhyuk Youk (Tsinghua University); Yu-Shen Liu (Tsinghua University)*"

743 - Diverter-Guider Recurrent Network for Diverse Poems Generation from Image

"liang li (Institute of Computing Technology, Chinese Academy of Sciences)*; Shijie Yang (vipl,ict,Chinese academic of science); Li Su (University of Chinese Academy of Sciences); Shuhui Wang (VIPL,ICT,Chinese academic of science); Chenggang Yan (Hangzhou Dianzi University); Zheng-Jun Zha (University of Science and Technology of China); Qingming Huang (University of Chinese Academy of Sciences)"

758 - Hybrid Resolution Network Using Edge Guided Region Mutual Information Loss for Human Parsing

Yunan Liu (Nanjing University of Science & Technology)*; Liang Zhao (Nanjing University of Science & Technology); Shanshan Zhang (Max Planck Institute for Informatics); Jian Yang (Nanjing University of Science and Technology)

760 - Meta-RCNN: Meta Learning for Few-Shot Object Detection

Xiongwei Wu (Singapore Management U)*; Doyen Sahoo (Salesforce); Steven Hoi (Singapore Management University)

762 - Texture Semantically Aligned with Visibility-aware for Partial Person Re-identification

"Lishuai Gao (Tianjin University of Technology); Hua Zhang (Tianjin University of Technology); Zan Gao (1. Shandong AI Institute, QiLU University of Technology, 2. Shandong Computer Science Center(National Supercomputer Center in Jinan), 3. Tianjing University of Technology)*; Weili Guan (Monash University); Zhiyong Cheng (Shandong Academy of Sciences); Meng Wang (Hefei University of Technology)"

765 - Context-aware Attention Network for Predicting Image Aesthetic Subjectivity

"Munan Xu (Shenzhen Graduate School, Peking University); Jia-Xing Zhong (School of Electronic and Computer Engineering, Peking University); Yurui Ren (Shenzhen Graduate School, Peking University); Shan Liu (Tencent America); Ge Li (SECE, Shenzhen Graduate School, Peking University)*"

769 - OCR: Objectness Consistent Representation for Weakly Supervised Object Detection

"Ke Yang (NUDT)*; Peng Zhang (NUDT); Peng Qiao (NUDT); Zhiyuan Wang (AIRC); Dongsheng Li (School of Computer Science, National University of Defense Technology); Yong Dou (National University of Defense Technology)"

774 - Bridging the Gap between Vision and Language Domains for Improved Image Captioning

Fenglin Liu (Peking University)*; Xian Wu (Tencent Medical AI Lab); Shen Ge (Tencent Medical AI Lab); Xiaoyu Zhang ( Peking University); Wei Fan (Tencent); Yuexian Zou (Peking University)

783 - PIDNet: An Efficient Network for Dynamic Pedestrian Intrusion Detection

Jingchen Sun (Zhejiang University); Jiming Chen (Zhejiang University); Tao Chen (Fudan University); jiayuan fan (Fudan University); Shibo He (Zhejiang University)*

787 - ChoreoNet: Torwards Music to Dance Synthesis with Choreographic Action Unit

"Zijie Ye (Tsinghua University)*; Haozhe Wu (Tsinghua University); Jia Jia (Tsinghua University); Yaohua Bu (Tsinghua University); Wei Chen (Beijing Sougou Science and Technology Development Co., Ltd); Fanbo Meng (Sogou Corporation, Beijing, China); Yanfeng Wang ( Beijing Sougou Science and Technology Development Co., Ltd)"

790 - Unpaired Image Enhancement with Quality-Attention Generative Adversarial Network

Zhangkai NI (City University of Hong Kong)*; Wenhan Yang (Peking University); Shiqi Wang (CityU); Lin Ma (Tencent AI Lab); Sam Kwong (City Univeristy of Hong Kong)

793 - STRONG: Spatio-Temporal Reinforcement Learning for Cross-Modal Video Moment Localization

Da Cao (Hunan University)*; Yawen Zeng (Hunan University); Meng Liu (Shandong Jianzhu University); Xiangnan He (University of Science and Technology of China); Meng Wang (Hefei University of Technology); Zheng Qin (Hunan University)

794 - Adversarial Video Moment Retrieval by Jointly Modeling Ranking and Localization

Da Cao (Hunan University)*; Yawen Zeng (Hunan University); Xiaochi Wei (Baidu Inc.); Liqiang Nie (Shandong University ); Richang Hong (Hefei University of Technology); Zheng Qin (Hunan University)

795 - Pose-native Network Architecture Search for Multi-person Human Pose Estimation

Qian Bao (AI Research of JD.com); Wu Liu (AI Research of JD.com)*; Jun Hong (AI Research of JD.com); Lingyu Duan (Peking University); Tao Mei (AI Research of JD.com)

798 - ASTA-Net: Adaptive Spatio-Temporal Attention Network for Person Re-Identification in Videos

Xierong Zhu (University of Science and Technology of China)*; Jiawei Liu (University of Science and Technology of China); Haoze Wu (University of Science and Technology of China); Meng Wang (Hefei University of Technology); Zheng-Jun Zha (University of Science and Technology of China)

801 - Talking Face Generation with Expression-Tailored Generative Adversarial Network

Dan Zeng (Shanghai University); Han Liu (Shanghai University); Hui Lin (); Shiming Ge (Chinese Academy of Sciences)*

808 - Blind Natural Video Quality Prediction via Statistical Temporal Features and Deep Spatial Features

Jari Korhonen (Shenzhen University)*; Yicheng Su (Shenzhen University); Junyong You (Norwegian Research Centre)

812 - Cross-Modal Omni Interaction Modeling for Phrase Grounding

"Tianyu Yu (Beihang University)*; Tianrui Hui (Institute of Information Engineering, Chinese Academy of Sciences); Zhihao Yu (Beihang University); Yue Liao (Beihang University); si liu (Beihang University); Sansi Yu (Tencent); Faxi Zhang (Tencent)"

814 - Cascade Grouped Attention Network for Referring Expression Segmentation

"Gen Luo (Xiamen University); Rongrong Ji (Xiamen University, China)*; Yiyi Zhou (Xiamen University); Xiaoshuai Sun ( Xiamen University); Jinsong Su (Xiamen University); Chia-Wen Lin (National Tsing Hua University); Qi Tian (Huawei Cloud & AI)"

816 - Temporally Guided Music-to-Body-Movement Generation

Hsuan-Kai Kao (Academia Sinica); Li Su (Academia Sinica)*

818 - Compositional Few-Shot Recognition with Primitive Discovery and Enhancing

Yixiong Zou (Peking University)*; Shanghang Zhang (UC Berkeley); Ke Chen (South China University of Technology); Jos¨¦ M. F. Moura (Carnegie Mellon University); Yaowei Wang (PengCheng Laboratory); Yonghong Tian (Peking University)

820 - Language-Aware Fine-Grained Object Representation for Referring Expression Comprehension

Heqian Qiu (University of Electronic Science and Technology of China)*; Hongliang Li (University of Electronic Science and Technology of China); Qingbo Wu (University of Electronic Science and Technology of China); Fanman Meng (University of Electronic Science and Technology of China); Hengcan Shi ( University of Electronic Science and Technology of China); Taijin Zhao (University of Electronic Science and Technology of China); King Ngi Ngan (University of Electronic Science and Technology of China)

821 - Bridging the Web Data and Fine-Grained Visual Recognition via Alleviating Label Noise and Domain Mismatch

Yazhou Yao (Nanjing University of Science and Technology)*; Xian-Sheng Hua (Alibaba Group); Guanyu Gao (Nanjing University of Science and Technology); Zeren Sun (Nanjing University of Science and Technology ); Zhibin Li (University of Technology Sydney ); Jian Zhang (UTS)

823 - March on Data Imperfections: Domain Division and Domain Generalization for Semantic Segmentation

Hai Xu (University of Science and Technology of China); Hongtao Xie (University of Science and Technology of China)*; Zheng-Jun Zha (University of Science and Technology of China); Sun-Ao Liu (University of Science and Technology of China); Yongdong Zhang (University of Science and Technology of China)

825 - Aesthetic-Aware Image Style Transfer

Zhiyuan Hu (Tsinghua University); Jia Jia (Tsinghua University)*; Bei Liu (Microsoft Research); Yaohua Bu (Tsinghua University); Jianlong Fu (Microsoft Research)

830 - InteractGAN: Learning to Generate Human-Object Interaction

"Chen Gao (Institute of Information Engineering, CAS)*; si liu (Beihang University); Defa Zhu (Institute of Information Engineering, CAS); Quan Liu (Beihang University); Jie Cao (Institute of Automation, Chinese Academy of Sciences); Haoqian He (Beihang University); Ran He (Institute of Automation, Chinese Academy of Sciences); Shuicheng Yan (YITU Tech)"

831 - Is Depth Really Necessary for Salient Object Detection?

Jiawei Zhao (Beihang University); Yifan Zhao (Beihang University); Jia Li (Beihang University)*; Xiaowu Chen (Beihang University)

836 - Zero-Shot Multi-View Indoor Localization via Graph Location Networks

Meng-Jiun Chiou (National University of Singapore)*; Zhenguang Liu (Zhejiang Gongshang University); Yifang Yin (National University of Singapore); An-An Liu (Tianjin University); Roger Zimmermann (NUS)

845 - Self-Play Reinforcement Learning for Fast Image Retargeting

Nobukatsu Kajiura (The University of Tokyo)*; Satoshi Kosugi (The University of Tokyo); Xueting Wang (The University of Tokyo); Toshihiko Yamasaki (The University of Tokyo)

849 - Brain-media: A dual conditioned and lateralization supported GAN (DCLS-GAN) towards visualization of image-evoked brain activities

Ahmed Fares (Shenzhen University); Sheng-hua Zhong (Shenzhen University); Jianmin Jiang (Shenzhen University)*

850 - Hierarchical Scene Graph Encoder-Decoder for Image Paragraph Captioning

XU YANG (Nanyang Technological University)*; Chongyang Gao (Dartmouth College); Hanwang Zhang (Nanyang Technological University); Jianfei Cai (Monash University)

854 - Deep Concept-wise Temporal Convolutional Networks for Action Localization

"Xin Li (Baidu); Tianwei Lin (Baidu)*; Xiao Liu (Baidu); Wangmeng Zuo (Harbin Institute of Technology, China); Chao Li (Baidu); Xiang Long (Baidu); Dongliang He (Baidu); Fu Li (Baidu); Shilei Wen (Baidu Research); Chuang Gan (MIT-IBM Watson AI Lab)"

859 - Gait Recognition with Multiple-Temporal-Scale 3D Convolutional Neural Network

BeiBei Lin (Beijing Jiaotong University); Shunli Zhang (Beijing Jiaotong University)*; Feng Bao (Beijing Jiaotong University)

867 - Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos

"Jie Wu (Sun Yat-sen University)*; Guanbin Li (Sun Yat-sen University); Xiaoguang Han (Shenzhen Research Institute of Big Data, the Chinese University of Hong Kong (Shenzhen)); Liang Lin (DarkMatter AI)"

876 - Traffic-Aware Multi-Camera Tracking of Vehicles Based on ReID and Camera Link Model

Hung-Min Hsu (UW)*; Yizhou Wang (University of Washington); Jenq-Neng Hwang (University of WA_)

887 - Hierarchical Gumbel Attention Network for Text-based Person Search

Kecheng Zheng (University of Science and Technology of China); Wu Liu (AI Research of JD.com)*; Jiawei Liu (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China); Tao Mei (AI Research of JD.com)

892 - Mesh Guided One-shot Face Reenactment Using Graph Convolutional Networks

Guangming Yao (NetEase Fuxi AI Lab)*; Yi Yuan (NetEase Fuxi AI Lab); Tianjia Shao (Zhejiang University); Kun Zhou (Zhejiang University)

893 - VONAS: Network Design in Visual Odometry using Neural Architecture Search

"Xing Cai (Peking University); Lanqing Zhang (Peking University); Chengyuan Li (Peking University); Ge Li (SECE, Shenzhen Graduate School, Peking University); Thomas H Li (Advanced Institute of Information Technology, Peking University)*"

910 - "A Tightly-coupled Semantic SLAM System with Visual, Inertial and Surround-view Sensors for Autonomous Indoor Parking"

"Xuan Shao (Tongji University); Lin Zhang (Tongji University, China)*; Tianjun Zhang (Tongji University); Ying Shen (Tongji University); Hongyu Li (tongdun); Yicong Zhou (University of Macau)"

911 - Controllable Continuous Gaze Redirection

"Weihao Xia (Tsinghua University)*; Yujiu Yang (Tsinghua University); Jing-Hao Xue (University College London); Wensen Feng (College of Computer Science & Software Engineering, Shenzhen University)"

915 - "Look, Listen, and Attend: Co-Attention Network for Self-Supervised Audio-Visual Representation Learning"

Ying Cheng (Fudan University)*; Ruize Wang (Fudan University); Zhihao Pan (Fudan University); Rui Feng (Fudan University); Yuejie Zhang (Fudan University)

918 - SRHEN: Stepwise-Refining Homography Estimation Networkvia Parsing Geometric Correspondences in Deep Latent Space

"Yi Li (Harbin Institute of Technology (Shenzhen)); Wenjie Pei (Harbin Institute of Technology, Shenzhen); Zhenyu He (Harbin Institute of Technology (Shenzhen); Peng Cheng Laboratory)*"

921 - Category-specific Semantic Coherency Learning for Fine-grained Image Recognition

Shijie Wang (Dalian University of Technology); zhihui wang (Dalian University of Technology); Haojie Li (Dalian University of Technology)*; Wanli Ouyang (The University of Sydney)

922 - Preserving Global and Local Temporal Consistency for Arbitrary Video Style Transfer

Xinxiao Wu (Beijing Institute of Technology)*; Jialu Chen (Beijing Institute of Technology)

925 - Deep Shapely Portraits

"Qinjie Xiao (Zhejiang University)*; Xiangjun Tang (Zhejiang University); Leyang Jin (The Chinese University of Hong Kong, Shenzhen); Yu Wu (Zhejiang University); Yongliang Yang (University of Bath); Xiaogang Jin (Zhejiang University)"

927 - Depth Super-Resolution via Deep Controllable Slicing Network

Xinchen Ye (Dalian University of Technology)*; Baoli Sun (Dalian University of Technology); zhihui wang (Dalian University of Technology); Jingyu Yang (Tianjin University); Rui Xu (Dalian University of Techonology); Haojie Li (Dalian University of Technology); Baopu Li (Baidu Research(USA))

931 - Efficient Joint Gradient Based Attack Against SOR Defense for 3D Point Cloud Classification

"Chengcheng Ma (Institute of Automation, Chinese Academy of Sciences)*; Weiliang Meng (Institute of Automation, Chinese Academy of Sciences); Baoyuan Wu (Tencent AI Lab); Shibiao Xu (Institute of Automation, Chinese Academy of Sciences); Xiaopeng Zhang (Institute of Automation, Chinese Academy of Sciences)"

933 - Discrete Haze Level Dehazing Network

Xiao-Feng Cong (Anhui University); Jie Gui (Umich); Kai-Chao Miao (Anhui Meteorological Bureau); Jun Zhang (Anhui university)*; Bing Wang (Anhui University of Technology); Peng Chen (Anhui University)

942 - Improving Intra- and Inter-Modality Visual Relation for Image Captioning

"Yong Wang (Aerospace Information Research Institute, Chinese Academy of Sciences;University of Chinese Academy of Sciences)*; WenKai Zhang (Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, China); Qing Liu (Aerospace Information research Institute, Chinese Academy of Sciences); Zhengyuan Zhang (Aerospace Information Research Institute, Chinese Academy of Sciences); Xin Gao (Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100190, China); Xian Sun (IECAS)"

950 - Dual Context-Aware Refinement Network for Person Search

Jiawei Liu (University of Science and Technology of China)*; Zheng-Jun Zha (University of Science and Technology of China); Richang Hong (HeFei University of Technology); Meng Wang (Hefei University of Technology); Yongdong Zhang (University of Science and Technology of China)

951 - Exploring Language Prior for Mode-Sensitive Visual Attention Modeling

"Xiaoshuai Sun ( Xiamen University)*; Xuying Zhang (Xiamen University); Liujuan Cao (Xiamen University); Yongjian Wu (Tencent Technology (Shanghai) Co.,Ltd); Feiyue Huang (Tencent); Rongrong Ji (Xiamen University, China)"

961 - Poet: Product-oriented Video Captioner for E-commerce

"Shengyu Zhang (Zhejiang University)*; Ziqi Tan (Zhejiang University); Jin Yu (Alibaba Group); Zhou Zhao (Zhejiang University); Kun Kuang (Zhejiang University); jie liu (Alibaba); Jingren Zhou (Alibaba Group); Hongxia Yang (Alibaba Group); Fei Wu (Zhejiang University, China)"

966 - Building Movie Map - A Tool for Exploring in a City - and its Evaluations

Naoki Sugimoto (University of Tokyo)*; Yoshihito Ebine (VTEC Laboratories Inc.); Kiyoharu Aizawa (The University of Tokyo)

970 - Searching Privately by Imperceptible Lying: A Novel Private Hashing Method with Differential Privacy

Yimu Wang (Nanjing University)*; Shiyin Lu (Nanjing University); Lijun Zhang (Nanjing University)

976 - Beyond the Attention: Distinguish the Discriminative and Confusable Features For Fine-grained Image Classification

"Xiruo Shi (Beijing University of Posts and Telecommunications ); Liutong Xu (Beijing University of Posts and Telecommunications); Pengfei Wang (School of Computer Science, Beijing University of Posts and Telecommunications); Yuanyuan Gao (Beihang Univeristy); Haifang Jian (Institute of Semiconductors, Chinese Academy of Sciences); Wu Liu (AI Research of JD.com)*"

977 - BlockMix: Meta Regularization and Self-Calibrated Inference for Metric-Based Meta-Learning

Hao Tang (Nanjing University of Science and Technology); Zechao Li (Nanjing University of Science and Technology)*; Zhimao Peng (Nanjing University of Science and Technology); Jinhui Tang (Nanjing University of Science and Technology)

980 - Structural Semantic Adversarial Active Learning for Image Captioning

"Beichen Zhang (University of Chinese Academy of Sciences)*; liang li (Institute of Computing Technology, Chinese Academy of Sciences); Li Su (University of Chinese Academy of Sciences); Shuhui Wang (VIPL,ICT,Chinese academic of science); Jincan Deng (Institute of Computing Technology, Chinese Academy of Sciences); Zheng-Jun Zha (University of Science and Technology of China); Qingming Huang (University of Chinese Academy of Sciences)"

983 - Topic Adaptation and Prototype Encoding for Few-Shot Visual Storytelling

"Jiacheng Li (Zhejiang University); Siliang Tang (Zhejiang University)*; Juncheng Li (Zhejiang University); Jun Xiao (Zhejiang University); Fei Wu (Zhejiang University, China); Shiliang Pu (Hikvision Research Institute); Yueting Zhuang (Zhejiang University)"

988 - Scene-Aware Context Reasoning for Unsupervised Abnormal Event Detection in Videos

"Che Sun (Beijing Institute of Technology); Yunde Jia (Beijing Institute of Technology); Yao Hu (Alibaba Youku Cognitive and Intelligent Lab); Yuwei WU (Beijing Institute of Technology (BIT), China)*"

1002 - Active Object Search

Jie Wu (Sun Yat-sen University)*; Tianshui Chen (DarkMatter AI); Lishan Huang (Sun Yat-Sen University); Hefeng Wu (Sun Yat-sen University); Guanbin Li (Sun Yat-sen University); Ling Tian (University of Electronic Science and Technology of China); Liang Lin (DarkMatter AI)

1009 - Deep-Modal: Real-Time Impact Sound Synthesis for Arbitrary Shapes

Xutong Jin (Peking University); Sheng Li (Peking University)*; Tianshu Qu (Peking University); Dinesh Manocha (UMD); Guoping Wang (Peking University)

1011 - Fine-grained Feature Alignment with Part Perspective Transformation for Vehicle ReID

"Dechao Meng (vipl,ict,Chinese academic of science)*; Liang Li (Chinese Academy of Sciences); Shuhui Wang (VIPL,ICT,Chinese academic of science); Xingyu Gao (Chinese Academy of Sciences); Zheng-Jun Zha (University of Science and Technology of China); Qingming Huang (University of Chinese Academy of Sciences)"

1013 - Deep Heterogeneous Multi-Task Metric Learning for Visual Recognition and Retrieval

"Shikang Gan (Nanyang Technological University); YONG LUO (Nanyang Technological University)*; Yonggang Wen (Nanyang Technological University); Tongliang Liu (The University of Sydney); Han Hu (Beijing Institute of Technology, China)"

1016 - HOSE-Net:Higher Order Structure Embedded Network for Scene Graph Generation

meng wei (Graduate school at ShenZhen£¬Tsinghua university)*; Chun Yuan (Graduate school at ShenZhen£¬Tsinghua university); Xiaoyu Yue (SenseTime); Kuo Zhong (Graduate school at ShenZhen£¬Tsinghua university)

1022 - ICECAP: Information Concentrated Entity-aware Image Captioning

Anwen Hu (Renming University of China)*; Shizhe Chen (Renmin University of China); Qin Jin (Renmin University of China)

1035 - Transformer-based Label Set Generation for Multi-modal Multi-label Emotion Detection

xincheng Ju (Soochow University)*; Dong Zhang (Soochow University); Junhui Li (Soochow University); Zhou Guodong (Soochow University)

1038 - Beyond the Parts: Learning Multi-view Cross-part Correlation for Vehicle Re-identification

Xinchen Liu (AI Research of JD.com); Wu Liu (AI Research of JD.com)*; Jinkai Zheng (Hangzhou Dianzi University); Chenggang Yan (Hangzhou Dianzi University); Tao Mei (AI Research of JD.com)

1039 - Semi-supervised Multi-modal Emotion Recognition with Cross-Modal Distribution Matching

Jingjun Liang (Renmin University of China); Ruichen Li (Renmin University of China); Qin Jin (Renmin University of China)*

1043 - Attacking Image Captioning Towards Accuracy-Preserving Target Words Removal

"Jiayi Ji (Xiamen University)*; Rongrong Ji (Xiamen University, China); Yiyi Zhou (Xiamen University); Xiaoshuai Sun ( Xiamen University); Fuhai Chen (Xiamen University); Jianzhuang Liu (Huawei Noah's Ark Lab); Qi Tian (Huawei Cloud & AI)"

1046 - Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization

Haoming Xu (South China University of Technology); Runhao Zeng (South China University of Technology); Qingyao Wu (South China University of Technology); Mingkui Tan (South China University of Technology)*; Chuang Gan (MIT-IBM Watson AI Lab)

1064 - "Look, Read and Feel: Benchmarking Ads Understanding with Multimodal Multitask Learning"

"Huaizheng Zhang (Nanyang Technological University)*; YONG LUO (Nanyang Technological University); Qiming Ai (Nanyang Technological University); Han Hu (Beijing Institute of Technology, China); Yonggang Wen (Nanyang Technological University)"

1068 - Dual Semantic Fusion Network for Video Object Detection

"Lijian Lin (Xiamen University); Haosheng Chen (Xiamen University); Honglun Zhang (Applied Research Center, Tencent PCG); Jun Liang (Xiamen University); Yu Li (Tencent ); Ying Shan (Tencent); Hanzi Wang (Xiamen University)*"

1073 - Sharp Multiple Instance Learning for DeepFake Video Detection

"Xiaodan Li (Alibaba Group, China); Yining Lang (Alibaba Group); Yuefeng Chen (Alibaba Group)*; Xiaofeng Mao (Alibaba Group); Yuan He (Alibaba Group ); Shuhui Wang (VIPL,ICT,Chinese academic of science); hui xue (Alibaba); Quan Lu (Alibaba Group)"

1075 - Light Field Super-resolution via Attention-Guided Fusion of Hybrid Lenses

"Jing Jin (City University of Hong Kong); Junhui Hou (City University of Hong Kong, Hong Kong)*; Jie Chen (Hong Kong Baptist University); Sam Kwong (City Univeristy of Hong Kong); Jingyi Yu (Shanghai Tech University)"

1077 - Learning to Detect Specular Highlights from Real-world Images

"Gang Fu (Wuhan University)*; Qing Zhang ( Sun Yat-sen University); Qifeng Lin (School of Computer Science, Wuhan University); Lei Zhu (The Chinese University of Hong Kong); Chunxia Xiao (Wuhan University)"

1080 - Video Super-Resolution using Multi-scale Pyramid 3D Convolutional Networks

Jianping Luo (Shenzhen University); Shaofei Huang (Shenzhen University); yuan yuan (Shenzhen University)*

1085 - Tactile Sketch Saliency

Jianbo Jiao (University of Oxford); Ying Cao (City University of Hong Kong)*; Manfred Lau (City University of Hong Kong); Rynson W.H. Lau (City University of Hong Kong)

1098 - Who You Are Decides How You Tell

"Shuang Wu (National University of Singapore)*; Shaojing Fan (National University of Singapore); Zhiqi Shen (National University of Singapore); Mohan Kankanhalli (National University of Singapore,); Anthony Tung (NUS)"

1112 - PCA-SRGAN: Incremental Orthogonal Projection Discrimination for Face Super-resolution

"Hao Dou (Institude Of Automation,chinese Academy Of Sciences; University of Chinese Academy of Sciences)*; Chen Chen (The Chinese academy of science); Xiyuan Hu (School of Computer Science and Engineering, Nanjing University of Science and Technology); zuxing Xuan (Beijing Union University); Zhisen Hu (Beijing University of Posts and Telecommunications); Silong Peng (The Chinese academy of science)"

1122 - PersonalitySensing: A Multi-View Multi-Task Learning Approach for Personality Detection based on Smartphone Usage

Songcheng Gao (Nanjing University); Wenzhong Li (Nanjing University)*; Lynda J. Song (University of Leeds); Xiao Zhang (Shandong University); Mingkai Lin (Nanjing University); Sanglu Lu (NJU)

1132 - Exploring Font-independent Features for Scene Text Recognition

Yizhi Wang (Peking University)*; Zhouhui Lian (Peking University)

1133 - Context-aware Feature Generation For Zero-shot Semantic Segmentation

Zhangxuan Gu (Shanghai Jiao Tong University); Siyuan Zhou (Shanghai Jiao Tong University); Li Niu (Shanghai Jiao Tong University)*; Zihan Zhao (Shanghai Jiao Tong University); Liqing Zhang (Shanghai Jiao Tong University)

1141 - Gray2ColorNet: Transfer More Colors from Reference Image

"Peng Lu (Beijing University of Posts and Telecommunications)*; Jinbei Yu (Beijing University of Posts and Telecommunications); Xujun Peng (Information Sciences Institute, University of Southern California); Zhaoran Zhao (Beijing University of Posts and Telecommunications); Xiaojie Wang (Beijing University of Posts and Telecommunications)"

1147 - Compact Bilinear Augmented Query Structured Attention for Sport Highlights Classification

Yanbin Hao (City University of Hong Kong); Hao Zhang (City University of Hong Kong)*; Chong-Wah Ngo (City University of Hong Kong); Qiang Liu (DeepAIT (Hong Kong) Limited); Xiaojun Hu (DeepAIT (Hong Kong) Limited)

1152 - Leverage Social Media for Personalized Stress Detection

Xin Wang (Tsinghua University)*; Huijun Zhang (Tsinghua university); Lei Cao (Tsinghua university); Ling Feng (Tsinghua university)

1157 - Towards Clustering-friendly Representations: Subspace Clustering via Graph Filtering

Zhengrui Ma (University of Electronic Science and Technology); Zhao Kang (University of Electronic Science and Technology of China)*; Guangchun Luo (University of Electronic Science and Technology of China); Ling Tian (University of Electronic Science and Technology of China); Wenyu Chen (University of Electronic Science and Technology of China)

1173 - Heterogeneous Fusion of Semantic and Collaborative Information for Visually-Aware Food Recommendation

Lei Meng (National University of Singapore)*; Xiangnan He (University of Science and Technology of China); Fuli Feng (National University of Singapore); Xiaoyan Gao (Beijing Institute of Technology ); Tat-Seng Chua (National university of Singapore)

1180 - KTN: Knowledge Transfer Network for Multi-person DensePose Estimation

xuanhan wang (University of Electronic Science and Technology of China)*; Lianli Gao (The University of Electronic Science andTechnology of China); Jingkuan Song (UESTC); Heng Tao Shen (University of Electronic Science and Technology of China (UESTC))

1189 - ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection

"Ye Liu (Wuhan University)*; Junsong Yuan (""State University of New York at Buffalo, USA""); Chang Wen Chen (The Chinese University of Hong Kong and University at Buffalo)"

1195 - Semantic Image Analogy with a Conditional Single-Image GAN

Jiacheng Li (University of Science and Technology of China); Zhiwei Xiong (University of Science and Technology of China)*; Dong Liu (University of Science and Technology of China); Xuejin Chen (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China)

1196 - Trajectory Prediction in Heterogeneous Environment via Attended Ecology Embedding

(Video) Injong Rhee - Building Multi-Modal Interfaces for Smartphones (ACM-MM 2017 Keynote)

"Wei-Cheng Lai (National Chiao Tung University); Zi-Xiang Xia (National Chiao Tung University); Hao-Siang Lin (National Chiao Tung University); Lien-Feng Hsu (National Chiao Tung University); Hong-Han Shuai (National Chiao Tung University); I-Hong Jhuo (IBM); Wen-Huang Cheng (EE, NCTU)*"

1198 - A Unified Framework for Detecting Audio Adversarial Examples

"Xia Du (University of Macau); Chi-Man Pun (University of Macau)*; Zheng Zhang (Harbin Institute of Technology, Shenzhen)"

1200 - Defending Adversarial Examples via DNN Bottleneck Reinforcement

Wenqing Liu (Tongji University ); Miaojing Shi (King's College London); Teddy Furon (Inria); Li Li (Tongji University )*

1205 - AU-assisted Graph Attention Convolutional Network for Micro-Expression Recognition

"Hong-Xia Xie (National Chiao Tung University)*; Ling Lo ( National Chiao Tung University); Hong-Han Shuai (National Chiao Tung University); Wen-Huang Cheng (EE, NCTU)"

1209 - Revealing True Identity: Detecting Makeup Attacks in Face-based Biometric Systems

Mohammad Amin Arab (Simon Fraser University)*; Puria Azadi Moghadam (Simon Fraser University); Mohamed Hussein (USC/ISI); Wael Abd-Almageed (Information Sciences Institute); Mohamed Hefeeda (Simon Fraser University)

1214 - A Structured Graph Attention Network for Vehicle Re-Identification

Yangchun Zhu (University of Science and Technology of China)*; Zheng-Jun Zha (University of Science and Technology of China); Tianzhu Zhang (University of Science and Technology of China); Jiawei Liu (University of Science and Technology of China); Jiebo Luo (U. Rochester)

1217 - Arbitrary Style Transfer via Multi-Adaptation Network

"Yingying Deng (Institute of Automation£¬Chinese Academy of Sciences); Fan Tang (Fosafer); Weiming Dong (NLPR, Institute of Automation, Chinese Academy of Sciences)*; Wen Sun (University of Chinese Academy of Sciences); Feiyue Huang (Tencent); Changsheng Xu (CASIA)"

1224 - Scoring High: Analysis and Prediction of Viewer Behavior and Engagement in the Context of 2018 FIFA WC Live Streaming

Nikolas Wehner (University of W¨¹rzburg)*; Michael Seufert (University of W¨¹rzburg); Sebastian Egger-Lampl (AIT Austrian Institute of Technology GmbH); Bruno Gardlo (AIT Austrian Institute of Technology GmbH); Pedro Casas (AIT Austrian Institute of Technology GmbH); Raimund Schatz (AIT)

1226 - Weakly-Supervised Video Object Grounding by Exploring Spatio-Temporal Contexts

Xun Yang (National University of Singapore)*; Xueliang liu (Hefei University of Technology); Meng Jian (Beijing University of Technology); Xinjian Gao (Hefei University of Technology); Meng Wang (Hefei University of Technology)

1228 - S^2SiamFC: Self-supervised Fully Convolutional Siamese Network for Visual Tracking

"Chon Hou Sio (National Chiao Tung University); Yu-Jen Ma (National Chiao Tung University); Hong-Han Shuai (National Chiao Tung University)*; Jun-Cheng Chen (Academia Sinica); Wen-Huang Cheng (EE, NCTU)"

1263 - Learnable Optimal Sequential Grouping for Video Scene Detection

Daniel Rotman (IBM Research)*; Yevgeny Yaroker (IBM Research); Elad Amrani (IBM / Technion); Udi Barzelay (IBM ); Rami Ben-Ari (IBM-Research)

1266 - Dual-view Attention Networks for Single Image Super-Resolution

Jingcai Guo (The Hong Kong Polytechnic University)*; Shiheng Ma (Shanghai Jiao Tong University); Jie Zhang (The Hong Kong Polytechnic University); Qihua Zhou (The Hong Kong Polytechnic University); Song Guo (The Hong Kong Polytechnic University)

1269 - Activity-driven Weakly-Supervised Spatio-Temporal Grounding from Untrimmed Videos

Junwen Chen (Rochester Institute of Technology); Wentao Bao (Rochester Institute of Technology); Yu Kong (Rochester Institute of Technology)*

1275 - Text-Guided Neural Image Inpainting

"Lisai Zhang (Harbin Institute of Technology, Shenzhen)*; Qingcai Chen ( Harbin Institute of Technology, Shenzhen); Baotian Hu (Harbin Institute of Technology, Shenzhen); Shuoran Jiang (Harbin Institute of Technology, Shenzhen)"

1283 - One-shot Scene Graph Generation

Yuyu Guo (UESTC); Jingkuan Song (UESTC)*; Lianli Gao (The University of Electronic Science andTechnology of China); Heng Tao Shen (University of Electronic Science and Technology of China (UESTC))

1285 - NOH-NMS: Improving Pedestrian Detection by Nearby Objects Hallucination

Penghao Zhou (Tencent Youtu Lab)*; Chong Zhou (Tencent Youtu Lab); Pai Peng (Tencent Youtu Lab); Junlong Du (Tencent Youtu Lab); Xing Sun (Tencent); Xiaowei Guo (Tencent Youtu Lab); Feiyue Huang (Tencent)

1286 - Modeling Temporal Concept Receptive Field Dynamically for Untrimmed Video Analysis

"Zhaobo Qi (University of Chinese Academy of Sciences); Shuhui Wang (VIPL,ICT,Chinese academic of science)*; Chi Su (Kingsoft Cloud); Li Su (University of Chinese Academy of Sciences); Weigang Zhang (Harbin Institute of Technology, Weihai); Qingming Huang (University of Chinese Academy of Sciences)"

1298 - A probabilistic graphical model for analyzing the subjective visual quality assessment data from crowdsourcing

"Jing Li (Alibaba Group)*; Suiyi Ling (University of Nantes); Junle Wang (Tencent); Patrick Le Callet (""Universite de Nantes, France"")"

1300 - DFEW: A Large-Scale Database for Recognizing Dynamic Facial Expressions in the Wild

Xingxun Jiang (Southeast University)*; Yuan Zong (Southeast University); Wenming Zheng (Southeast University); Chuangao Tang (Southeast University); WanChuang Xia (Southeast University); Cheng Lu (Southeast University); Jiateng Liu (Southeast University)

1303 - Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion

Yikai Wang (Tsinghua University); Fuchun Sun (Tsinghua University); Ming Lu (Intel Labs China); Anbang Yao (Intel Labs China)*

1304 - Dual-Gradients Localization framework for Weakly Supervised Object Localization

Chuangchuang Tan (Beijing Jiaotong University); Guanghua Gu (Yanshan University); Tao Ruan (Beijjing Jiaotong University); Shikui Wei (Beijing Jiaotong University); Yao Zhao (Beijing Jiaotong University)*

1307 - DualLip: A System for Joint Lip Reading and Generation

Weicong Chen (Tsinghua University); Xu Tan (Microsoft Research Asia); Yingce Xia (Microsoft Research Asia); Tao Qin (Microsoft Research Asia); Yu Wang (Tsinghua University)*; Tieyan Liu (Microsoft Research)

1308 - Crossing You in Style: Cross-modal Style Transfer from Music to Visual Arts

Cheng-Che Lee (MediaTek); Wan-Yi Lin (National Tsing-Hua University); Yen-Ting Shih (National Tsing-Hua University); Pei-Yi (Patricia) Kuo (National Tsing-Hua University); Li Su (Academia Sinica)*

1314 - Single Image Shape-from-Silhouettes

Yawen Lu (Rochester Institute of Technology); Yuxing Wang (Rochester Institute of Technology)*; Guoyu Lu (Rochester Institute of Technology)

1319 - Weakly-supervised Image Hashing through Masked Visual Semantic Graph Reasoning

Lu Jin (Nanjing University of Science and Technology ); Zechao Li (Nanjing University of Science and Technology)*; Yonghua Pan (Nanjing University of Science and Technology); Jinhui Tang (Nanjing University of Science and Technology)

1320 - "Look, Listen and Infer"

Ruijian Jia (Xi'an Jiaotong University); Xinsheng Wang (Xi¡¯an Jiaotong University); Shanmin Pang (Xi'an Jiaotong University)*; Jihua Zhu (Xi'an Jiaotong University); Jianru Xue (Xi'an Jiaotong University)

1324 - How to Learn Item Representation for Cold-Start Multimedia Recommendation?

Xiaoyu Du (National University of Singapore)*; Xiang Wang (National University of Singapore); Xiangnan He (University of Science and Technology of China); Zechao Li (Nanjing University of Science and Technology); Jinhui Tang (Nanjing University of Science and Technology); Tat-Seng Chua (National university of Singapore)

1326 - Dual Attention GANs for Semantic Image Synthesis

Hao Tang (University of Trento)*; Song Bai (University of Oxford); Nicu Sebe (University of Trento)

1327 - MRI Measurement Matrix Learning via Correlation Reweighting

"Zhongnian Li (Nanjing University of Aeronautics and Astronautics,China); Tao Zhang (Nanjing University of Aeronautics and Astronautics,China); Ruoyu Chen (Nanjing University of Aeronautics and Astronautics); Daoqiang Zhang (Nanjing University of Aeronautics and Astronautics, China)*"

1329 - SimSwap: An Efficient Framework For High Fidelity Face Swapping

Renwang Chen (Shanghai Jiaotong University); Xuanhong Chen (Shanghai Jiao Tong University); Bingbing Ni (Shanghai Jiao Tong University)*; Yanhao Ge (Tencent)

1344 - Semantic Consistency Guided Instance Feature Alignment for 2D Image-Based 3D Shape Retrieval

"Heyu Zhou (Tianjin University, China); Weizhi Nie (Tianjin University)*; Dan Song (Tianjin University); Nian Hu (Tianjin University); Xuanya Li (Baidu); An-An Liu (Tianjin University)"

1347 - Performance over Random: A robust evaluation protocol for video summarization methods

"Evlampios Apostolidis (QMUL & CERTH-ITI)*; Eleni Adamantidou (CERTH); Alexandros I Metsai (CERTH-ITI); Vasileios Mezaris (Information Technologies Institute, Centre for Research and Technology Hellas, Greece); Ioannis Patras (Queen Mary University of London)"

1355 - ARSketch: Sketch-Based User Interface for Augmented Reality Glasses

"Zhaohui Zhang (Rokid); Haichao Zhu (The Chinese University of Hong Kong)*; Qian Zhang (California University, Los Angeles)"

1356 - Self-Mimic Learning for Small-scale Pedestrian Detection

"Jialian Wu (State University of New York at Buffalo)*; CHUNLUAN ZHOU (Wormpex AI Research); Qian Zhang (Horizon Robotics); Ming Yang (Horizon Robotics); Junsong Yuan (""State University of New York at Buffalo, USA"")"

1359 - Action2Motion: Conditioned Generation of 3D Human Motions

"Chuan Guo (University of Alberta)*; Xinxin Zuo (University of Alberta); Sen Wang (University of Alberta); Shihao Zou (University of Alberta); Qingyao Sun (University of Chicago); Annan Deng (Yale University); Minglun Gong (University of Guelph); Li Cheng (ECE dept., University of Alberta)"

1363 - ChefGAN: Food Image Generation from Recipes

siyuan pan (shanghai jiao tong university)*; Ling Dai (Shanghai Jiao Tong University); Xuhong Hou (Shanghai Jiao Tong University ); Huating Li (Shanghai Jiao Tong University); Bin Sheng (Shanghai Jiao Tong University)

1365 - Skin Textural Generation via Blue-noise Gabor Filtering based Generative Adversarial Network

HUI ZHANG (The University of Hong Kong)*; Chuan Wang (Face++ (Megvii)); Nenglun Chen (The University of Hong Kong); Wenping Wang (The University of Hong Kong); jue wang (Megvii Technology)

1367 - Text-Embedded Bilinear Model for Fine-Grained Visual Recognition

Liang Sun (University of Electronic Science and Technology of China); Xiang Guan (University of Electronic Science and Technology of China); Yang Yang (University of Electronic Science and Technology of China)*; Lei Zhang (Chongqing University)

1369 - VVSec: Securing Volumetric Video Streaming via Benign Use of Adversarial Perturbation

Zhongze Tang (Rutgers University)*; Xianglong Feng (Rutgers University); Yi Xie (Rutgers University); Huy Phan (Rutgers University); Tian Guo (Worcester Polytechnic Institute); bo yuan (rutgers university); Sheng Wei (Rutgers University - New Brunswick)

1373 - Personalized Item Recommendation for Second-hand Trading Platform

Xuzheng Yu (Shandong University)*; Tian Gan (Shandong University); Yinwei Wei (Shandong University); Zhiyong Cheng (Shandong Academy of Sciences); Liqiang Nie (Shandong University )

1374 - A Slow-I-Fast-P Architecture for Compressed Video Action Recognition

Jiapeng Li (Xi'an Jiaotong University); Ping Wei (Xi'an Jiaotong University)*; Yongchi Zhang (Xi'an Jiaotong University); Nanning Zheng (Xi'an Jiaotong University)

1384 - Learning Scales from Points: A Scale-aware Probabilistic Model for Crowd Counting

Zhiheng Ma (Xi'an Jiaotong University)*; Xing Wei (Xi'an Jiaotong University); Xiaopeng Hong (Xi'an Jiaotong University); Yihong Gong (Xi'an Jiaotong University)

1391 - Modeling Caricature Expressions by 3D Blendshape and Dynamic Texture

Keyu Chen (University of Science and Technology of China); Juyong Zhang (University of Science and Technology of China)*; Jianfei Cai (Monash University); Jianmin Zheng (Nanyang Technological University)

1394 - Learning Global Structure Consistency for Robust Object Tracking

Bi Li (Huazhong University of Science and Technology); Chengquan Zhang (Baidu Inc); Zhibin Hong (Baidu Inc.); Xu Tang (Baidu); jingtuo liu (baidu); Junyu Han (Baidu Inc.); Errui Ding (Baidu Inc.); Wenyu Liu (Huazhong University of Science and Technology)*

1396 - DMVOS: Discriminative Matching for Real-time Video Object Segmentation

"Peisong Wen (Nankai University); Ruolin Yang (SenseTime); Qianqian Xu (Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences); Chen Qian (SenseTime); Qingming Huang (University of Chinese Academy of Sciences); Runmin Cong (Beijing Jiaotong University); Jianlou Si ( SenseTime)*"

1397 - Multi-Group Multi-Attention: Towards Discriminative Spatiotemporal Representation

Zhensheng Shi (Ocean University of China); Liangjie Cao (Ocean University of China); Cheng Guan (Ocean University of China); Ju Liang (Ocean University of China); Qianqian Li (Ocean University of China); Zhaorui Gu (Ocean University of China); Haiyong Zheng (Ocean University of China)*; Bing Zheng (Ocean University of China)

1399 - RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization

Niluthpol c Mithun (SRI International)*; Karan Sikka (SRI International); Han-Pang Chiu (SRI International); Supun Samarasekera (SRI International); Rakesh Kumar (SRI International)

1401 - Vaccine-style-net: Point Cloud Completion in Implicit Continuous Function Space

"Wei Yan (Peking university); Ruonan Zhang ( Peng Cheng Laboratory); Jing Wang (Artificial Intelligence Research Center Peng Cheng Laboratory); Shan Liu (Tencent America); Thomas H Li (Advanced Institute of Information Technology, Peking University); Ge Li (SECE, Shenzhen Graduate School, Peking University)*"

1417 - Dual Hierarchical Temporal Convolutional Network with QA-Aware Dynamic Normalization for Video Story Question Answering

"Fei Liu (National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences and University of Chinese Academy of Sciences)*; Jing Liu (National Lab of Pattern Recognition, Institute of Automation,Chinese Academy of Sciences); Xinxin Zhu (National Lab of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences); Richang Hong (Hefei University of Technology); Hanqing Lu (NLPR, Institute of Automation, CAS)"

1418 - Multimodal Representation with Embedded Visual Guiding Objects for Named Entity Recognition in Social Media Posts

"Zhiwei Wu (School of Software Engineering, South China University of Technology); Changmeng Zheng (South China University of Technology); Yi Cai (School of Software Engineering, South China University of Technology)*; Junying Chen (South China University of Technology); Ho-fung Leung (The Chinese University of Hong Kong); Qing Li (The Hong Kong Polytechnic University)"

1421 - Adaptive Wasserstein Hourglass for Weakly Supervised RGB 3D Hand Pose Estimation

Yumeng Zhang (Tsinghua University); Li Chen (Tsinghua University)*; Yufeng Liu (Kuaishou Technology); Wen Zheng (Kuaishou Technology); JunHai Yong (Tsinghua University)

1422 - Weakly Supervised Segmentation with Maximum Bipartite Graph Matching

WEIDE LIU (Nanyang Technological University)*; Chi Zhang (Nanyang Technological University); Guosheng Lin (Nanyang Technological University); Tzu-Yi HUNG (Delta Research Center); Chunyan Miao (NTU)

1429 - What Aspect Do You Like: Multi-scale Time-aware User Interest Modeling for Micro-video Recommendation

"Hao Jiang (Shandong University)*; Wenjie Wang (National University of Singapore); Yinwei Wei (Shandong University); Zan Gao (1. Shandong AI Institute, QiLU University of Technology, 2. Shandong Computer Science Center(National Supercomputer Center in Jinan), 3. Tianjing University of Technology); Yinglong Wang (Shandong Artificial Intelligence Institute); Liqiang Nie (Shandong University )"

1431 - Recognizing Camera Wearer from Hand Gestures in Egocentric Videos

"Daksh Thapar (Indian Institute of Technology, Mandi)*; Chetan Arora (Indian Institute of Technology Delhi); Aditya Nigam (IIT mandi)"

1441 - Domain-Specific Alignment Network for Multi-Domain Image-Based 3D Object Retrieval

Yu-ting Su (Tianjin University); Yuqian Li (Tianjin University); Dan Song (Tianjin University)*; Zhendong Mao (University of Science and Technology of China); Xuanya Li (Baidu); An-An Liu (Tianjin University)

1443 - Cross-Granularity Learning for Multi-Domain Image-to-Image Translation

Huiyuan Fu (Beijing University of Posts and Telecommunications)*; Ting Yu (Beijing University of Posts and Telecommunications); Xin Wang (Stony Brook University); Huadong Ma (Beijing University of Posts and Telecommunications)

1444 - Generalized Zero-Shot Learning using Generated Proxy Unseen Samples and Entropy Separation

Omkar Anil Gune (Indian Institute of Technology Bombay)*; Biplab Banerjee (Indian Institute of Technology Bombay); Subhasis Chaudhuri (Indian Institute of Technology Bombay); Fabio Cuzzolin (Oxford Brookes University)

1445 - Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Neural Networks

Negin Ghamsarian (Alpen-Adria University of Klagenfurt); Hadi Amirpourazarian (Alpen-Adria-Universit_t Klagenfurt); Christian Timmerer (Alpen-Adria-Universit_t Klagenfurt); Mario Taschwer (Klagenfurt University); Klaus Sch_ffmann (Klagenfurt University)*

1448 - Complementary-View Co-Interest Person Detection

"Ruize Han (College of Intelligence and Computing, Tianjin University); Jiewen Zhao (College of Intelligence and Computing, Tianjin University); Wei Feng (College of Intelligence and Computing, Tianjin University, China)*; Yiyang Gan (College of Intelligence and Computing, Tianjin University); Liang Wan (College of Intelligence and Computing, Tianjin University); Song Wang (University of South Carolina)"

1453 - Contextual Multi-Scale Feature Learning for Person Re-Identification

"Baoyu Fan (Inspur Electronic Information Industry Co.,Ltd.); Li Wang (inspur)*; Runze Zhang (Inspur Electronic Information Industry Co.,Ltd.); Zhenhua Guo (Inspur Electronic Information Industry Co.,Ltd.); Yaqian Zhao (Inspur); Rengang Li (Inspur); Weifeng Gong ( Inspur Electronic Information Industry Co.,Ltd.)"

1456 - Campus3D: A Photogrammetry Point Cloud Benchmark for Hierarchical Understanding of Outdoor Scene

"Xinke Li (National University of Singapore); Chongshou Li (National University of Singapore)*; Zekun Tong (National University of Singapore); Andrew Lim (National University of Singapore); Junsong Yuan (""State University of New York at Buffalo, USA""); Yuwei Wu (National University of Singapore); Jing Tang (National University of Singapore); Raymond Huang (National University of Singapore)"

1458 - Prototype-Matching Graph Network for Heterogeneous Domain Adaptation

Zijian Wang (University of Queensland)*; Yadan Luo (University of Queensland); Zi Huang (University of Queensland); Mahsa Baktashmotlagh (University of Queensland)

1459 - VIMES: A Wearable Memory Assistance System for Automatic Information Retrieval

"Carlos Bermejo Fernandez (Hong Kong University of Science and Technology)*; Tristan Braud (HKUST); Ji Yang (The Hong Kong University of Science and Technology); Shayan Mirjafari (Dartmouth College ); Bowen Shi (Hong Kong University of Science and Technology); Yu Xiao (Department of Communications and Networking, Aalto University, Finland); Pan Hui (Hong Kong University of Science and Technology)"

1462 - Towards Lighter and Faster: Learning Wavelets Progressively for Image Super-Resolution

"Huanrong Zhang (Sun Yat-Sen University); Zhi Jin (Sun Yat-sen University)*; Xiaojun Tan (Sun Yat-sen University); Xiying Li (Research Center of ITS, Sun Yat-sen University)"

1464 - Memory Enhanced Embedding Learning for Cross-Modal Video-Text Retrieval

Rui Zhao (University of Science and Technology of China)*; Kecheng Zheng (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China); Hongtao Xie (University of Science and Technology of China); Jiebo Luo (U. Rochester)

1472 - Spatio-Temporal Inception Graph Convolutional Networks for Skeleton-Based Action Recognition

"Zhen Huang (University of Science and Technology of China); Xu Shen (Alibaba Group); Xinmei Tian (USTC)*; Houqiang Li (University of Science and Technology of China); Jianqiang Huang (Alibaba Group); Xian-Sheng Hua (Damo Academy, Alibaba Group)"

1473 - Space-Time Video Super-Resolution using Temporal Profiles

Zeyu Xiao (University of Science and Technology of China); Zhiwei Xiong (University of Science and Technology of China)*; Xueyang Fu (University of Science and Technology of China); Dong Liu (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China)

1475 - Answer-driven Visual State Estimator for Goal-oriented Visual Dialogue

Zipeng Xu (Beijing University of Posts and Telecommunications)*; Xiaojie Wang (Beijing University of Posts and Telecommunications); Fangxiang Feng (Beijing University of Posts and Telecommunications); Yushu Yang (Meituan-Dianping Group); Huixing Jiang (Meituan-Dianping Group); Zhongyuan Wang (Meituan-Dianping Group)

1482 - Dynamic Future Net: Diversified Human Motion Generation

Chen WenHeng (NetEase Fuxi AI Lab)*; He E Wang (Leeds University); Yi Yuan (NetEase Fuxi AI Lab); Tianjia Shao (Zhejiang University); Kun Zhou (Zhejiang University)

1491 - ATF : Towards robust face alignment via leveraging similarity and diversity across different datasets

"Xing Lan (University of Chinese Academy of Sciences;Institute of Automation,Chinese Academy of Sciences)*; Qinghao Hu (Institute of Automation, Chinese Academy of Sciences); Fangzhou Xiong (Nanjing Aritificial Intelligence Chip Research, Institute of Automation, Chinese Academy of Sciences;Nanjing University of Science and Technology); Cong Leng ( Institute of Automation,Chinese Academy of Sciences; Nanjing Aritificial Intelligence Chip Research, Institute of Automation, Chinese Academy of Sciences); Jian Cheng (""Chinese Academy of Sciences, China"")"

1493 - Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions

Yu-Siang Huang (Academia Sinica)*; Yi-Hsuan Yang (Academia Sinica)

1495 - DCNet: Dense Correspondence Neural Network for 6DoF Object Pose Estimation in Occluded Scenes

Zhi Chen (University of Science and Technology of China); Wei Yang (University of Science and Technology of China)*; Zhenbo Xu (University of Science and Technology of China); Xike Xie (University of Science and Technology of China); Liusheng Huang (University of Science and Technology of China)

1508 - Dual Gaussian-based Variational Subspace Disentanglement for Visible-Infrared Person Re-Identification

Nan Pu (Leiden University)*; Wei Chen (Leiden University); Yu Liu (KU Leuven); Erwin M. Bakker (Leiden University); Michael S Lew (Leiden University)

1517 - Region of Interest Based Graph Convolution: A Heatmap Regression Approach for Action Unit Detection

Zheng Zhang (State University of New York at Binghamton)*; Taoyue Wang (State Univerisity of New York at Binghamton); Lijun Yin (State University of New York at Binghamton)

1525 - DroidCloud: Scalable High Density Android Cloud Rendering

Linsheng Li (SJTU)*; bin yang (Intel); cathy bao (Intel); shuo liu (Intel); randy xu (Intel); yong yao (Intel); Haghighat Mohammad R (Intel); Jerry W Hu (Intel); Shoumeng Yan (Intel); Zhengwei Qi (SJTU)

1534 - Incomplete Cross-modal Retrieval with Dual-Aligned Variational Autoencoders

Mengmeng Jing (University of Electronic Science and Technology of China); Jingjing Li (University of Electronic Science and Technology of China)*; Lei Zhu (Shandong Normal Unversity); Ke Lu (University of Electronic Science and Technology of China); Yang Yang (University of Electronic Science and Technology of China); Zi Huang (University of Queensland)

1538 - Transferrable Referring Expression Grounding with Concept Transfer and Context Inheritance

"Xuejing Liu (CAS)*; Liang Li (Chinese Academy of Sciences); Shuhui Wang (VIPL,ICT,Chinese academic of science); Zheng-Jun Zha (University of Science and Technology of China); Dechao Meng (vipl,ict,Chinese academic of science); Qingming Huang (University of Chinese Academy of Sciences)"

1541 - MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis

"Devamanyu Hazarika (NUS, Singapore)*; Roger Zimmermann (NUS); Soujanya Poria (Singapore University of Technology and Design)"

1546 - Multimodal Dialogue Systems via Capturing Context-aware Dependencies of Semantic Elements

"Weidong He (University of Science and Technology of China)*; Zhi Li (University of Science and Technology of China); Dongcai Lu (Huawei Cloud BU); Enhong Chen (University of Science and Technology of China); Tong Xu (University of Science and Technology of China); Jing Yuan (Huawei Cloud BU); Baoxing Huai (HUAWEI TECHNOLOGIES CO., LTD.)"

1549 - Instability of Successive Deep Image Compression

"Jun-Hyuk Kim (Yonsei University); Soobeom Jang (Yonsei University); Jun-Ho Choi (Yonsei University); Jong-Seok Lee (""Yonsei University, Korea"")*"

1551 - Bitrate Requirements of Non-Panoramic VR Remote Rendering

Viktor Kelkkanen (Blekinge Institute of Technology)*; Markus Fiedler (Blekinge Institute of Technology); David Lindero (Ericsson)

1555 - Fine-grained Iterative Attention Network for Temporal Language Localization in Videos

Xiaoye Qu (Huazhong University of Science and Technology)*; Pengwei Tang ( Huazhong University of Science and Technology); Zhikang Zou (Huazhong university of science and technology); Yu Cheng (Microsoft); Jianfeng Dong (Zhejiang Gongshang University); Pan Zhou (Huazhong University of Science and Technology); Zichuan Xu (Dalian University of Technology)

1562 - EyeShopper: Estimating Shoppers' Gaze using CCTV Cameras

Carlos Bermejo Fernandez (Hong Kong University of Science and Technology)*; Dimitris Chatzopoulos (Hong Kong University of Science and Technology); Pan Hui (Hong Kong University of Science and Technology)

1570 - DeepFacePencil: Creating Face Images from Freehand Sketches

Yuhang Li (University of Science and Technology of China); Xuejin Chen (University of Science and Technology of China)*; Binxin Yang (University of Science and Technology of China); Zihan Chen (University of Science and Technology of China); Zhihua Cheng (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China)

1572 - Attention Based Dual Branches Fingertip Detection Network and Virtual Key System

Chong Mou (South China University of Technology)*; Xin Zhang (South China University of Technology)

1576 - ALANET: Adaptive Latent Attention Network for Joint Video Deblurring and Interpolation

"Akash Gupta (University of California, Riverside)*; Abhishek Aich (University of California, Riverside); Amit K. Roy-Chowdhury (University of California, Riverside)"

1578 - Action Completeness Modeling with Background Aware Networks for Weakly-Supervised Temporal Action Localization

Md Moniruzzaman (Stony Brook University)*; Zhaozheng Yin (Stony Brook University); Zhihai He (University of Missouri Columbia); Ruwen Qin (MST); Ming Leu (University of Misssouri of Science and Technology)

1579 - Adversarial Knowledge Transfer from Unlabeled Data

"Akash Gupta (University of California, Riverside)*; Rameswar Panda (MIT-IBM Watson AI Lab); Sujoy Paul (UC Riverside); Jianming Zhang (Adobe Research); Amit K. Roy-Chowdhury (University of California, Riverside)"

1592 - Hierarchical Bi-Directional Feature Perception Network for Person Re-Identification

Zhipu Liu (Chongqing University); Lei Zhang (Chongqing University)*; Yang Yang (University of Electronic Science and Technology of China)

1595 - CM-BERT: Cross-Modal BERT for Text-Audio Sentiment Analysis

"Kaicheng Yang (Hebei University Of Science and Technology); Hua Xu (State Key Laboratory of Intelligent Technology and Systems, Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China)*; kai gao (Hebei University Of Science and Technology)"

1598 - Single-Shot Two-Pronged Detector with Rectified IoU Loss

Keyang Wang (chongqing university); Lei Zhang (Chongqing University)*

1601 - Hard Negative Samples Emphasis Tracker without Anchors

Zhongzhou Zhang (Chongqing University)*; Lei Zhang (Chongqing University)

1605 - Task Decoupled Knowledge Distillation For Lightweight Face Detectors

"Xiaoqing Liang (University of Chinese Academy of Sciences)*; Xu Zhao (Chinese Academy of Sciences); Chaoyang Zhao (National Laboratory of Pattern Recognition, CASIA); Nanfei Jiang (University of Chinese Academy of Sciences ); Ming Tang (Chinese Academy of Sciences, China); Jinqiao Wang (Institute of Automation, Chinese Academy of Sciences)"

1606 - Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework

Li Tao (The University of Tokyo)*; Xueting Wang (The University of Tokyo); Toshihiko Yamasaki (The University of Tokyo)

1612 - Object-level Attention for Aesthetic Rating Distribution Prediction

"Jingwen Hou (Nanyang Technological University)*; Sheng Yang (Nanyang Technological University); Weisi Lin (Nanyang Technological University, Singapore)"

1619 - Memory Recursive Network for Single Image Super-Resolution

"Jie Liu (Nanjing University)*; Minqiang Zou (Department of Computer Science and Technology, Nanjing University); Jie Tang (Nanjing University); Gangshan Wu (Nanjing University)"

1623 - A Modular Approach for Synchronized Wireless Multimodal Multisensor Data Acquisition in Highly Dynamic Social Settings

Chirag Raman (Delft University of Technology )*; Stephanie Tan (TU Delft); Hayley Hung (TU Delft)

1625 - Scale-aware Progressive Optimization Network

Ying Chen (Sun Yat-sen University)*; Lifeng Huang (SunYat-sen university); Chengying Gao (Sun Yat-sen University ); Ning Liu (Sun Yat-sen University )

1630 - Kalman Filter-based Head Motion Prediction for Cloud-based Mixed Reality

Serhan G¨¹l (Fraunhofer HHI)*; Sebastian Bosse (Fraunhofer HHI); Dimitri Podborski (Fraunhofer HHI); Thomas Schierl (Fraunhofer HHI); Cornelius Hellge (Fraunhofer HHI)

1633 - Not made for each other - Audio-Visual Dissonance-based Deepfake Detection and Localization

Komal Chugh (Indian Institute of Technology Ropar); Parul Gupta (Indian Institute of Technology Ropar); Abhinav Dhall (Monash University)*; Ramanathan Subramanian (Indian Institute of Technology Ropar)

1637 - Resource Efficient Domain Adaptation

"Junguang Jiang (Tsinghua University); Ximei Wang (Tsinghua University); Mingsheng Long (Tsinghua University)*; Jianmin Wang (""Tsinghua University, China"")"

1653 - A Multi-update Deep Reinforcement Learning Algorithm for Edge Computing Service Offloading

Hao Hao (Beijing University of Posts and Telecommunications); Changqiao Xu (Beijing University of Posts and Telecommunications)*; Lujie Zhong (Capital Normal University); Gabriel-Miro Muntean (Dublin City University)

1655 - MGAAttack: Toward more query-efficient black-box attack by microbial genetic algorithm

"Lina Wang (Computer School of Wuhan University, China)*; Kang Yang (Wuhan University); Wenqi Wang (Wuhan University); Run Wang (Nanyang Technological University); Aoshuang Ye (Wuhan University)"

1656 - Make your favorite music curative: music style transfer for anxiety reduction

Zhejing Hu (The Hong Kong Polytechnic University); Yan Liu (The Hong Kong Polytechnic University)*; Gong Chen (The Hong Kong Polytechnic University); Sheng-hua Zhong (Shenzhen University); Aiwei Zhang (St. Paul¡¯s Co-educational College)

1657 - JointFontGAN: Joint Geometry-Content GAN for Font Generation via Few-Shot Learning

Yankun Xi (Wayne State University); Guoli Yan (Wayne State University); Jing Hua (Wayne State University); Zichun Zhong (Wayne State University)*

1673 - Enhancing Self-supervised Monocular Depth Estimation via Incorporating Robust Constraints

Rui Li (Northwestern Polytechnical University)*; Xiantuo He (Northwestern Polytechnical University); Yu Zhu (Northwestern Polytechnical University); Xianjun Li (Northwestern Polytechnical University); Jinqiu Sun (Northwestern Polytechnical University); Yanning Zhang (Northwestern Polytechnical University)

1675 - DeepRhythm: Exposing DeepFakes with Attentional Visual Heartbeat Rhythms

"Hua Qi (Kyushu University); Qing Guo (Nanyang Technological University)*; Felix Juefei-Xu (Alibaba Group); Xiaofei Xie (Nanyang Technological University); Lei Ma (Kyushu University); Wei Feng (College of Intelligence and Computing, Tianjin University, China); Yang Liu (Nanyang Technology University, Singapore); Jianjun Zhao (Kyushu University)"

1678 - Query Twice: Dual Mixture Attention Meta Learning for Video Summarization

Junyan Wang (MeituanDianping group); Yang Bai (Newcastle University); Yang Long (Durham University); BingZhang Hu (Newcastle University); Zhenhua Chai (MeituanDianping group)*; Yu Guan (Newcastle University); Xiaolin Wei (MeituanDianping group )

1679 - Deep Multi-modality Soft-decoding of Very Low Bit-rate Face Videos

Yanhui Guo (Mcmaster University); Xi Zhang (Shanghai Jiao Tong University); Xiaolin Wu (McMaster University)*

1685 - Hearing like Seeing: Improving Voice-Face Interactions and Associations via Adversarial Deep Semantic Matching Network

Kai Cheng (Huaqiao University); Xin Liu (Huaqiao University)*; Yiu-ming CHEUNG (Hong Kong Baptist University); Rui Wang (Huaqiao University); Xing Xu (University of Electronic Science and Technology of China); Bineng Zhong (Huaqiao University)

1696 - Multi-modal Attentive Graph Pooling Model for Community Question Answer Matching

Jun Hu (HeFei University of Technology); Quan Fang (Institute of Automation Chinese Academy of Sciences); Shengsheng Qian (institute of automation chinese academy of sciences); Changsheng Xu (CASIA)*

1701 - Towards Viewport-dependent 6DoF 360 Video Tiled Streaming for Virtual Reality Systems

Jong-Beom Jeong (Sungkyunkwan University)*; Soonbin Lee (Sungkyunkwan University); Il-Woong Ryu (Gachon University); Tuan Thanh Le (Gachon University); Eun-Seok Ryu (Sungkyunkwan University)

1702 - Concept Drift Detection for Multivariate Data Streams and Temporal Segmentation of Daylong Egocentric Videos

Pravin Nagar (IIIT Delhi)*; Mansi Khemka (Columbia University); Chetan Arora (Indian Institute of Technology Delhi)

1706 - A Novel Graph-TCN with a Graph Structured Representation for Micro-expression Recognition

Ling Lei (Southwest University); Jianfeng Li (Southwest University)*; Tong Chen (Southwest University); SHIGANG LI (Hiroshima City University)

1708 - Dynamic Context-guided Capsule Network for Multimodal Machine Translation

Huan Lin (Xiamen University)*; Fandong Meng (Tencent WeChat AI - Pattern Recognition Center Tencent Inc.); Jinsong Su (Xiamen University); Yongjing Yin (Xiamen University); Zhengyuan Yang (University of Rochester); Yubin Ge (University of Illinois at Urbana-Champaign); Jie Zhou (Tencent); Jiebo Luo (U. Rochester)

1710 - DeepSonar: Towards Effective and Robust Detection of AI-Synthesized Fake Voices

"Run Wang (Nanyang Technological University)*; Felix Juefei-Xu (Alibaba Group); Yihao Huang (East China Normal University); Qing Guo (Nanyang Technological University); Xiaofei Xie (Nanyang Technological University); Lei Ma (Kyushu University); Yang Liu (Nanyang Technology University, Singapore)"

1717 - RIRNet: Recurrent-In-Recurrent Network for Video Quality Assessment

"Pengfei Chen (Xidian University / China University of Mining and Technology); Leida Li (Xidian University)*; Lei Ma (Hangzhou Multi-Color Optoelctronics Co., Ltd.); Jinjian Wu (Xidian University); Guangming Shi (Xidian University)"

1718 - Incremental facial expression recognition

Junjie Zhu (Tsinghua University)*; bingjun luo (Tsinghua University); Sicheng Zhao (University of California Berkeley); Shihui Ying (Shanghai University); Xibin Zhao (Tsinghua University); Yue Gao (Tsinghua University)

1719 - Black Re-ID: A Head-shoulder Descriptor for the Challenging Problem of Person Re-Identification

BOQIANG XU (University of Chinese Academy of Sciences£»Institute of Automation£¬Chinese Academy of Sciences)*; Lingxiao He (AI Research of JD.com); Xingyu Liao (AI Research of JD.com); Wu Liu (AI Research of JD.com); Zhenan Sun (Chinese of Academy of Sciences); Tao Mei (AI Research of JD.com)

1720 - SketchMan: Learning to Create Professional Sketch

Jia Li (Communication University of China)*; Nan Gao (Communication University of China); Tong Shen (JD AI Research); Wei Zhang (JD AI Research); Hui Ren (Communication University of China); Tao Mei (AI Research of JD.com)

1722 - PopMAG: Pop Music Accompaniment Generation

Yi Ren (Zhejiang University)*; Jinzheng He (Zhejiang University); Xu Tan (Microsoft Research Asia); Tao Qin (Microsoft Research Asia); Zhou Zhao (Zhejiang University); Tie-Yan Liu (Microsoft)

1729 - PCPL: Predicate-Correlation Perception Learning for Unbiased Scene Graph Generation

Shaotian Yan (Zhejiang University)*; Chen Shen (Alibaba Group); Zhongming Jin (Alibaba Group); Jianqiang Huang (Alibaba Group); Rongxin Jiang (Zhejiang University); Yaowu Chen (Zhejiang University); Xian-Sheng Hua (Alibaba Group)

1736 - Masked Face Recognition with Generative Data Augmentation and Domain Constrained Ranking

Mengyue Geng (Peking University)*; Peixi Peng (Peking University); Yangru Huang (Beijing University); Yonghong Tian (Peking University)

1737 - SST-EmotionNet: Spatial-Spectral-Temporal based Attention 3D Dense Network for EEG Emotion Recognition

Ziyu Jia (Beijing Jiaotong University); Youfang Lin (Beijing Jiaotong University); Xiyang Cai (Beijing Jiaotong University); Haobin Chen (Beijing Jiaotong University); Haijun Gou (Beijing Jiaotong University); Jing Wang (Beijing Jiaotong University)*

1754 - Occlusion Detection for Automatic Video Editing

"Junhua Liao (Sichuan University); Haihan Duan (The Chinese University of Hong Kong, Shenzhen); Xin Li (Sichuan University); Haoran Xu (Sichuan University); Yanbing Yang (Sichuan University); Wei Cai (""The Chinese University of Hong Kong, Shenzhen""); Yanru Chen (Sichuan University); Liangyin Chen (Sichuan University)*"

1758 - Cartoon Face Recognition: A Benchmark Dataset

(Video) Webinar 2020/01: How to write a good SYSTEMS paper (that will hopefully get accepted)?

"Yi Zheng (iQIYI,Inc.); Yifan Zhao (Beihang University); Mengyuan Ren (iQIYI,Inc.); he yan (iQiYi,Inc.); Xiangju Lu (iQIYI,Inc.); Junhui Liu (iQIYI Inc); Jia Li (Beihang University)*"

1761 - Differentiable Manifold Reconstruction for Point Cloud Denoising

Shitong Luo (Peking University)*; Wei Hu (Peking University)

1765 - Perception-Lossless Codec of Haptic Data with Low Delay

Chaoyang Zeng (Fuzhou University)*; Tiesong Zhao (Fuzhou University); Qian Liu (Dalian University of Technology); Yiwen Xu (Fuzhou University); Kai Wang (Fuzhou University)

1770 - Reversible Watermarking in Deep Convolutional Neural Networks for Integrity Authentication

Xiquan Guan (University of Science and Technology of China)*; Weiming Zhang (University of Science and Technology of China); Huamin Feng (Beijing Electronic Science and Technology Institute); Hang Zhou (University of Science and Technology of China); Jie Zhang (University of Science and Technology in China); Nenghai Yu (University of Science and Technology of China)

1775 - Discriminative Spatial Feature Learning for Person Re-Identification

Peixi Peng (Peking University)*; Yonghong Tian (Peking University); Yangru Huang (Beijing University); Xiangqian Wang (Huawei); Huilong An (AI Application Research Center)

1779 - Masked Face Recognition with Latent Part Detection

Feifei Ding (Peking University)*; Peixi Peng (Peking University); Yangru Huang (Beijing University); Mengyue Geng (Peking University); Yonghong Tian (Peking University)

1781 - FakePolisher: Making DeepFakes More Detection-Evasive by Shallow Reconstruction

"Yihao Huang (East China Normal University)*; Felix Juefei-Xu (Alibaba Group); Run Wang (Nanyang Technological University); Qing Guo (Nanyang Technological University); Lei Ma (Kyushu University); Xiaofei Xie (Nanyang Technological University); Jianwen Li (East China Normal University); Weikai Miao (East China Normal University); Yang Liu (Nanyang Technology University, Singapore); Geguang Pu (East China Normal University)"

1784 - SalGCN: Saliency Prediction for 360-Degree Images Based on Spherical Graph Convolutional Networks

Haoran Lv (Shanghai Jiao Tong University)*; Qin Yang (Shanghai Jiao Tong University); Chenglin Li (Shanghai Jiao Tong University); Wenrui Dai (Shanghai Jiao Tong University); Junni Zou (Shanghai Jiao Tong University); Hongkai Xiong (Shanghai Jiao Tong University)

1789 - Neural3D: Light-weight Neural Portrait Scanning via Context-aware Correspondence Learning

Xin Suo (Shanghaitech university); Minye Wu (Shanghaitech University); Yanshun Zhang (Dgene); Yingliang Zhang (Dgene); Qiang Hu (ShanghaiTech University)*; LAN XU (HKUST); Jingyi Yu (Shanghai Tech University)

1790 - PanelNet: A Novel Deep Neural Network for Predicting Collective Diagnostic Ratings by a Panel of Radiologists for Pulmonary Nodules

"Chunyan Zhang (Institute of Medical Artificial Intelligence, the Second Affiliated Hospital of Xi'an Jiaotong University); Songhua Xu (School of Mathematics and Statistics, Xi'an Jiaotong University)*; Zongfang Li (Institute of Medical Artificial Intelligence, the Second Affiliated Hospital of Xi'an Jiaotong University)"

1795 - Multi-modal Multi-relational Feature Aggregation Network for Medical Knowledge Representation Learning

"Yingying Zhang (Institute of Automation, Chinese Academy of Sciences;Univiersity of Chinese Academy of Sciences); Quan Fang (Institute of Automation Chinese Academy of Sciences); Shengsheng Qian (institute of automation chinese academy of sciences); Changsheng Xu (CASIA)*"

1800 - AdaHGNN: Adaptive Hypergraph Neural Networks for Multi-Label Image Classification

"Xiangping Wu (Harbin Institute of Technology, Shenzhen); Qingcai Chen ( Harbin Institute of Technology, Shenzhen)*; Wei Li (Harbin Institute of Technology, Shenzhen); Yulun Xiao (Harbin Institute of Technology, Shenzhen); Baotian Hu (University of Massachusetts)"

1801 - Privacy-Preserving Visual Content Tagging using Graph Transformer Networks

"Xuan-Son Vu (Ume_ University)*; Duc-Trong Le (Vietnam National Univeristy); Christoffer K Edlund (Sartorius); Lili Jiang (Department of Computing Science, Ume_ University, Sweden); Hoang D. Nguyen (University of Glasgow)"

1803 - Task-distribution-aware Meta-learning for Cold-start CTR Prediction

"Tianwei Cao (University of Chinese Academy of Sciences)*; Qianqian Xu (Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences); Zhiyong Yang (SKLOIS, Institute of Information Engineering, Chinese Academy of Sciences; SCS, University of Chinese Academy of Sciences); Qingming Huang (University of Chinese Academy of Sciences)"

1806 - FastLR: Non-Autoregressive Lipreading Model with Integrate-and-Fire

"Jinglin Liu (Zhejiang University)*; Yi Ren (Zhejiang University); Zhou Zhao (Zhejiang University); Chen Zhang (Zhejiang University); Baoxing Huai (HUAWEI TECHNOLOGIES CO., LTD.); Jing Yuan (Huawei Cloud BU)"

1816 - Identity-Aware Attribute Recognition via Real-Time Distributed Inference in Mobile Edge Clouds

Zichuan Xu (Dalian University of Technology); Jiangkai Wu (Dalian University of Technology); Qiufen Xia (Dalian University of Technology)*; Pan Zhou ( Huazhong University of Science and Technology); Jiankang Ren (Dalian University of Technology); Huizhi Liang ()

1825 - A Novel Object Re-Track Framework for 3D Point Clouds

Tuo Feng (Xidian University)*; Licheng Jiao (Xidian University); Hao Zhu (Xidian University); Long Sun (Xidian University)

1828 - Reinforced Similarity Learning: Siamese Relation Networks for Robust Object Tracking

Dawei Zhang (Zhejiang Normal University)*; Zhonglong Zheng (Zhejiang Normal University); Minglu Li (Zhejiang Normal University); Xiaowei He (Zhejiang Normal University); Tianxiang Wang (Zhejiang Normal University); Liyuan Chen (Zhejiang Normal University); Riheng Jia (Zhejiang Normal University); Feilong Lin (Zhejiang Normal University)

1832 - "AffectI: A Game for Diverse, Reliable, and Efficient Affective Image Annotation"

xingkun zuo (University of Yamanashi); Jiyi Li (University of Yamanashi / RIKEN AIP); qili zhou (hangzhou dianzi university); jianjun li (HangZhou Dianzi University); Xiaoyang mao (University of Yamanashi)*

1833 - Photo Stream Question Answer

"Wenqiao Zhang (Zhejiang University)*; Siliang Tang (Zhejiang University); Yanpeng Cao (Zhejiang University); Jun Xiao (Zhejiang University); Shiliang Pu (Hikvision Research Institute); Fei Wu (Zhejiang University, China); Yueting Zhuang (Zhejiang University)"

1835 - Relational Graph Learning for Grounded Video Description Generation

Wenqiao Zhang (Zhejiang University)*; Xin Wang (UC Santa Barbara); Siliang Tang (Zhejiang University); Haochen Shi (Zhejiang University); Jun Xiao (Zhejiang University); Haizhou Shi (Zhejiang University); Yueting Zhuang (Zhejiang University); William Yang Wang (UC Santa Barbara)

1837 - Cognitive Representation Learning of Self-Media Online Article Quality

"Yiru Wang (Tencent Inc.; Tsinghua University)*; Shen Huang (Tencent Inc.); Gongfu Li (Tencent Inc.); Qiang Deng (Tencent Inc.); Dongliang Liao (Data Quality Team, WeChat, Tencent Inc., China); Pengda Si (Tsinghua University); Yujiu Yang (Tsinghua University); Jin Xu (Tencent Inc.)"

1841 - Exploiting Active Learning in Novel Refractive Error Detection with Smartphones

Eugene Yujun Fu (The Hong Kong Polytechnic University)*; Zhongqi Yang (The Hong Kong Polytechnic University); Hong Va Leong (The Hong Kong Polytechnic University); Grace Ngai (The Hong Kong Polytechnic University); Chi-wai Do (The Hong Kong Polytechnic University); Lily Chan (The Hong Kong Polytechnic University)

1852 - Describing Subjective Experiment Consistency by p-value qq-plot

Jakub Nawa_a (AGH University of Science and Technology)*; Lucjan Janowski (AGH University of Science and Technology); Bogdan _miel (); Krzysztof Rusek (AGH University of Science and Technology)

1859 - Deep Structural Contour Detection

Ruoxi Deng (Central South University)*; Shengjun Liu (Central South University)

1862 - Low-latency FoV-adaptive Coding and Streaming for Interactive 360-Degree Video Streaming

Yixiang Mao (New York University)*; Liyang Sun (New York University); Yong Liu (NYU); Yao Wang (New York University)

1874 - Multimodal Multi-Task Financial Risk Forecasting

"Ramit Sawhney (Netaji Subhas Institute of Technology)*; Puneet Mathur (University of Maryland, College Park); Ayush Mangal (IIT Roorkee); Piyush Khanna (Delhi Technological University); Rajiv Ratn Shah (""Indraprastha Institute of Information Technology, Delhi""); Roger Zimmermann (NUS)"

1888 - Multimodal Attention with Image Text Spatial Relationship for OCR-Based Image Captioning

Jing Wang (Nanjing University of Science and Technology)*; Jinhui Tang (Nanjing University of Science and Technology); Jiebo Luo (U. Rochester)

1889 - Rotationally-Consistent Novel View Synthesis for Humans

YoungJoong Kwon (The University of North Carolina at Chapel Hill)*; Stefano Petrangeli (Adobe); Haoliang Wang (Adobe Research); Dahun Kim (KAIST); Henry Fuchs (unc); Viswanathan Swaminathan (Adobe)

1890 - Language Models as Emotional Classifiers for Textual Conversation

Connor Heaton (Pennsylvania State University)*; David M Schwartz (Penn State)

1893 - Cross-modal Non-linear Guided Attention and TemporalCoherence in Multi-modal Deep Video Models

Saurabh Sahu (); Palash Goyal (Samsung Research); Shalini Ghosh (Samsung Research)*; Chul Lee (Samsung Research America)

1918 - Integrating Semantic Segmentation and Retinex Model for Low-Light Image Enhancement

Minhao Fan (Peking University)*; Wenjing Wang (Peking University); Wenhan Yang (Peking University); Jiaying Liu (Peking University)

1919 - Alleviating Human-level Shift : A Robust Domain Adaptation Method for Multi-person Pose Estimation

Xixia Xu (Beijing Jiaotong university)*; Qi Zou (Beijing Jiaotong University); Xue Lin ( Beijing Jiaotong University)

1924 - Price Suggestion for Online Second-hand Items with Texts and Images

Liang Han (Stony Brook University)*; Zhaozheng Yin (Stony Brook University); Zhurong Xia (Alibaba Group); Minqian Tang (Alibaba Group); rong jin (alibaba group)

1927 - SpatialGAN: Progressive Image Generation Based on Spatial Recursive Adversarial Expansion

"Lei Zhao (Zhejiang University)*; Sihuan Lin (Zhejiang university); Ailin Li (College of Computer Science and Technology, Zhejiang University); Huaizhong Lin (Zhejiang University); Wei Xing (Zhejiang University); Dongming Lu (Zhejiang University)"

1932 - Medical Visual Question Answering via Conditional Reasoning

Li-Ming Zhan (The Hong Kong Polytechnic University)*; Bo Liu (The Hong Kong Polytechnic University); Lu Fan (The Hong Kong Polytechnic University); JIAXIN CHEN (The Hong Kong Polytechnic University); Xiao-Ming Wu (PolyU Hong Kong)

1938 - Towards Modality Transferable Visual Information Representation with Optimal Model Compression

"Rongqun Lin (City University of Hong Kong)*; Linwei Zhu (Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences); Shiqi Wang (CityU); Sam Kwong (City Univeristy of Hong Kong)"

1939 - Nighttime Dehazing with a Synthetic Benchmark

Jing Zhang (The University of Sydney)*; Yang Cao (University of Science and Technology of China); Zheng-Jun Zha (University of Science and Technology of China); Dacheng Tao (The University of Sydney)

1942 - Video Relation Detection via Multiple Hypothesis Association

Zixuan Su (Fudan University)*; Xindi Shang (National University of Singapore); Jingjing Chen (Fudan University); Yu-Gang Jiang (Fudan University); Zhiyong Qiu (Tencent); Tat-Seng Chua (National Univ. of Singapore)

1946 - Multi-modal Cooking Workflow Construction for Food Recipes

Liang-Ming Pan (National University of Singapore)*; Jingjing Chen (Fudan University); Jianlong Wu (Fudan University); Shaoteng Liu (Xi'an Jiaotong University); Chong-Wah Ngo (City University of Hong Kong); Min-Yen Kan (National University of Singapore); Yu-Gang Jiang (Fudan University); Tat-Seng Chua (National university of Singapore)

1947 - Pay Attention Selectively and Comprehensively: Pyramid Gating Network for Human Pose Estimation

Chenru Jiang (XJTLU)*; Kaizhu Huang (Xi'an Jiaotong-Liverpool Univ.); Shufei Zhang (University of Liverpool); xinheng wang ( Xi'an Jiaotong-Liverpool University); Jimin Xiao (Xi'an Jiaotong-Liverpool University)

1950 - Distributed Multi-agent Video Fast-forwarding

"Shuyue Lan (Northwestern University)*; Zhilu Wang (Northwestern University); Amit K. Roy-Chowdhury (University of California, Riverside); Ermin Wei (); Zhu Qi (Northwestern University)"

1954 - Data-driven Meta-set Based Fine-Grained Visual Recognition

Chuanyi Zhang (Nanjing University of Science and Technology); Yazhou Yao (Nanjing University of Science and Technology)*; Xiangbo Shu (Nanjing University of Science and Technology); Zechao Li (Nanjing University of Science and Technology); Zhenmin Tang ( Nanjing University of Science and Technology); Qi Wu (University of Adelaide)

1958 - WildDeepfake: A Challenging Real-World Dataset for Deepfake Detection

Bojia Zi (Fudan University)*; Xingjun Ma (Deakin University); Jingjing Chen (Fudan University); Minghao Chang (Fudan University); Yu-Gang Jiang (Fudan University)

1964 - Anisotropic Stroke Control for Multiple Artists Style Transfer

Xuanhong Chen (Shanghai Jiao Tong University); Xirui Yan (Shanghai Jiao Tong University); Naiyuan Liu (Shanghai Jiao Tong University); Ting Qiu (Shanghai Jiao Tong University); Bingbing Ni (Shanghai Jiao Tong University)*

1965 - LodoNet: A Deep Neural Network with Keypoint Matching for LiDAR Odometry

Ce Zheng (University of North Carolina at Charlotte)*; Yecheng Lyu (Worcester Polytechnic Institute); Ming Li (Worcester Polytechnic Institute); Ziming Zhang (Worcester Polytechnic Institute)

1966 - Towards Accuracy-Fairness Paradox: Adversarial Example-based Data Augmentation for Visual Debiasing

"Yi Zhang (Beijing Jiaotong University, China)*; Jitao Sang (Beijing Jiaotong University, China)"

1967 - Occluded Facial Expression Recognition with Step-Wise Assistance from Unpaired Non-Occluded Images

Bin Xia (University of Science and Technology of China); Shangfei Wang (University of Science and Technology of China)*

1968 - Learning from Macro-expression: a Micro-expression Recognition Framework

Bin Xia (University of Science and Technology of China); Weikang Wang (University of Science and Technology of China); Shangfei Wang (University of Science and Technology of China)*; Enhong Chen (University of Science and Technology of China)

1980 - HOT-Net: Non-Autoregressive Transformer for 3D Hand-Object Pose Estimation

"Lin Huang (University at Buffalo)*; Jianchao Tan (Kwai Inc.); Jingjing Meng (State University of New York at Buffalo); Ji Liu (Kwai Inc.); Junsong Yuan (""State University of New York at Buffalo, USA"")"

1987 - Emotion-Based End-to-End Matching Between Image and Music in Valence-Arousal Space

"Sicheng Zhao (University of California Berkeley)*; Yaxian Li (Renmin University of China); Xingxu Yao (Nankai University); Weizhi Nie (Tianjin University); Pengfei Xu (Didi Chuxing); Jufeng Yang (Nankai University ); Kurt Keutzer (EECS, UC Berkeley)"

1988 - IR-GAN: Image Manipulation with Linguistic Instruction by Increment Reasoning

"Zhenhuan Liu (Institute of Computing Technology, Chinese Academy of Sciences); liang li (Institute of Computing Technology, Chinese Academy of Sciences)*; Shaofei Cai (Institute of Computing Technology, Chinese Academy of Sciences); Jincan Deng (Institute of Computing Technology, Chinese Academy of Sciences); Qianqian Xu (Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences); Shuhui Wang (VIPL,ICT,Chinese academic of science); Qingming Huang (University of Chinese Academy of Sciences)"

1994 - LIGHTEN: Learning Interactions with Graph and Heirarchical TEmporal Networks for HOI in videos

"Sai Praneeth Reddy Sunkesula (Indian Institute of Technology, Bombay)*; Rishabh Dabral (IIT Bombay); Ganesh Ramakrishnan (IIT Bombay)"

2004 - Learning Semantic Concepts and Temporal Alignment for Narrated Video Procedural Captioning

"Botian Shi (Beijing Institute of Technology)*; Lei Ji (Microsoft); Zhendong Niu (Beijing Institute of Technology); Nan Duan (Microsoft Research); Ming Zhou (Microsoft Research); Xilin Chen (Institute of Computing Technology, Chinese Academy of Sciences)"

2010 - Multi-Features Fusion and Decomposition for Age-Invariant Face Recognition

"Lixuan Meng (School of Mechanical, Electrical and Information Engineering, Shandong University, China); Chenggang Yan (Hangzhou Dianzi University); Jun Li (School of Mechanical, Electrical and Information Engineering, Shandong University, China); Jian Yin (Department of Computer, Shandong University, Weihai, China)*; Wu Liu (AI Research of JD.com); Hongtao Xie (University of Science and Technology of China); liang li (Institute of Computing Technology, Chinese Academy of Sciences)"

2014 - BS-MCVR: Binary-sensing based Mobile-cloud Visual Recognition

"Hongyi Zheng (The Hong Kong Polytechnic University); Lei Zhang (""Hong Kong Polytechnic University, Hong Kong, China"")*"

2015 - Part-Aware Interactive Learning for Scene Graph Generation

Hongshuo Tian (Tianjin University); Ning Xu (Tianjin University)*; An-An Liu (Tianjin University); Yongdong Zhang (University of Science and Technology of China)

2017 - Depth Guided Adaptive Meta-Fusion Network for Few-shot Video Recognition

Yuqian Fu (Fudan University)*; Yanwei Fu (Fudan University); junke wang (Fudan University); Li Zhang (University of Oxford); Xing Zhang (Fudan University); Yu-Gang Jiang (Fudan University)

2030 - Learning Modality-Invariant Latent Representations for Generalized Zero-shot Learning

Jingjing Li (University of Electronic Science and Technology of China)*; Mengmeng Jing (University of Electronic Science and Technology of China); Lei Zhu (Shandong Normal Unversity); Zhengming Ding (Indiana University-Purdue University Indianapolis); Ke Lu (University of Electronic Science and Technology of China); Yang Yang (University of Electronic Science and Technology of China)

2032 - When Bitstream Prior Meets Deep Prior: Compressed Video Super-resolution with Learning from Decoding

Peilin Chen (City University of Hong Kong)*; Wenhan Yang (City University of Hong Kong); Long Sun (Huawei); Shiqi Wang (CityU)

2035 - Describe What to Change: A Text-guided Unsupervised Image-to-image Translation Approach

"Yahui Liu (University of Trento); Marco De Nadai (Fondazione Bruno Kessler)*; Deng Cai (The Chinese University of Hong Kong); Huayang Li (Tencent AI Lab); Xavier Alameda-Pineda (INRIA); Nicu Sebe (University of Trento); Bruno Lepri (FBK, Trento, Italy)"

2042 - Exploiting Multi-Emotion Relations at Feature and Label Levels for Emotion Tagging

Zhiwei Xu (University of science and technology of China); Shangfei Wang (University of Science and Technology of China)*; Can Wang (USTC)

2043 - Memory-Based Network for Scene Graph with Unbalanced Relations

Weitao Wang (Southeast University); Ruyang Liu (Southeast University); Meng Wang (Southeast University); Sen Wang (The University of Queensland)*; Xiaojun Chang (Monash University); Y ang Chen (Southeast University)

2052 - Increasing Video Perceptual Quality with GANs and Semantic Coding

Leonardo Galteri (University of Florence); Marco Bertini (University of Florence)*; Lorenzo Seidenari (University of Florence); Tiberio Uricchio (University of Florence); Alberto Del Bimbo (University of Florence)

2053 - Attentive One-Dimensional Heatmap Regression for Facial Landmark Detection and Tracking

Shi Yin (University of Science and Technology of China); Shangfei Wang (University of Science and Technology of China)*; Xiaoping Chen (University of Science and Technology of China); Enhong Chen (University of Science and Technology of China); Cong Liang (University of Science and Technology of China)

2071 - Fine-Grained Similarity Measurement between Educational Videos and Exercises

"Xin Wang (University of Science and Technology of China); Wei Huang (University of Science and Technology of China); Qi Liu ("" University of Science and Technology of China, China"")*; Yu Yin (University of Science and Technology of China); Zhenya Huang (University of Science and Technology of China ); Le Wu (Hefei University of Technology); Jianhui Ma (University of Science and Technology of China); Xue Wang (Nankai University)"

2073 - One-shot Text Field labeling using Attention and Belief Propagation for Structure information extraction

Mengli Cheng (Alibaba Group)*; Minghui Qiu (Alibaba)

2081 - GRAD: Learning for Overhead-aware Adaptive Video Streaming with Scalable Video Coding

Yunzhuo Liu (Shanghai Jiao Tong University); Bo Jiang (Shanghai Jiao Tong University)*; Tian Guo (Worcester Polytechnic Institute); Ramesh K. Sitaraman (UMass Amherst & Akamai Technologies); Don Towsley (University of Massachusetts Amherst); Xinbing Wang (Shanghai Jiao Tong University)

2084 - LGNN: A context-aware line segment detector

Quan Meng (ShanghaiTech University)*; Jiakai Zhang (ÉϺ£¿Æ¼¼´óѧ); Qiang Hu (ShanghaiTech University); Xuming He (ShanghaiTech University); Jingyi Yu (Shanghai Tech University)

2088 - Down to the Last Detail: Virtual Try-on with Fine-grained Details

Jiahang Wang (Huazhong University of Science and Technology)*; Tong Sha (Beihang University); Wei Zhang (JD AI Research); Zhoujun Li (Beihang University); Tao Mei (AI Research of JD.com)

2095 - Uncertainty-aware Cross-dataset Facial Expression Recognition via Regularized Conditional Alignment

Linyi Zhou (Nanjing Forestry University ); Xijian fan (Nanjing Forestry University)*; Yingjie Ma (Nanjing Forestry University ); Dr.Tardi Tjahjadi (Warwick University); Qiaolin Ye ( Nanjing Forestry University)

2103 - Pairwise Similarity Regularization for Adversarial Domain Adaptation

Haotian Wang (National University of Defense Technology); Wenjing Yang (National University of Defense Technology)*; Ji Wang (National University of Defense Technology); Ruxin Wang (Union Vision Innovation Co Ltd.); long lan (NUDT); Mingyang Geng (National University of Defense Technology)

2106 - Generalized Zero-Shot Video Classification via Generative Adversarial Networks

Mingyao Hong (University of Chinese Academy of Sciences)*; Guorong Li (University of Chinese Academy of Sciences); xinfeng zhang (University of Chinese Academy of Sciences); Qingming Huang (University of Chinese Academy of Sciences)

2111 - DeVLBert: Learning Deconfounded Visio-Linguistic Representations

"Shengyu Zhang (Zhejiang University)*; Tan Jiang (ZhangJiang University); Tan Wang (University of Electronic Science and Technology of China); Kun Kuang (Zhejiang University); Zhou Zhao (Zhejiang University); Jianke Zhu (Zhejiang University); Jin Yu (Alibaba Group); Hongxia Yang (Alibaba Group); Fei Wu (Zhejiang University, China)"

2112 - Drum Synthesis and Rhythmic Transformation with Adversarial Autoencoders

Maciej Tomczak (Birmingham City University)*; Jason Hockman (Birmingham City University); Masataka Goto (National Institute of Advanced Industrial Science and Technology (AIST))

2115 - "Presence, embodied interaction and motivation: distinct learning phenomena in an immersive virtual environment"

Jack Ratcliffe (QMUL)*

2116 - AdaP-360: User-Adaptive Area-of-Focus Projections for Bandwidth-Efficient 360-Degree Video Streaming

Chao Zhou (SUNY Binghamton); Shuoqian Wang (SUNY Binghamton); Mengbai Xiao (the Ohio State University); Sheng Wei (Rutgers University - New Brunswick); Yao Liu (SUNY Binghamton)*

2130 - Retrieval Guided Unsupervised Multi-domain Image to Image Translation

"Raul Gomez (Eurecat, Unitat de Tecnologies Audiovisuals - Computer Vision Centre, Universitat Aut¨°noma de Barcelona); Yahui Liu (University of Trento); Marco De Nadai (Fondazione Bruno Kessler)*; Dimosthenis Karatzas (Computer Vision Centre); Bruno Lepri (FBK, Trento, Italy); Nicu Sebe (University of Trento)"

2145 - MMNet: Multi-Stage and Multi-Scale Fusion Network for RGB-D Salient Object Detection

Guibiao Liao (Peking University); Wei Gao (Peking University)*; Qiuping Jiang (Ningbo University); Ronggang Wang (Peking University); Ge Li (Peking University)

2151 - Reduce the Influence of Stability in Content Delivery Network via Learning-Based Caching Algorithm

Gang Yan (Binghamton University-SUNY); Jian Li (Binghamton University-SUNY )*

2158 - Temporal Denoising Mask Synthesis Network for Learning Blind Video Temporal Consistency

Yifeng Zhou (University of Electronic Science and Technology of China); Xing Xu (University of Electronic Science and Technology of China)*; Fumin Shen (UESTC); Lianli Gao (The University of Electronic Science andTechnology of China); Huimin Lu (Kyushu Institute of Technology); Heng Tao Shen (University of Electronic Science and Technology of China (UESTC))

2161 - Stable Video Style Transfer Based on Partial Convolution with Depth-Aware Supervision

Songhua Liu (Nanjing University); Wu Hao (Nanjing University); Shoutong Luo (Nanjing University); Zhengxing Sun (Nanjing University)*

2171 - Interpretable Video Synthesis via Transform-Based Tensor Reconstruction Network

Yimeng Zhang (Columbia University); Xiao-Yang Liu (Columbia University); Bo Wu (MIT-IBM Watson AI Lab)*; Anwar Walid (Bell Laboratories)

2174 - INCLUDE: A Large Scale Dataset for Indian Sign Language Recognition

Advaith Sridhar (IIT Madras)*; Rohith Gandhi G (IIT Madras); Pratyush Kumar (IIT Madras); Mitesh Khapra (IIT Madras)

2178 - Cluster Attention Contrast for Video Anomaly Detection

Ziming Wang (Peking University); Yuexian Zou (Peking University)*; Zeming Zhang (Harbin institute of technology)

2198 - Automatic Interest Recognition from Posture and Behaviour

Wolmer Bigi (Univeristy of Florence); Claudio Baecchi (University of Florence)*; Alberto Del Bimbo (University of Florence)

2199 - Finding Achilles' Heel: Adversarial Attack on Multi-modal Action Recognition

"Deepak Kumar (University of Massachusetts Dartmouth)*; Chetan Kumar (University of Massachusetts Dartmouth); Chun Wei Seah (University of Massachusetts); Siyu Xia (Southeast University, China); Ming Shao (University of Massachusetts Dartmouth)"

2205 - A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild

"Prajwal K R (International Institute of Information Technology, Hyderabad)*; Rudrabha Mukhopadhyay (IIIT Hyderabad); Vinay Namboodiri (University of Bath); C.V. Jawahar (IIIT-Hyderabad)"

2209 - PanoRTC: A System for Content-Adaptive Real-Time 360-Degree Video Communication

Shuoqian Wang (SUNY Binghamton); Xiaoyang Zhang (SUNY Binghamton); Mengbai Xiao (The Ohio State University); Kenneth Chiu (Binghamton University ); Yao Liu (SUNY Binghamton)*

2225 - Fonts Like This but Happier: A New Way to Discover Fonts

Tugba Kulahcioglu (Rutgers University)*; Gerard de Melo (Hasso Plattner Institute)

2226 - User Centered Adaptive Streaming of Dynamic Point Clouds with Low Complexity Tiling

"Shishir Subramanyam (Centrum Wiskunde & Informatica)*; Irene Viola (CWI); Alan Hanjalic (TU Delft, Netherlands); Pablo Cesar (CWI, The Netherlands)"

2237 - Efficient adaptation of neural network filter for video compression

Yat-Hong Lam (Nokia Technologies)*; Alireza Zare (Nokia Technologies); Francesco Cricri (Nokia Technologies); Jani Lainema (Nokia); Miska Hannuksela (Nokia Technologies)

2239 - An Advanced LiDAR Point Cloud Sequence Coding Scheme for Autonomous Driving

Xuebin Sun (CUHK)*; Sukai Wang (HKUST); Miaohui Wang (Shenzhen University); Shing shin Cheng (CUHK); Ming Liu (HKUST)

2242 - Adaptive Multimodal Fusion for Facial Action Units Recognition

Huiyuan Yang (Binghamton University-SUNY)*; Lijun Yin (State University of New York at Binghamton); Taoyue Wang (State Univerisity of New York at Binghamton)

2246 - An Analysis of Delay in Live 360¡ã Video Streaming Systems

"Jun Yi (Georgia State University)*; Md Reazul Islam (Georgia State University); Shivang Aggarwal (University at Buffalo, The State University of New York); Dimitrios Koutsonikolas (SUNY Buffalo); Y. Charlie Hu (Purdue University); Zhisheng Yan (Georgia State University)"

2249 - Adaptive Temporal Triplet-loss for Cross-modal Embedding Learning

David Semedo (Universidade NOVA de Lisboa)*; Joao Magalhaes (Universidade NOVA Lisboa)

2253 - CFVMNet: A Multi-branch Network for Vehicle Re-identification based the Common Field of View

ziruo sun (Shandong University); Xiushan Nie (Shandong Jianzhu University)*; Xiaoming Xi (Shandong Jianzhu University ); Yilong Yin (Shandong University)

2257 - SonoSpace: Visual Feedback of Timbre with Unsupervised Learning

Naoki Kimura (The University of Tokyo)*; Keisuke Shiro (The University of Tokyo); Yota Takakura (Innoqua Inc.); Hiromi Nakamura (The University of Tokyo); Jun Rekimoto (The Univertsity of Tokyo)

2262 - Learning Optimization-based Adversarial Perturbations for Attacking Sequential Recognition Models

Xing Xu (University of Electronic Science and Technology of China)*; Jiefu Chen (University of Electronic Science and Technology of China); Jinhui Xiao (University of Electronic Science and Technology of China); Zheng Wang (UESTC); Yang Yang (University of Electronic Science and Technology of China); Heng Tao Shen (University of Electronic Science and Technology of China (UESTC))

2264 - Amora: Black-box Adversarial Morphing Attack

"Run Wang (Nanyang Technological University)*; Felix Juefei-Xu (Alibaba Group); Qing Guo (Nanyang Technological University); Yihao Huang (East China Normal University); Xiaofei Xie (Nanyang Technological University); Lei Ma (Kyushu University); Yang Liu (Nanyang Technology University, Singapore)"

2269 - PmR-QP: Prediction-Based R-QP Modeling on Bitrate Estimation

Yangfan Sun (University of Missouri-Kansas City); Li Li (University of Missouri-Kansas City); Zhu Li (university of missouri-kansas city)*; Shan Liu (Tencent America)

2272 - GangSweep: Sweep out Neural Backdoors by GAN

LIUWAN ZHU (Old Dominion University)*; Rui Ning (Old Dominion University); Cong Wang (Old Dominion University); Chunsheng Xin (Old Dominion University); Michael Wu (Nil)

2273 - Exploiting Self-Supervised and Semi-Supervised Learning for Facial Landmark Tracking with Unlabeled Data

Shi Yin (University of Science and Technology of China); Shangfei Wang (University of Science and Technology of China)*; Xiaoping Chen (University of Science and Technology of China); Enhong Chen (University of Science and Technology of China)

2274 - MS^2L: Multi-task Self-supervised Learning for Skeleton Based Action Recognition

Lilang Lin (Peking University)*; Sijie Song (Peking University); Wenhan Yang (Peking University); Jiaying Liu (Peking University)

2275 - Exploiting Heterogeneous Composer and Listener Preference Graph for Music Genre Classification

"Chunyuan Yuan (Institute of Information Engineering, Chinese Academy of Sciences)*; Qianwen Ma ( Institute of Information Engineering, School of Cyber Security, University of Chinese Academy of Sciences); junyang chen (University of Macau); Yijun Lu (Alibaba Cloud Computing Co. Ltd.); Wei Zhou (Institute of Information Engineering, School of Cyber Security, University of Chinese Academy of Sciences); Jizhong Han ( Institute of Information Engineering,Chinese Academy of Sciences); Songlin Hu ( Institute of Information Engineering,Chinese Academy of Sciences)"

2292 - Tile Rate Allocation for 360-Degree Tiled Adaptive Video Streaming

Praveen Kumar Yadav (National University of Singapore)*; Wei Tsang Ooi (National University of Singapore)

2297 - Sequential Attention GAN for Interactive Image Editing

Yu Cheng (Microsoft)*; Zhe Gan (Microsoft); Yitong Li (Apple Inc); Jingjing Liu (Microsoft); Jianfeng Gao (Microsoft Research)

2298 - Cross Corpus Physiological-based Emotion Recognition Using a Learnable Visual Semantic Graph Convolutional Network

"Woan-Shiuan Chien (Department of Electrical Engineering, National Tsing Hua University ); Hao-Chun Yang (Department of Electrical Engineering, National Tsing Hua University); Chi-Chun Lee (Department of Electrical Engineering, National Tsing Hua University)*"

2314 - Domain-Adaptive Object Detection via Uncertainty-Aware Distribution Alignment

Dang-Khoa Nguyen (National Chiao Tung University); Wei-Lun Tseng (National Chiao Tung University); Hong-Han Shuai (National Chiao Tung University)*

2323 - Single Image Deraining via Scale-space Invariant Attention Neural Network

Bo Pang (Harbin Institute of Technology); Deming Zhai (Harbin Institute of Technolgy); Junjun Jiang (Harbin Institute of Technology); Xianming Liu (Harbin Institute of Technology)*

2327 - MM-Hand: 3D-Aware Multi-Modal Guided Hand Generation for 3D Hand Pose Synthesis

"Zhenyu Wu (Texas A&M University)*; Duc Hoang (Texas A&M); Shih-Yao Lin (Tencent America); Yusheng Xie (Amazon); Liangjian Chen (University of California, Irvine); Yen-Yu Lin (National Chiao Tung University); Zhangyang Wang (University of Texas at Austin); Wei Fan (Tencent)"

2340 - Graph-Refined Convolutional Network for Multimedia Recommendation with Implicit Feedback

Yinwei Wei (Shandong University)*; Xiang Wang (National University of Singapore); Liqiang Nie (Shandong University ); Xiangnan He (University of Science and Technology of China); Tat-Seng Chua (National Univ. of Singapore)

2342 - Concept-based Explanation for Fine-grained Images and Its Application in Infectious Keratitis Classification

Zhengqing Fang (Zhejiang University)*; Kun Kuang (Zhejiang University); Yuxiao Lin (Zhejiang University); Fei Wu (Zhejiang University); Yufeng Yao (Zhejiang University)

2346 - Visually Precise Query

"Riddhiman Dasgupta (Microsoft); Francis Tom (Microsoft); Sudhir Kumar (Microsoft); Mithun Das Gupta (Microsoft,India)*; Yokesh Kumar (Microsoft); Badri Patro (IIT Kanpur); Vinay P Namboodiri (IIT Kanpur)"

2380 - Joint Self-Attention and Scale-Aggregation for Self-Calibrated Deraining Network

Yutong Wu (Dalian University of Technology)*; Cong Wang (Dalian University of Technology); Zhixun Su (Dalian University of Technology); junyang chen (University of Macau)

2390 - Hybrid Dynamic-static Context-aware Attention Network for Action Assessment in Long Videos

"Ling-An Zeng (Sun Yat-sen University); Fa-Ting Hong (Sun Yat-Sen University); WEI-SHI ZHENG (Sun Yat-sen University, China)*; Qizhi Yu (Zhejiang Laboratory); Wei Zeng (Peking University, China); Yaowei Wang (PengCheng Laboratory); Jian-Huang Lai (Sun Yat-sen University)"

2394 - F2GAN: Fusing-and-Filling GAN for Few-shot Image Generation

Yan Hong (Shanghai Jiao Tong University); Li Niu (Shanghai Jiao Tong University)*; Jianfu Zhang (Shanghai Jiao Tong University); Weijie Zhao (Versa-AI); Chen Fu (Versa-AI); Liqing Zhang (Shanghai Jiao Tong Univercity)

2405 - JAFPro: Joint Appearance Fusion and Propagation for Human Video Motion Transfer from Multiple Reference Images

"Xianggang Yu (The Chinese University of Hong Kong, Shenzhen); Haolin Liu (The Chinese University of Hong Kong, Shenzhen); Xiaoguang Han (Shenzhen Research Institute of Big Data, the Chinese University of Hong Kong (Shenzhen))*; Zhen Li (Chinese University of Hong Kong, Shenzhen); Zixiang Xiong (Texas A&M University); Shuguang Cui (The Chinese University of Hong Kong, Shenzhen )"

2407 - A W2VV++ Case Study with Automated and Interactive Text-to-Video Retrieval

"Jakub Lokoc (Charles University in Prague); Tom¨¢_ Sou_ek (Charles University, Prague ); Patrik Vesel_ (Charles University, Prague ); Franti_ek Mejzl¨ªk (Charles University, Prague ); Jiaqi Ji (Renmin University of China); Chaoxi Xu (Renmin University of China); Xirong Li (Renmin University of China)*"

2415 - Attention Cube Network for Image Restoration

Yucheng Hang (Tsinghua University); Qingmin Liao (Tsinghua Univeristy); Wenming Yang (Tsinghua University)*; Yupeng Chen (Peng Cheng Laboratory); Jie Zhou (Tsinghua University)

2419 - CRNet: A Center-aware Representation for Detecting Text of Arbitrary Shapes

Yu Zhou (University of Science and Technology of China)*; Hongtao Xie (University of Science and Technology of China); Shancheng Fang (University of Science and Technology of China); Yan Li (Kuaishou); Yongdong Zhang (University of Science and Technology of China)

2448 - Visual Relation of Interest Detection

Fan Yu (Nanjing University); Haonan Wang (Nanjing University); Tongwei Ren (Nanjing University)*; Jinhui Tang (Nanjing University of Science and Technology); Gangshan Wu (Nanjing University)

2463 - Expressional Region Retrieval

"xiaoqian guo (Institute of Computing Technology, Chinese Academy of Sciences)*; Xiangyang Li (Institute of Computing Technology, Chinese Academy of Sciences); Shuqiang Jiang (ICT, China Academy of Science)"

2464 - Generalized Zero-shot Learning with Multi-source Semantic Embeddings for Scene Recognition

"Xinhang Song (ICT)*; Haitao Zeng (China University of Mining & Technology (Beijing),and ICT, Chinese Academy of Sciences); sixian zhang (Key Lab of Intelligent Information Processing of Chinese Academy of Sciences (CAS)); Luis Herranz (Computer Vision Center); Shuqiang Jiang (ICT, China Academy of Science)"

2485 - ATRW: A Benchmark for Amur Tiger Re-identification in the Wild

Shuyuan Li (Shanghai Jiao Tong University); Jianguo Li (Ant Group)*; Hanlin Tang (Intel Corporation); Rui Qian (Shanghai Jiao Tong University); Weiyao Lin (Shanghai Jiao Tong university)

2486 - Emotions Don't Lie: An Audio-Visual Deepfake Detection Method using Affective Cues

(Video) Paper Introduction: A Smart Adversarial Attack on Deep Hashing Based Image Retrieval

"Trisha Mittal (University of Maryland)*; Uttaran Bhattacharya (University of Maryland, College Park); Rohan Chandra (University of Maryland); Aniket Bera (University of Maryland, College Park); Dinesh Manocha (UMD)"

Videos

1. New Workflow Template for ACM Authors
(Association for Computing Machinery (ACM))
2. Residual GANs for artifacts removal SUMAC20 #24
(Margarita Khokhlova)
3. My Chemical Romance - I Don't Love You [Official Music Video] [HD]
(My Chemical Romance)
4. ACM Projects Spring 2020 Presentations
(ACM UT Dallas)
5. ACM Tutorial
(Imac Project)
6. Semantic Comparison of Alloy Models for MoDELS 2020 Artefact Evaluation
(Jan Oliver Ringert)
Top Articles
Latest Posts
Article information

Author: Domingo Moore

Last Updated: 03/25/2023

Views: 5742

Rating: 4.2 / 5 (73 voted)

Reviews: 88% of readers found this page helpful

Author information

Name: Domingo Moore

Birthday: 1997-05-20

Address: 6485 Kohler Route, Antonioton, VT 77375-0299

Phone: +3213869077934

Job: Sales Analyst

Hobby: Kayaking, Roller skating, Cabaret, Rugby, Homebrewing, Creative writing, amateur radio

Introduction: My name is Domingo Moore, I am a attractive, gorgeous, funny, jolly, spotless, nice, fantastic person who loves writing and wants to share my knowledge and understanding with you.