Dr Xin Yu

ARC DECRA

School of Electrical Engineering and Computer Science

Faculty of Engineering, Architecture and Information Technology

xin.yu@uq.edu.au

Overview

I am Xin Yu, a Senior Lecturer at the University of Queensland. I am a recipient of the Australian Research Council Discovery Early Career Researcher Award (DECRA) for 2023-2025 and an awardee of the Google Research Scholar Program in 2021. Previously, I was a research fellow at the Australian National University (ANU). I received a PhD from the Australian National University under the supervision of Prof. Richard Hartley, Prof. Fatih Porikli and Dr. Basura Fernando, and a PhD from Tsinghua University under the supervision of Prof. Li Zhang. My research interests are in computer vision and machine learning.

My research covers a range of computer vision and machine learning tasks, particularly efficient low-level image processing, image retrieval and localization, action recognition, 3D pose estimation, visual navigation, and sign language recognition and translation.

Research Impacts

One of my papers received the Best Paper Honorable Mention award at WACV 2020, a premier computer vision conference, and another was nominated for the Best Paper Award at CVPR 2020.

I received the Outstanding Reviewer Award at ECCV 2020, CVPR 2021 and ICCV 2021. CVPR, ICCV and ECCV are world-leading computer vision and machine learning conferences. My research interests include deep learning techniques, image processing, and computer vision tasks. I am a program committee member for top-tier computer vision and machine learning conferences, such as CVPR, ICCV, ECCV, ICML, ICLR and NeurIPS, and a reviewer for leading journals, such as TPAMI, IJCV and TIP.

I am happy to supervise self-motivated PhD and MPhil students. If you are an undergraduate student interested in undertaking an honours project, please send me an email.

Grants

Breaking the Communication Barrier for the Australian Deaf Community: Vision Based Australian Sign Language Translation and Production

(2024) Google Inc
Breaking the Communication Barrier for the Australian Deaf Community: Vision Based Australian Sign Language Translation and Production

(2023–2028) Google Asia Pacific Pte Ltd
Analytics for the Australian Grains Industry (AAGI)

(2023–2027) Grains Research & Development Corporation

Supervision

Two way Auslan Translation

Doctor of Philosophy
The prediction, diagnosis, and severity estimation models for plant disease

Doctor of Philosophy
Digital Asset IP Protection

Doctor of Philosophy

Publications

Book Chapter

Leaping from 2D Detection to Efficient 6DoF Object Pose Estimation

Liu, Jinhui, Zou, Zhikang, Ye, Xiaoqing, Tan, Xiao, Ding, Errui, Xu, Feng and Yu, Xin (2020). Leaping from 2D Detection to Efficient 6DoF Object Pose Estimation. Computer Vision – ECCV 2020 Workshops. (pp. 707-714) Cham: Springer International Publishing. doi: 10.1007/978-3-030-66096-3_47
Learning Object Relation Graph and Tentative Policy for Visual Navigation

Du, Heming, Yu, Xin and Zheng, Liang (2020). Learning Object Relation Graph and Tentative Policy for Visual Navigation. Computer Vision – ECCV 2020. (pp. 19-34) Cham: Springer International Publishing. doi: 10.1007/978-3-030-58571-6_2

Journal Article

AI empowered Auslan learning for parents of deaf children and children of deaf adults

Sheng, Hongwei, Shen, Xin, Du, Heming, Zhang, Hu, Huang, Zi and Yu, Xin (2024). AI empowered Auslan learning for parents of deaf children and children of deaf adults. AI and Ethics, 1-11. doi: 10.1007/s43681-024-00457-y
Detecting facial action units from global-local fine-grained expressions

Zhang, Wei, Li, Lincheng, Ding, Yu, Chen, Wei, Deng, Zhigang and Yu, Xin (2024). Detecting facial action units from global-local fine-grained expressions. IEEE Transactions on Circuits and Systems for Video Technology, 34 (2), 983-994. doi: 10.1109/tcsvt.2023.3288903
CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields From Imperfect Camera Poses

Fu, Hongyu, Yu, Xin, Li, Lincheng and Zhang, Li (2024). CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields From Imperfect Camera Poses. IEEE Transactions on Multimedia, 1-12. doi: 10.1109/tmm.2024.3388929
CMGNet: Collaborative multi-modal graph network for video captioning

Rao, Qi, Yu, Xin, Li, Guang and Zhu, Linchao (2024). CMGNet: Collaborative multi-modal graph network for video captioning. Computer Vision and Image Understanding, 238 103864, 1-10. doi: 10.1016/j.cviu.2023.103864
MarkerNet: A divide-and-conquer solution to motion capture solving from raw markers

Hu, Zhipeng, Tang, Jilin, Li, Lincheng, Hou, Jie, Xin, Haoran, Yu, Xin and Bu, Jiajun (2024). MarkerNet: A divide-and-conquer solution to motion capture solving from raw markers. Computer Animation and Virtual Worlds, 35 (1). doi: 10.1002/cav.2228
StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads

Wang, Suzhen, Ma, Yifeng, Ding, Yu, Hu, Zhipeng, Fan, Changjie, Lv, Tangjie, Deng, Zhidong and Yu, Xin (2024). StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1-17. doi: 10.1109/tpami.2024.3357808
DMMG: Dual min-max games for self-supervised skeleton-based action recognition

Guan, Shannan, Yu, Xin, Huang, Wei, Fang, Gengfa and Lu, Haiyan (2023). DMMG: Dual min-max games for self-supervised skeleton-based action recognition. IEEE Transactions on Image Processing, 33, 395-407. doi: 10.1109/tip.2023.3338410
UNesT: Local spatial representation learning with hierarchical transformer for efficient medical segmentation

Yu, Xin, Yang, Qi, Zhou, Yinchi, Cai, Leon Y., Gao, Riqiang, Lee, Ho Hin, Li, Thomas, Bao, Shunxing, Xu, Zhoubing, Lasko, Thomas A., Abramson, Richard G., Zhang, Zizhao, Huo, Yuankai, Landman, Bennett A. and Tang, Yucheng (2023). UNesT: Local spatial representation learning with hierarchical transformer for efficient medical segmentation. Medical Image Analysis, 90 102939, 1-15. doi: 10.1016/j.media.2023.102939
Deep idempotent network for efficient single image blind deblurring

Mao, Yuxin, Wan, Zhexiong, Dai, Yuchao and Yu, Xin (2023). Deep idempotent network for efficient single image blind deblurring. IEEE Transactions on Circuits and Systems for Video Technology, 33 (1), 172-185. doi: 10.1109/tcsvt.2022.3202361
Single slice thigh CT muscle group segmentation with domain adaptation and self-training

Yang, Qi, Yu, Xin, Lee, Ho Hin, Cai, Leon Y., Xu, Kaiwen, Bao, Shunxing, Huo, Yuankai, Moore, Ann Zenobia, Makrogiannis, Sokratis, Ferrucci, Luigi and Landman, Bennett A (2023). Single slice thigh CT muscle group segmentation with domain adaptation and self-training. Journal of Medical Imaging, 10 (4) 044001, 1-12. doi: 10.1117/1.JMI.10.4.044001
A consensus protocol for functional connectivity analysis in the rat brain

Grandjean, Joanes, Desrosiers-Gregoire, Gabriel, Anckaerts, Cynthia, Angeles-Valdez, Diego, Ayad, Fadi, Barrière, David A., Blockx, Ines, Bortel, Aleksandra, Broadwater, Margaret, Cardoso, Beatriz M., Célestine, Marina, Chavez-Negrete, Jorge E., Choi, Sangcheon, Christiaen, Emma, Clavijo, Perrin, Colon-Perez, Luis, Cramer, Samuel, Daniele, Tolomeo, Dempsey, Elaine, Diao, Yujian, Doelemeyer, Arno, Dopfel, David, Dvořáková, Lenka, Falfán-Melgoza, Claudia, Fernandes, Francisca F., Fowler, Caitlin F., Fuentes-Ibañez, Antonio, Garin, Clément, Gelderman, Eveline ... Hess, Andreas (2023). A consensus protocol for functional connectivity analysis in the rat brain. Nature Neuroscience, 26 (4), 673-681. doi: 10.1038/s41593-023-01286-8
Accurate 3-DoF camera geo-localization via ground-to-satellite image matching

Shi, Yujiao, Yu, Xin, Liu, Liu, Campbell, Dylan, Koniusz, Piotr and Li, Hongdong (2023). Accurate 3-DoF camera geo-localization via ground-to-satellite image matching. IEEE Transactions on Pattern Analysis and Machine Intelligence, 45 (3), 2682-2697. doi: 10.1109/TPAMI.2022.3189702
Boosting model inversion attacks with adversarial examples

Zhou, Shuai, Zhu, Tianqing, Ye, Dayong, Yu, Xin and Zhou, Wanlei (2023). Boosting model inversion attacks with adversarial examples. IEEE Transactions on Dependable and Secure Computing, 1-18. doi: 10.1109/TDSC.2023.3285015
Calligraphy Font Generation via Explicitly Modeling Location-aware Glyph Component Deformations

Zhao, Minda, Qi, Xingqun, Hu, Zhipeng, Li, Lincheng, Zhang, Yongqiang, Huang, Zi and Yu, Xin (2023). Calligraphy Font Generation via Explicitly Modeling Location-aware Glyph Component Deformations. IEEE Transactions on Multimedia, 26, 1-13. doi: 10.1109/tmm.2023.3342690
Cyclic self-training with proposal weight modulation for cross-supervised object detection

Xu, Yunqiu, Zhou, Chunluan, Yu, Xin and Yang, Yi (2023). Cyclic self-training with proposal weight modulation for cross-supervised object detection. IEEE Transactions on Image Processing, 32, 1992-2002. doi: 10.1109/TIP.2023.3261752
HairStyle editing via parametric controllable strokes

Song, Xinhui, Liu, Chen, Zheng, Youyi, Feng, Zunlei, Li, Lincheng, Zhou, Kun and Yu, Xin (2023). HairStyle editing via parametric controllable strokes. IEEE Transactions on Visualization and Computer Graphics, 1-14. doi: 10.1109/TVCG.2023.3241894
Deep hierarchical representation of point cloud videos via spatio-temporal decomposition

Fan, Hehe, Yu, Xin, Yang, Yi and Kankanhalli, Mohan (2022). Deep hierarchical representation of point cloud videos via spatio-temporal decomposition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44 (12), 9918-9930. doi: 10.1109/TPAMI.2021.3135117
Geometry-guided street-view panorama synthesis from satellite imagery

Shi, Yujiao, Campbell, Dylan, Yu, Xin and Li, Hongdong (2022). Geometry-guided street-view panorama synthesis from satellite imagery. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44 (12), 10009-10022. doi: 10.1109/TPAMI.2022.3140750
Single image based 3D human pose estimation via uncertainty learning

Han, Chuchu, Yu, Xin, Gao, Changxin, Sang, Nong and Yang, Yi (2022). Single image based 3D human pose estimation via uncertainty learning. Pattern Recognition, 132 108934. doi: 10.1016/j.patcog.2022.108934
Recursive copy and paste GAN: face hallucination from shaded thumbnails

Zhang, Yang, Tsang, Ivor W., Luo, Yawei, Hu, Changhui, Lu, Xiaobo and Yu, Xin (2022). Recursive copy and paste GAN: face hallucination from shaded thumbnails. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44 (8), 4321-4338. doi: 10.1109/TPAMI.2021.3061312
High frame rate video reconstruction based on an event camera

Pan, Liyuan, Hartley, Richard, Scheerlinck, Cedric, Liu, Miaomiao, Yu, Xin and Dai, Yuchao (2022). High frame rate video reconstruction based on an event camera. IEEE Transactions on Pattern Analysis and Machine Intelligence, 44 (5), 2519-2533. doi: 10.1109/TPAMI.2020.3036667
Single-image deraining via recurrent residual multiscale networks

Zheng, Yupei, Yu, Xin, Liu, Miaomiao and Zhang, Shunli (2022). Single-image deraining via recurrent residual multiscale networks. IEEE Transactions on Neural Networks and Learning Systems, 33 (3), 1310-1323. doi: 10.1109/TNNLS.2020.3041752
Pro-UIGAN: progressive face hallucination from occluded thumbnails

Zhang, Yang, Yu, Xin, Lu, Xiaobo and Liu, Ping (2022). Pro-UIGAN: progressive face hallucination from occluded thumbnails. IEEE Transactions on Image Processing, 31, 3236-3250. doi: 10.1109/TIP.2022.3167280
Understanding atomic hand-object interaction with human intention

Fan, Hehe, Zhuo, Tao, Yu, Xin, Yang, Yi and Kankanhalli, Mohan (2022). Understanding atomic hand-object interaction with human intention. IEEE Transactions on Circuits and Systems for Video Technology, 32 (1), 275-285. doi: 10.1109/TCSVT.2021.3058688
Weakly supervised RGB-D salient object detection with prediction consistency training and active scribble boosting

Xu, Yunqiu, Yu, Xin, Zhang, Jing, Zhu, Linchao and Wang, Dadong (2022). Weakly supervised RGB-D salient object detection with prediction consistency training and active scribble boosting. IEEE Transactions on Image Processing, 31, 2148-2161. doi: 10.1109/TIP.2022.3151999
Learning with noisy labels via self-reweighting from class centroids

Ma, Fan, Wu, Yu, Yu, Xin and Yang, Yi (2021). Learning with noisy labels via self-reweighting from class centroids. IEEE Transactions on Neural Networks and Learning Systems, 33 (11), 6275-6285. doi: 10.1109/TNNLS.2021.3073248
Progressive transfer learning for face anti-spoofing

Quan, Ruijie, Wu, Yu, Yu, Xin and Yang, Yi (2021). Progressive transfer learning for face anti-spoofing. IEEE Transactions on Image Processing, 30, 3946-3955. doi: 10.1109/TIP.2021.3066912
Face hallucination with finishing touches

Zhang, Yang, Tsang, Ivor W., Li, Jun, Liu, Ping, Lu, Xiaobo and Yu, Xin (2021). Face hallucination with finishing touches. IEEE Transactions on Image Processing, 30 9318504, 1728-1743. doi: 10.1109/TIP.2020.3046918
Pyramidal multiple instance detection network with mask guided self-correction for weakly supervised object detection

Xu, Yunqiu, Zhou, Chunluan, Yu, Xin, Xiao, Bin and Yang, Yi (2021). Pyramidal multiple instance detection network with mask guided self-correction for weakly supervised object detection. IEEE Transactions on Image Processing, 30, 3029-3040. doi: 10.1109/TIP.2021.3056887
Single Image Portrait Relighting via Explicit Multiple Reflectance Channel Modeling

Wang, Zhibo, Yu, Xin, Lu, Ming, Wang, Quan, Qian, Chen and Xu, Feng (2020). Single Image Portrait Relighting via Explicit Multiple Reflectance Channel Modeling. ACM Transactions on Graphics, 39 (6). doi: 10.1145/3414685.3417824
Semantic Face Hallucination: Super-Resolving Very Low-Resolution Face Images with Supplementary Attributes

Yu, Xin, Fernando, Basura, Hartley, Richard and Porikli, Fatih (2020). Semantic Face Hallucination: Super-Resolving Very Low-Resolution Face Images with Supplementary Attributes. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42 (11), 2926-2943. doi: 10.1109/TPAMI.2019.2916881
Can We See More? Joint Frontalization and Hallucination of Unaligned Tiny Faces

Yu, Xin, Shiri, Fatemeh, Ghanem, Bernard and Porikli, Fatih (2020). Can We See More? Joint Frontalization and Hallucination of Unaligned Tiny Faces. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42 (9) 8704962, 2148-2164. doi: 10.1109/TPAMI.2019.2914039
Hallucinating Unaligned Face Images by Multiscale Transformative Discriminative Networks

Yu, Xin, Porikli, Fatih, Fernando, Basura and Hartley, Richard (2019). Hallucinating Unaligned Face Images by Multiscale Transformative Discriminative Networks. International Journal of Computer Vision, 128 (2), 500-526. doi: 10.1007/s11263-019-01254-5
Identity-Preserving Face Recovery from Stylized Portraits

Shiri, Fatemeh, Yu, Xin, Porikli, Fatih, Hartley, Richard and Koniusz, Piotr (2019). Identity-Preserving Face Recovery from Stylized Portraits. International Journal of Computer Vision, 127 (6-7), 863-883. doi: 10.1007/s11263-019-01169-1
Single Image Depth Estimation With Normal Guided Scale Invariant Deep Convolutional Fields

Yan, Han, Yu, Xin, Zhang, Yu, Zhang, Shunli, Zhao, Xiaolin and Zhang, Li (2019). Single Image Depth Estimation With Normal Guided Scale Invariant Deep Convolutional Fields. IEEE Transactions on Circuits and Systems for Video Technology, 29 (1) 8105853, 80-92. doi: 10.1109/TCSVT.2017.2772892
Imagining the Unimaginable Faces by Deconvolutional Networks

Yu, Xin and Porikli, Fatih (2018). Imagining the Unimaginable Faces by Deconvolutional Networks. IEEE Transactions on Image Processing, 27 (6), 2747-2761. doi: 10.1109/TIP.2018.2808840
PMSC: PatchMatch-Based Superpixel Cut for Accurate Stereo Matching

Li, Lincheng, Zhang, Shunli, Yu, Xin and Zhang, Li (2018). PMSC: PatchMatch-Based Superpixel Cut for Accurate Stereo Matching. IEEE Transactions on Circuits and Systems for Video Technology, 28 (3), 679-692. doi: 10.1109/TCSVT.2016.2628782
3D cost aggregation with multiple minimum spanning trees for stereo matching

Li, Lincheng, Yu, Xin, Zhang, Shunli, Zhao, Xiaolin and Zhang, Li (2017). 3D cost aggregation with multiple minimum spanning trees for stereo matching. Applied Optics, 56 (12), 3411-3420. doi: 10.1364/AO.56.003411
Multi-local-task learning with global regularization for object tracking

Zhang, Shunli, Sui, Yao, Zhao, Sicong, Yu, Xin and Zhang, Li (2015). Multi-local-task learning with global regularization for object tracking. Pattern Recognition, 48 (12), 3881-3894. doi: 10.1016/j.patcog.2015.06.005
Self-expressive tracking

Sui, Yao, Zhao, Xiaolin, Zhang, Shunli, Yu, Xin, Zhao, Sicong and Zhang, Li (2015). Self-expressive tracking. Pattern Recognition, 48 (9), 2872-2884. doi: 10.1016/j.patcog.2015.03.007
Hybrid support vector machines for robust object tracking

Zhang, Shunli, Sui, Yao, Yu, Xin, Zhao, Sicong and Zhang, Li (2015). Hybrid support vector machines for robust object tracking. Pattern Recognition, 48 (8), 2474-2488. doi: 10.1016/j.patcog.2015.02.008
Object Tracking With Multi-View Support Vector Machines

Zhang, Shunli, Yu, Xin, Sui, Yao, Zhao, Sicong and Zhang, Li (2015). Object Tracking With Multi-View Support Vector Machines. IEEE Transactions on Multimedia, 17 (3), 265-278. doi: 10.1109/TMM.2015.2390044
Removing blur kernel noise via a hybrid l(p) norm

Yu, Xin, Zhang, Shunli, Zhao, Xiaolin and Zhang, Li (2015). Removing blur kernel noise via a hybrid l(p) norm. Journal of Electronic Imaging, 24 (1). doi: 10.1117/1.JEI.24.1.013011
Handling noise in single image defocus map estimation by using directional filters

Yu, Xin, Zhao, Xiaolin, Sui, Yao and Zhang, Li (2014). Handling noise in single image defocus map estimation by using directional filters. Optics Letters, 39 (21), 6281-6284. doi: 10.1364/OL.39.006281
Efficient Patch-Wise Non-Uniform Deblurring for a Single Image

Yu, Xin, Xu, Feng, Zhang, Shunli and Zhang, Li (2014). Efficient Patch-Wise Non-Uniform Deblurring for a Single Image. IEEE Transactions on Multimedia, 16 (6), 1510-1524. doi: 10.1109/TMM.2014.2321734
Non-rigid Object Tracking as Salient Region Segmentation and Association

Zhao, Xiaolin, Yu, Xin, Sun, Liguo, Hu, Kangqiao, Wang, Guijin and Zhang, Li (2011). Non-rigid Object Tracking as Salient Region Segmentation and Association. IEICE Transactions on Information and Systems, E94D (4), 934-937. doi: 10.1587/transinf.E94.D.934

Conference Publication

Learning efficient unsupervised satellite image-based building damage detection

Zhang, Yiyun, Wang, Zijian, Luo, Yadan, Yu, Xin and Huang, Zi (2023). Learning efficient unsupervised satellite image-based building damage detection. 2023 IEEE International Conference on Data Mining (ICDM), Shanghai, China, 1-4 December 2023. Piscataway, NJ, United States: IEEE. doi: 10.1109/icdm58522.2023.00206
A new perspective of weakly supervised 3D instance segmentation via bounding boxes

Yu, Qingtao, Du, Heming and Yu, Xin (2023). A new perspective of weakly supervised 3D instance segmentation via bounding boxes. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November – 1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8388-9_9
Context-based masking for spontaneous venous pulsations detection

Sheng, Hongwei, Yu, Xin, Li, Xue and Golzan, Mojtaba (2023). Context-based masking for spontaneous venous pulsations detection. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November – 1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8388-9_42
Toward a unified framework for RGB and RGB-D visual navigation

Du, Heming, Huang, Zi, Chapman, Scott and Yu, Xin (2023). Toward a unified framework for RGB and RGB-D visual navigation. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November – 1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8391-9_29
Towards reliable and efficient vegetation segmentation for Australian wheat data analysis

Yuan, Bowen, Wang, Zijian and Yu, Xin (2023). Towards reliable and efficient vegetation segmentation for Australian wheat data analysis. 34th Australasian Database Conference (ADC), Melbourne, VIC Australia, 1-3 November 2023. Cham, Switzerland: Springer Cham. doi: 10.1007/978-3-031-47843-7_9
Audio-visual segmentation by exploring cross-modal mutual semantics

Liu, Chen, Li, Peike Patrick, Qi, Xingqun, Zhang, Hu, Li, Lincheng, Wang, Dadong and Yu, Xin (2023). Audio-visual segmentation by exploring cross-modal mutual semantics. MM '23: The 31st ACM International Conference on Multimedia, Ottawa, ON Canada, 29 October - 3 November 2023. New York, NY United States: Association for Computing Machinery. doi: 10.1145/3581783.3612373
DyGait: Exploiting Dynamic Representations for High-performance Gait Recognition

Wang, Ming, Guo, Xianda, Lin, Beibei, Yang, Tian, Zhu, Zheng, Li, Lincheng, Zhang, Shunli and Yu, Xin (2023). DyGait: Exploiting Dynamic Representations for High-performance Gait Recognition. IEEE. doi: 10.1109/iccv51070.2023.01235
Scaling up 3D Kernels with Bayesian frequency re-parameterization for medical image segmentation

Lee, Ho Hin, Liu, Quan, Bao, Shunxing, Yang, Qi, Yu, Xin, Cai, Leon Y., Li, Thomas Z., Huo, Yuankai, Koutsoukos, Xenofon and Landman, Bennett A. (2023). Scaling up 3D Kernels with Bayesian frequency re-parameterization for medical image segmentation. MICCAI 2023 26th International Conference, Vancouver, BC, Canada, 8–12 October 2023. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-43901-8_60
Gait Recognition with Mask-based Regularization

Shen, Chuanfu, Lin, Beibei, Zhang, Shunli, Yu, Xin, Huang, George Q. and Yu, Shiqi (2023). Gait Recognition with Mask-based Regularization. IEEE. doi: 10.1109/ijcb57857.2023.10449112
Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand Disentanglement

Qi, Xingqun, Liu, Chen, Sun, Muyi, Li, Lincheng, Fan, Changjie and Yu, Xin (2023). Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand Disentanglement. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 17-24 June 2023. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvpr52729.2023.00448
Autonomous Stabilization of Retinal Videos for Streamlining Assessment of Spontaneous Venous Pulsations

Sheng, Hongwei, Yu, Xin, Wang, Feiyu, Khan, MD Wahiduzzaman, Weng, Hexuan, Shariflou, Sahar and Golzan, S. Mojtaba (2023). Autonomous Stabilization of Retinal Videos for Streamlining Assessment of Spontaneous Venous Pulsations. 45th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), Sydney, Australia, 24-27 July 2023. New York: IEEE. doi: 10.1109/embc40787.2023.10341088
NeFII: Inverse Rendering for Reflectance Decomposition with Near-Field Indirect Illumination

Wu, Haoqian, Hu, Zhipeng, Li, Lincheng, Zhang, Yongqiang, Fan, Changjie and Yu, Xin (2023). NeFII: Inverse Rendering for Reflectance Decomposition with Near-Field Indirect Illumination. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 17-24 June 2023. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvpr52729.2023.00418
Object-goal visual navigation via effective exploration of relations among historical navigation states

Du, Heming, Li, Lincheng, Huang, Zi and Yu, Xin (2023). Object-goal visual navigation via effective exploration of relations among historical navigation states. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 17-24 June 2023. Piscataway, NJ, United States: IEEE. doi: 10.1109/cvpr52729.2023.00252
Getting Away with More Network Pruning: From Sparsity to Geometry and Linear Regions

Cai, Junyang, Nguyen, Khai-Nguyen, Shrestha, Nishant, Good, Aidan, Tu, Ruisen, Yu, Xin, Zhe, Shandian and Serra, Thiago (2023). Getting Away with More Network Pruning: From Sparsity to Geometry and Linear Regions. 20th International Conference on the Integration of Constraint Programming, Artificial Intelligence, and Operations Research, Nice, France, 29 May – 1 June 2023. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-33271-5_14
Exploring active 3D object detection from a generalization perspective

Luo, Yadan, Chen, Zhuoxiao, Wang, Zijian, Yu, Xin, Huang, Zi and Baktashmotlagh, Mahsa (2023). Exploring active 3D object detection from a generalization perspective. 11th International Conference on Learning Representations (ICLR), Kigali, Rwanda, 1 - 5 May 2023. New York, NY, United States: Cornell Tech. doi: 10.48550/arXiv.2301.09249
Topological-preserving membrane skeleton segmentation in multiplex immunofluorescence imaging

Bao, Shunxing, Cui, Can, Li, Jia, Tang, Yucheng, Lee, Ho Hin, Deng, Ruining, Remedios, Lucas W., Yu, Xin, Yang, Qi, Chiron, Sophie, Patterson, Nathan H., Lau, Ken S., Liu, Qi, Roland, Joseph T., Coburn, Lori A., Wilson, Keith T., Landman, Bennett A. and Huo, Yuankai (2023). Topological-preserving membrane skeleton segmentation in multiplex immunofluorescence imaging. Conference on Medical Imaging - Digital and Computational Pathology, San Diego, CA, United States, 19-23 February 2023. Bellingham, WA, United States: SPIE - International Society for Optical Engineering. doi: 10.1117/12.2654087
A divide-and-conquer solution to 3D human motion estimation from raw MoCap data

Tang, Jilin, Li, Lincheng, Hou, Jie, Xin, Haoran and Yu, Xin (2023). A divide-and-conquer solution to 3D human motion estimation from raw MoCap data. 30th IEEE Conference Virtual Reality and 3D User Interfaces (IEEE VR), Shanghai, China, 25-29 March 2023. Piscataway, NJ United States: IEEE. doi: 10.1109/vrw58643.2023.00226
Sign Spotting via Multi-modal Fusion and Testing Time Transferring

Fu, Hongyu, Liu, Chen, Qi, Xingqun, Lin, Beibei, Li, Lincheng, Zhang, Li and Yu, Xin (2023). Sign Spotting via Multi-modal Fusion and Testing Time Transferring. European Conference on Computer Vision (ECCV 2022), Tel Aviv, Israel, 23–27 October 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-25085-9_16
Deep whole brain segmentation of 7T structural MRI

Ramadass, Karthik, Yu, Xin, Cai, Leon Y., Tang, Yucheng, Bao, Shunxing, Kerley, Cailey, D'Archangel, Micah, Barquero, Laura A., Newton, Allen T., Gauthier, Isabel, McGugin, Rankin Williams, Dawant, Benoit M., Cutting, Laurie E., Huo, Yuankai and Landman, Bennett A. (2023). Deep whole brain segmentation of 7T structural MRI. SPIE Medical Imaging 2023: Image Processing, San Diego, CA United States, 19–23 February 2023. Bellingham, WA United States: SPIE. doi: 10.1117/12.2654108
Longitudinal Variability Analysis on Low-dose Abdominal CT with Deep Learning-based Segmentation

Yu, Xin, Tang, Yucheng, Yang, Qi, Lee, Ho Hin, Gao, Riqiang, Bao, Shunxing, Moore, Ann Zenobia, Ferrucci, Luigi and Landman, Bennett A (2023). Longitudinal Variability Analysis on Low-dose Abdominal CT with Deep Learning-based Segmentation. SPIE Medical Imaging 2023: Image Processing, San Diego, CA United States, 19–23 February 2023. Bellingham, WA United States: SPIE. doi: 10.1117/12.2653762
Proactive deepfake defence via identity watermarking

Zhao, Yuan, Liu, Bo, Ding, Ming, Liu, Baoping, Zhu, Tianqing and Yu, Xin (2023). Proactive deepfake defence via identity watermarking. 23rd IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 3-7 January 2023. Piscataway, NJ United States: IEEE. doi: 10.1109/wacv56688.2023.00458
TI2Net: Temporal identity inconsistency network for deepfake detection

Liu, Baoping, Liu, Bo, Ding, Ming, Zhu, Tianqing and Yu, Xin (2023). TI2Net: Temporal identity inconsistency network for deepfake detection. 23rd IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 3-7 January 2023. Piscataway, NJ United States: IEEE. doi: 10.1109/wacv56688.2023.00467
Weakly-supervised Point Cloud Instance Segmentation with Geometric Priors

Du, Heming, Yu, Xin, Hussain, Farookh, Armin, Mohammad Ali, Petersson, Lars and Li, Weihao (2023). Weakly-supervised Point Cloud Instance Segmentation with Geometric Priors. 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 2-7 January 2023. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/wacv56688.2023.00425
CVLNet: Cross-view Semantic Correspondence Learning for Video-Based Camera Localization

Shi, Yujiao, Yu, Xin, Wang, Shan and Li, Hongdong (2023). CVLNet: Cross-view Semantic Correspondence Learning for Video-Based Camera Localization. 16th Asian Conference on Computer Vision, Macao, China, 4–8 December 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-26319-4_8
GaitStrip: Gait Recognition via Effective Strip-Based Feature Representations and Multi-level Framework

Wang, Ming, Lin, Beibei, Guo, Xianda, Li, Lincheng, Zhu, Zheng, Sun, Jiande, Zhang, Shunli, Liu, Yu and Yu, Xin (2023). GaitStrip: Gait Recognition via Effective Strip-Based Feature Representations and Multi-level Framework. 16th Asian Conference on Computer Vision, Macao, China, 4–8 December 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-26316-3_42
Sim2RealVS: A new benchmark for video stabilization with a strong baseline

Rao, Qi, Yu, Xin, Navasardyan, Shant and Shi, Humphrey (2023). Sim2RealVS: A new benchmark for video stabilization with a strong baseline. 23rd IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 3-7 January 2023. Piscataway, NJ United States: IEEE. doi: 10.1109/wacv56688.2023.00537
MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views

Zeng, Haitian, Yu, Xin, Miao, Jiaxu and Yang, Yi (2022). MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views. European Conference on Computer Vision (ECCV 2022), Tel Aviv, Israel, 23-27 October 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-20086-1_1
Instance as identity: a generic online paradigm for video instance segmentation

Zhu, Feng, Yang, Zongxin, Yu, Xin, Yang, Yi and Wei, Yunchao (2022). Instance as identity: a generic online paradigm for video instance segmentation. Computer Vision – ECCV 2022 17th European Conference, Tel Aviv, Israel, 23–27 October 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-19818-2_30
Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields

Yao, Guangming, Wu, Hongzhi, Yuan, Yi, Li, Lincheng, Zhou, Kun and Yu, Xin (2022). Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields. Thirty-First International Joint Conference on Artificial Intelligence IJCAI-ECAI 2022, Vienna, Austria, 23-29 July 2022. Los Angeles, CA United States: International Joint Conferences on Artificial Intelligence Organization. doi: 10.24963/ijcai.2022/218
Monocular Camera-Based Point-Goal Navigation by Learning Depth Channel and Cross-Modality Pyramid Fusion

Tang, Tianqi, Du, Heming, Yu, Xin and Yang, Yi (2022). Monocular Camera-Based Point-Goal Navigation by Learning Depth Channel and Cross-Modality Pyramid Fusion. Thirty-Sixth AAAI Conference on Artificial Intelligence, Online, 22 February - 1 March 2022. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v36i5.20480
Monocular camera-based point-goal navigation by learning depth channel and cross-modality pyramid fusion

Tang, Tianqi, Du, Heming, Yu, Xin and Yang, Yi (2022). Monocular camera-based point-goal navigation by learning depth channel and cross-modality pyramid fusion. 36th AAAI Conference on Artificial Intelligence / 34th Conference on Innovative Applications of Artificial Intelligence / 12th Symposium on Educational Advances in Artificial Intelligence, Online, 22 February - 1 March 2022. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v36i5.20480
One-Shot Talking Face Generation from Single-Speaker Audio-Visual Correlation Learning

Wang, Suzhen, Li, Lincheng, Ding, Yu and Yu, Xin (2022). One-Shot Talking Face Generation from Single-Speaker Audio-Visual Correlation Learning. Thirty-Sixth AAAI Conference on Artificial Intelligence, Online, 22 February - 1 March 2022. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v36i3.20154
One-shot talking face generation from single-speaker audio-visual correlation learning

Wang, Suzhen, Li, Lincheng, Ding, Yu and Yu, Xin (2022). One-shot talking face generation from single-speaker audio-visual correlation learning. 36th AAAI Conference on Artificial Intelligence / 34th Conference on Innovative Applications of Artificial Intelligence / 12th Symposium on Educational Advances in Artificial Intelligence, Online, 22 February - 1 March 2022. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v36i3.20154
Batch Multi-Fidelity Active Learning with Budget Constraints

Li, Shibo, Phillips, Jeff M., Yu, Xin, Kirby, Robert M. and Zhe, Shandian (2022). Batch Multi-Fidelity Active Learning with Budget Constraints. 36th Conference on Neural Information Processing Systems (NeurIPS 2022), Online, 28 November - 9 December 2022. Maryland Heights, MO United States: Morgan Kaufmann Publishers.
End-to-end multi-instance robotic reaching from monocular vision

Zhuang, Zheyu, Yu, Xin and Mahony, Robert (2021). End-to-end multi-instance robotic reaching from monocular vision. IEEE International Conference on Robotics and Automation (ICRA), Xi'an, China, 30 May - 5 June 2021. Washington, DC United States: IEEE Computer Society. doi: 10.1109/ICRA48506.2021.9561518
Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion

Wang, Suzhen, Li, Lincheng, Ding, Yu, Fan, Changjie and Yu, Xin (2021). Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion. Thirtieth International Joint Conference on Artificial Intelligence, Montreal, Canada, 19-27 August 2021. Los Angeles, CA United States: International Joint Conferences on Artificial Intelligence Organization. doi: 10.24963/ijcai.2021/152
Auto-navigator: decoupled neural architecture search for visual navigation

Tang, Tianqi, Yu, Xin, Dong, Xuanyi and Yang, Yi (2021). Auto-navigator: decoupled neural architecture search for visual navigation. IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 5-9 January 2021. Piscataway, NJ United States: IEEE. doi: 10.1109/WACV48630.2021.00379
The IKEA ASM dataset: understanding people assembling furniture through actions, objects and pose

Ben-Shabat, Yizhak, Yu, Xin, Saleh, Fatemeh, Campbell, Dylan, Rodriguez-Opazo, Cristian, Li, Hongdong and Gould, Stephen (2021). The IKEA ASM dataset: understanding people assembling furniture through actions, objects and pose. IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 5-9 January 2021. Piscataway, NJ United States: IEEE Computer Society. doi: 10.1109/WACV48630.2021.00089
Modeling the probabilistic distribution of unlabeled data for one-shot medical image segmentation

Ding, Yuhang, Yu, Xin and Yang, Yi (2021). Modeling the probabilistic distribution of unlabeled data for one-shot medical image segmentation. 35th AAAI Conference on Artificial Intelligence / 33rd Conference on Innovative Applications of Artificial Intelligence / 11th Symposium on Educational Advances in Artificial Intelligence, Online, 2–9 February 2021. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v35i2.16212
Write-a-speaker: Text-based emotional and rhythmic talking-head generation

Li, Lincheng, Wang, Suzhen, Zhang, Zhimeng, Ding, Yu, Zheng, Yixing, Yu, Xin and Fan, Changjie (2021). Write-a-speaker: Text-based emotional and rhythmic talking-head generation. 35th AAAI Conference on Artificial Intelligence / 33rd Conference on Innovative Applications of Artificial Intelligence / 11th Symposium on Educational Advances in Artificial Intelligence, Online, 2–9 February 2021. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v35i3.16286
VTNET: visual transformer network for object goal navigation

Du, Heming, Yu, Xin and Zheng, Liang (2021). VTNET: visual transformer network for object goal navigation. 9th International Conference on Learning Representations, Virtual, 3-7 May 2021. Appleton, WI United States: International Conference on Learning Representations.
A general approach to state refinement

Kennedy, Gerard, Gao, Jin, Zhuang, Zheyu, Yu, Xin and Mahony, Robert (2021). A general approach to state refinement. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Virtual, 27 September - 1 October 2021. Piscataway, NJ, United States: IEEE. doi: 10.1109/IROS51168.2021.9636400
ARVo: learning all-range volumetric correspondence for video deblurring

Li, Dongxu, Xu, Chenchen, Zhang, Kaihao, Yu, Xin, Zhong, Yiran, Ren, Wenqi, Suominen, Hanna and Li, Hongdong (2021). ARVo: learning all-range volumetric correspondence for video deblurring. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 19-25 June 2021. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR46437.2021.00763
DSC-PoseNet: learning 6DoF object pose estimation via dual-scale consistency

Yang, Zongxin, Yu, Xin and Yang, Yi (2021). DSC-PoseNet: learning 6DoF object pose estimation via dual-scale consistency. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 19-25 June 2021. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR46437.2021.00390
Few-shot Weighted Style Matching for Glaucoma Detection

Liu, Jinhui and Yu, Xin (2021). Few-shot Weighted Style Matching for Glaucoma Detection. First CAAI International Conference, CICAI 2021, Hangzhou, China, 5–6 June 2021. Cham, Switzerland: Springer. doi: 10.1007/978-3-030-93046-2_25
Gait recognition via effective global-local feature representation and local temporal aggregation

Lin, Beibei, Zhang, Shunli and Yu, Xin (2021). Gait recognition via effective global-local feature representation and local temporal aggregation. 18th IEEE/CVF International Conference on Computer Vision (ICCV), Virtual, 11-17 October 2021. New York, NY, United States: IEEE. doi: 10.1109/ICCV48922.2021.01438
Joint 3D human shape recovery and pose estimation from a single image with bilayer graph

Yu, Xin, van Baar, Jeroen and Chen, Siheng (2021). Joint 3D human shape recovery and pose estimation from a single image with bilayer graph. 9th International Conference on 3D Vision (3DV), London, United Kingdom, 1-3 December 2021. Piscataway, NJ United States: IEEE Computer Society. doi: 10.1109/3DV53792.2021.00060
PR-RRN: pairwise-regularized residual-recursive networks for non-rigid structure-from-motion

Zeng, Haitian, Dai, Yuchao, Yu, Xin, Wang, Xiaohan and Yang, Yi (2021). PR-RRN: pairwise-regularized residual-recursive networks for non-rigid structure-from-motion. 18th IEEE/CVF International Conference on Computer Vision (ICCV), Virtual, 11-17 October 2021. New York, NY, United States: IEEE. doi: 10.1109/ICCV48922.2021.00555
PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences

Fan, Hehe, Yu, Xin, Ding, Yuhang, Yang, Yi and Kankanhalli, Mohan (2021). PSTNet: Point Spatio-Temporal Convolution on Point Cloud Sequences. International Conference on Learning Representations, ICLR.
RFNet: region-aware fusion network for incomplete multi-modal brain tumor segmentation

Ding, Yuhang, Yu, Xin and Yang, Yi (2021). RFNet: region-aware fusion network for incomplete multi-modal brain tumor segmentation. 18th IEEE/CVF International Conference on Computer Vision (ICCV), Virtual, 11-17 October 2021. New York, NY, United States: IEEE. doi: 10.1109/ICCV48922.2021.00394
RGB-D saliency detection via cascaded mutual information minimization

Zhang, Jing, Fan, Deng-Ping, Dai, Yuchao, Yu, Xin, Zhong, Yiran, Barnes, Nick and Shao, Ling (2021). RGB-D saliency detection via cascaded mutual information minimization. 18th IEEE/CVF International Conference on Computer Vision (ICCV), Virtual, 11-17 October 2021. New York, NY, United States: IEEE. doi: 10.1109/ICCV48922.2021.00430
Removing raindrops and rain streaks in one go

Quan, Ruijie, Yu, Xin, Liang, Yuanzhi and Yang, Yi (2021). Removing raindrops and rain streaks in one go. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 19-25 June 2021. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR46437.2021.00903
Self-supervised visibility learning for novel view synthesis

Shi, Yujiao, Li, Hongdong and Yu, Xin (2021). Self-supervised visibility learning for novel view synthesis. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 19-25 June 2021. Washington, DC, United States: IEEE Computer Society. doi: 10.1109/CVPR46437.2021.00955
Super-resolving cross-domain face miniatures by peeking at one-shot exemplar

Li, Peike, Yu, Xin and Yang, Yi (2021). Super-resolving cross-domain face miniatures by peeking at one-shot exemplar. 18th IEEE/CVF International Conference on Computer Vision (ICCV), Virtual, 11-17 October 2021. New York, NY, United States: IEEE. doi: 10.1109/ICCV48922.2021.00443
TSPNet: hierarchical feature learning via temporal semantic pyramid for sign language translation

Li, Dongxu, Xu, Chenchen, Yu, Xin, Zhang, Kaihao, Swift, Ben, Suominen, Hanna and Li, Hongdong (2020). TSPNet: hierarchical feature learning via temporal semantic pyramid for sign language translation. 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada, 6-12 December 2020. Maryland Heights, MO United States: Morgan Kaufmann Publishers.
6DoF object pose estimation via differentiable proxy voting regularizer

Yu, Xin, Zhuang, Zheyu, Koniusz, Piotr and Li, Hongdong (2020). 6DoF object pose estimation via differentiable proxy voting regularizer. 31st British Machine Vision Conference, Virtual, 7-10 September 2020. Bath, United Kingdom: British Machine Vision Association, BMVA.
Copy and Paste GAN: Face Hallucination From Shaded Thumbnails

Zhang, Yang, Tsang, Ivor W., Luo, Yawei, Hu, Chang-Hui, Lu, Xiaobo and Yu, Xin (2020). Copy and Paste GAN: Face Hallucination From Shaded Thumbnails. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA United States, 13-19 June 2020. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvpr42600.2020.00738
Weakly-Supervised Salient Object Detection via Scribble Annotations

Zhang, Jing, Yu, Xin, Li, Aixuan, Song, Peipei, Liu, Bowen and Dai, Yuchao (2020). Weakly-Supervised Salient Object Detection via Scribble Annotations. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA United States, 13-19 June 2020. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvpr42600.2020.01256
Going Beyond Real Data: A Robust Visual Representation for Vehicle Re-identification

Zheng, Zhedong, Jiang, Minyue, Wang, Zhigang, Wang, Jian, Bai, Zechen, Zhang, Xuanmeng, Yu, Xin, Tan, Xiao, Yang, Yi, Wen, Shilei and Ding, Errui (2020). Going Beyond Real Data: A Robust Visual Representation for Vehicle Re-identification. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Online, 14-19 June 2020. Los Alamitos, CA United States: IEEE Computer Society. doi: 10.1109/CVPRW50498.2020.00307
LyRN (Lyapunov Reaching Network): A Real-Time Closed Loop approach from Monocular Vision

Zhuang, Zheyu, Yu, Xin and Mahony, Robert (2020). LyRN (Lyapunov Reaching Network): A Real-Time Closed Loop approach from Monocular Vision. 2020 IEEE International Conference on Robotics and Automation (ICRA), Online, 31 May - 15 June 2020. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/icra40945.2020.9196781
Optimal Feature Transport for Cross-View Image Geo-Localization

Shi, Yujiao, Yu, Xin, Liu, Liu, Zhang, Tong and Li, Hongdong (2020). Optimal Feature Transport for Cross-View Image Geo-Localization. 34th AAAI Conference on Artificial Intelligence / 32nd Innovative Applications of Artificial Intelligence Conference / 10th AAAI Symposium on Educational Advances in Artificial Intelligence, New York, NY United States, 7-12 February 2020. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence.
Transferring Cross-domain Knowledge for Video Sign Language Recognition

Li, Dongxu, Yu, Xin, Xu, Chenchen, Petersson, Lars and Li, Hongdong (2020). Transferring Cross-domain Knowledge for Video Sign Language Recognition. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Online, 14-19 June 2020. New York, NY United States: IEEE. doi: 10.1109/CVPR42600.2020.00624
When Humans Meet Machines: Towards Efficient Segmentation Networks

Li, Peike, Dong, Xuanyi, Yu, Xin and Yang, Yi (2020). When Humans Meet Machines: Towards Efficient Segmentation Networks. British Machine Vision Association, BMVA.
Where am I looking at? Joint Location and Orientation Estimation by Cross-View Matching

Shi, Yujiao, Yu, Xin, Campbell, Dylan and Li, Hongdong (2020). Where am I looking at? Joint Location and Orientation Estimation by Cross-View Matching. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Online, 14-19 June 2020. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/CVPR42600.2020.00412
Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison

Li, Dongxu, Opazo, Cristian Rodriguez, Yu, Xin and Li, Hongdong (2020). Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison. IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Snowmass, CO United States, 1-5 March 2020. Los Alamitos, CA United States: IEEE Computer Society.
Recovering Faces from Portraits with Auxiliary Facial Attributes

Shiri, Fatemeh, Yu, Xin, Porikli, Fatih, Hartley, Richard and Koniusz, Piotr (2019). Recovering Faces from Portraits with Auxiliary Facial Attributes. 19th IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa Village, HI United States, 7-11 January 2019. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/WACV.2019.00049
Bringing a Blurry Frame Alive at High Frame-Rate with an Event Camera

Pan, Liyuan, Scheerlinck, Cedric, Yu, Xin, Hartley, Richard, Liu, Miaomiao and Dai, Yuchao (2019). Bringing a Blurry Frame Alive at High Frame-Rate with an Event Camera. 32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA United States, 16-20 June 2019. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/CVPR.2019.00698
SOSNet: Second Order Similarity Regularization for Local Descriptor Learning

Tian, Yurun, Yu, Xin, Fan, Bin, Wu, Fuchao, Heijnen, Huub and Balntas, Vassileios (2019). SOSNet: Second Order Similarity Regularization for Local Descriptor Learning. 32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA United States, 16-20 June 2019. New York, NY United States: IEEE. doi: 10.1109/CVPR.2019.01127
Spatial-Aware Feature Aggregation for Cross-View Image based Geo-Localization

Shi, Yujiao, Liu, Liu, Yu, Xin and Li, Hongdong (2019). Spatial-Aware Feature Aggregation for Cross-View Image based Geo-Localization. 33rd Conference on Neural Information Processing Systems (NeurIPS), Vancouver, Canada, 8-14 December 2019. La Jolla, CA United States: Neural Information Processing Systems (NIPS).
Unsupervised Extraction of Local Image Descriptors via Relative Distance Ranking Loss

Yu, Xin, Tian, Yurun, Porikli, Fatih, Hartley, Richard, Li, Hongdong, Heijnen, Huub and Balntas, Vassileios (2019). Unsupervised Extraction of Local Image Descriptors via Relative Distance Ranking Loss. IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea, 27 October - 2 November 2019. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/ICCVW.2019.00351
Face Super-Resolution Guided by Facial Component Heatmaps

Yu, Xin, Fernando, Basura, Ghanem, Bernard, Porikli, Fatih and Hartley, Richard (2018). Face Super-Resolution Guided by Facial Component Heatmaps. 15th European Conference on Computer Vision (ECCV), Munich, Germany, 8-14 September 2018. Cham, Switzerland: Springer International Publishing AG. doi: 10.1007/978-3-030-01240-3_14
Identity-preserving Face Recovery from Portraits

Shiri, Fatemeh, Yu, Xin, Porikli, Fatih, Hartley, Richard and Koniusz, Piotr (2018). Identity-preserving Face Recovery from Portraits. 18th IEEE Winter Conference on Applications of Computer Vision (WACV), NV United States, 12-15 March 2018. New York, NY United States: IEEE. doi: 10.1109/WACV.2018.00018
Learning Strict Identity Mappings in Deep Residual Networks

Yu, Xin, Yu, Zhiding and Ramalingam, Srikumar (2018). Learning Strict Identity Mappings in Deep Residual Networks. 31st IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT United States, 18-23 June 2018. New York, NY United States: IEEE. doi: 10.1109/CVPR.2018.00466
Super-Resolving Very Low-Resolution Face Images with Supplementary Attributes

Yu, Xin, Fernando, Basura, Hartley, Richard and Porikli, Fatih (2018). Super-Resolving Very Low-Resolution Face Images with Supplementary Attributes. 31st IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT United States, 18-23 June 2018. New York, NY United States: IEEE. doi: 10.1109/CVPR.2018.00101
Face Destylization

Shiri, Fatemeh, Yu, Xin, Koniusz, Piotr and Porikli, Fatih (2017). Face Destylization. International Conference on Digital Image Computing - Techniques and Applications (DICTA), Sydney, Australia, 29 November - 1 December 2017. New York, NY United States: IEEE.
Face Hallucination with Tiny Unaligned Images by Transformative Discriminative Neural Networks

Yu, Xin and Porikli, Fatih (2017). Face Hallucination with Tiny Unaligned Images by Transformative Discriminative Neural Networks. 31st AAAI Conference on Artificial Intelligence, San Francisco, CA United States, 4-9 February 2017. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence.
Hallucinating Very Low-Resolution Unaligned and Noisy Face Images by Transformative Discriminative Autoencoders

Yu, Xin and Porikli, Fatih (2017). Hallucinating Very Low-Resolution Unaligned and Noisy Face Images by Transformative Discriminative Autoencoders. 30th IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI United States, 21-26 July 2017. New York, NY United States: IEEE. doi: 10.1109/CVPR.2017.570
Ultra-Resolving Face Images by Discriminative Generative Networks

Yu, Xin and Porikli, Fatih (2016). Ultra-Resolving Face Images by Discriminative Generative Networks. 14th European Conference on Computer Vision (ECCV), Amsterdam, Netherlands, 8-16 October 2016. Heidelberg, Germany: Springer. doi: 10.1007/978-3-319-46454-1_20

Grants (Administered at UQ)

Breaking the Communication Barrier for the Australian Deaf Community: Vision Based Australian Sign Language Translation and Production

(2024) Google Inc
Breaking the Communication Barrier for the Australian Deaf Community: Vision Based Australian Sign Language Translation and Production

(2023–2028) Google Asia Pacific Pte Ltd
Analytics for the Australian Grains Industry (AAGI)

(2023–2027) Grains Research & Development Corporation
Advancing Human Perception: Countering Evolving Malicious Fake Visual Data

(2023–2026) ARC Discovery Early Career Researcher Award
Developing applications of satellite imagery for modelling environmental and social impacts of climate change on seaweed farming in Indonesia (KONEKSI Grant administered by Griffith University)

(2023–2024) Griffith University

PhD and MPhil Supervision

Current Supervision

Two way Auslan Translation

Doctor Philosophy — Principal Advisor
Other advisors:
- Dr Mahsa Baktashmotlagh
The prediction, diagnosis, and severity estimation models for plant disease

Doctor Philosophy — Principal Advisor
Other advisors:
- Associate Professor Sen Wang
Digital Asset IP Protection

Doctor Philosophy — Principal Advisor
Other advisors:
- Dr Mahsa Baktashmotlagh
Enhancing Building Fire Safety by Utilising Machine Learning Techniques

Doctor Philosophy — Principal Advisor
Other advisors:
- Professor Brian Lovell
Towards Efficient Pest Detection in Agriculture

Doctor Philosophy — Principal Advisor
Other advisors:
- Associate Professor Sen Wang
Advancing Human Perception: Countering Evolving Malicious Fake Visual Data

Doctor Philosophy — Principal Advisor
Other advisors:
- Honorary Professor Zhi-Gang Chen
Combating evolving deceptive fake visual information through deepfake detection

Doctor Philosophy — Principal Advisor
Other advisors:
- Associate Professor Sen Wang
Two way Auslan Translation

Doctor Philosophy — Principal Advisor
Other advisors:
- Professor Helen Huang
Data driven approaches for smart farming

Doctor Philosophy — Associate Advisor
Other advisors:
- Professor Helen Huang
- Honorary Professor Zhi-Gang Chen
Remote Sensing Analysis in computer vision

Doctor Philosophy — Associate Advisor
Other advisors:
- Professor Helen Huang
