My name is Xin Yu, a Senior Lecturer at the University of Queensland. I am an Australian Research Council Discovery Early Career Researcher Award 2023-2025 (DECRA) recipient and an awardee of the prestigious Google Research Scholar Program in 2021. Previously, I was a research fellow at the Australian National University (ANU). I received my PhD degree from the Australian National Unversity under the supervision of Prof. Richard Hartley, Prof. Fatih Porikli and Dr. Basura Fernando. I also received a PhD degree from Tsinghua University supervised by Prof. Li Zhang. I am interested in Computer Vision and Machine Learning topics.
My research topics includes various computer vision and machine learning tasks, especially in efficient low-level image processing, image retrieval and localization, action recognition, 3D pose estimation, visual navigation and sign language recognition and translation.
One of my research papers has been awarded "Best Paper Honorable Mention" award in the premium computer vision conference WACV 2020, and one paper has been nominated for the Best Paper Award in CVPR 2020.
I was awarded the Outstanding Reviewer Award in ECCV 2020, CVPR 2021 and ICCV 2021. CVPR, ICCV and ECCV are internationally world-leading computer vision and machine learning conferences. My research interests include deep learning techniques, image processing, and computer vision tasks. I am a program committee member of top-tier computer vision and machine learning conferences, such as CVPR, ICCV, ECCV, ICML, ICLR and NeurIPS, and a reviewer of prestigious journals, such as TPAMI, IJCV and TIP.
I am happy to supervise self-motivated PhD and MPhil students. If you are an undergraduate student and willing to conduct your honour project, please drop me an email.
Journal Article: AI empowered Auslan learning for parents of deaf children and children of deaf adults
Sheng, Hongwei, Shen, Xin, Du, Heming, Zhang, Hu, Huang, Zi and Yu, Xin (2024). AI empowered Auslan learning for parents of deaf children and children of deaf adults. AI and Ethics, 1-11. doi: 10.1007/s43681-024-00457-y
Journal Article: Detecting facial action units from global-local fine-grained expressions
Zhang, Wei, Li, Lincheng, Ding, Yu, Chen, Wei, Deng, Zhigang and Yu, Xin (2024). Detecting facial action units from global-local fine-grained expressions. IEEE Transactions on Circuits and Systems for Video Technology, 34 (2), 983-994. doi: 10.1109/tcsvt.2023.3288903
Journal Article: CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields From Imperfect Camera Poses
Fu, Hongyu, Yu, Xin, Li, Lincheng and Zhang, Li (2024). CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields From Imperfect Camera Poses. IEEE Transactions on Multimedia, 1-12. doi: 10.1109/tmm.2024.3388929
(2024) Google Inc
(2023–2028) Google Asia Pacific Pte Ltd
Analytics for the Australian Grains Industry (AAGI)
(2023–2027) Grains Research & Development Corporation
Two way Auslan Translation
Doctor Philosophy
The prediction, diagnosis, and severity estimation models for plant disease
Doctor Philosophy
Digital Asset IP Protection
Doctor Philosophy
Leaping from 2D Detection to Efficient 6DoF Object Pose Estimation
Liu, Jinhui, Zou, Zhikang, Ye, Xiaoqing, Tan, Xiao, Ding, Errui, Xu, Feng and Yu, Xin (2020). Leaping from 2D Detection to Efficient 6DoF Object Pose Estimation. Computer Vision – ECCV 2020 Workshops. (pp. 707-714) Cham: Springer International Publishing. doi: 10.1007/978-3-030-66096-3_47
Learning Object Relation Graph and Tentative Policy for Visual Navigation
Du, Heming, Yu, Xin and Zheng, Liang (2020). Learning Object Relation Graph and Tentative Policy for Visual Navigation. Computer Vision – ECCV 2020. (pp. 19-34) Cham: Springer International Publishing. doi: 10.1007/978-3-030-58571-6_2
AI empowered Auslan learning for parents of deaf children and children of deaf adults
Sheng, Hongwei, Shen, Xin, Du, Heming, Zhang, Hu, Huang, Zi and Yu, Xin (2024). AI empowered Auslan learning for parents of deaf children and children of deaf adults. AI and Ethics, 1-11. doi: 10.1007/s43681-024-00457-y
Detecting facial action units from global-local fine-grained expressions
Zhang, Wei, Li, Lincheng, Ding, Yu, Chen, Wei, Deng, Zhigang and Yu, Xin (2024). Detecting facial action units from global-local fine-grained expressions. IEEE Transactions on Circuits and Systems for Video Technology, 34 (2), 983-994. doi: 10.1109/tcsvt.2023.3288903
CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields From Imperfect Camera Poses
Fu, Hongyu, Yu, Xin, Li, Lincheng and Zhang, Li (2024). CBARF: Cascaded Bundle-Adjusting Neural Radiance Fields From Imperfect Camera Poses. IEEE Transactions on Multimedia, 1-12. doi: 10.1109/tmm.2024.3388929
CMGNet: Collaborative multi-modal graph network for video captioning
Rao, Qi, Yu, Xin, Li, Guang and Zhu, Linchao (2024). CMGNet: Collaborative multi-modal graph network for video captioning. Computer Vision and Image Understanding, 238 103864, 1-10. doi: 10.1016/j.cviu.2023.103864
MarkerNet: A divide-and-conquer solution to motion capture solving from raw markers
Hu, Zhipeng, Tang, Jilin, Li, Lincheng, Hou, Jie, Xin, Haoran, Yu, Xin and Bu, Jiajun (2024). MarkerNet: A divide-and-conquer solution to motion capture solving from raw markers. Computer Animation and Virtual Worlds, 35 (1). doi: 10.1002/cav.2228
StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads
Wang, Suzhen, Ma, Yifeng, Ding, Yu, Hu, Zhipeng, Fan, Changjie, Lv, Tangjie, Deng, Zhidong and Yu, Xin (2024). StyleTalk++: A Unified Framework for Controlling the Speaking Styles of Talking Heads. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1-17. doi: 10.1109/tpami.2024.3357808
DMMG: Dual min-max games for self-supervised skeleton-based action recognition
Guan, Shannan, Yu, Xin, Huang, Wei, Fang, Gengfa and Lu, Haiyan (2023). DMMG: Dual min-max games for self-supervised skeleton-based action recognition. IEEE Transactions on Image Processing, 33, 395-407. doi: 10.1109/tip.2023.3338410
Yu, Xin, Yang, Qi, Zhou, Yinchi, Cai, Leon Y., Gao, Riqiang, Lee, Ho Hin, Li, Thomas, Bao, Shunxing, Xu, Zhoubing, Lasko, Thomas A., Abramson, Richard G., Zhang, Zizhao, Huo, Yuankai, Landman, Bennett A. and Tang, Yucheng (2023). UNesT: Local spatial representation learning with hierarchical transformer for efficient medical segmentation. Medical Image Analysis, 90 102939, 1-15. doi: 10.1016/j.media.2023.102939
Deep idempotent network for efficient single image blind deblurring
Mao, Yuxin, Wan, Zhexiong, Dai, Yuchao and Yu, Xin (2023). Deep idempotent network for efficient single image blind deblurring. IEEE Transactions on Circuits and Systems for Video Technology, 33 (1), 172-185. doi: 10.1109/tcsvt.2022.3202361
Single slice thigh CT muscle group segmentation with domain adaptation and self-training
Yang, Qi, Yu, Xin, Lee, Ho Hin, Cai, Leon Y., Xu, Kaiwen, Bao, Shunxing, Huo, Yuankai, Moore, Ann Zenobia, Makrogiannis, Sokratis, Ferrucci, Luigi and Landman, Bennett A (2023). Single slice thigh CT muscle group segmentation with domain adaptation and self-training. Journal of Medical Imaging, 10 (4) 044001, 1-12. doi: 10.1117/1.JMI.10.4.044001
A consensus protocol for functional connectivity analysis in the rat brain
Grandjean, Joanes, Desrosiers-Gregoire, Gabriel, Anckaerts, Cynthia, Angeles-Valdez, Diego, Ayad, Fadi, Barrière, David A., Blockx, Ines, Bortel, Aleksandra, Broadwater, Margaret, Cardoso, Beatriz M., Célestine, Marina, Chavez-Negrete, Jorge E., Choi, Sangcheon, Christiaen, Emma, Clavijo, Perrin, Colon-Perez, Luis, Cramer, Samuel, Daniele, Tolomeo, Dempsey, Elaine, Diao, Yujian, Doelemeyer, Arno, Dopfel, David, Dvořáková, Lenka, Falfán-Melgoza, Claudia, Fernandes, Francisca F., Fowler, Caitlin F., Fuentes-Ibañez, Antonio, Garin, Clément, Gelderman, Eveline ... Hess, Andreas (2023). A consensus protocol for functional connectivity analysis in the rat brain. Nature Neuroscience, 26 (4), 673-681. doi: 10.1038/s41593-023-01286-8
Accurate 3-DoF camera geo-localization via ground-to-satellite image matching
Shi, Yujiao, Yu, Xin, Liu, Liu, Campbell, Dylan, Koniusz, Piotr and Li, Hongdong (2023). Accurate 3-DoF camera geo-localization via ground-to-satellite image matching. IEEE transactions on pattern analysis and machine intelligence, 45 (3), 2682-2697. doi: 10.1109/TPAMI.2022.3189702
Boosting model inversion attacks with adversarial examples
Zhou, Shuai, Zhu, Tianqing, Ye, Dayong, Yu, Xin and Zhou, Wanlei (2023). Boosting model inversion attacks with adversarial examples. IEEE Transactions on Dependable and Secure Computing, 1-18. doi: 10.1109/TDSC.2023.3285015
Calligraphy Font Generation via Explicitly Modeling Location-aware Glyph Component Deformations
Zhao, Minda, Qi, Xingqun, Hu, Zhipeng, Li, Lincheng, Zhang, Yongqiang, Huang, Zi and Yu, Xin (2023). Calligraphy Font Generation via Explicitly Modeling Location-aware Glyph Component Deformations. IEEE Transactions on Multimedia, 26, 1-13. doi: 10.1109/tmm.2023.3342690
Cyclic self-training with proposal weight modulation for cross-supervised object detection
Xu, Yunqiu, Zhou, Chunluan, Yu, Xin and Yang, Yi (2023). Cyclic self-training with proposal weight modulation for cross-supervised object detection. IEEE Transactions on Image Processing, 32, 1992-2002. doi: 10.1109/TIP.2023.3261752
HairStyle editing via parametric controllable strokes
Song, Xinhui, Liu, Chen, Zheng, Youyi, Feng, Zunlei, Li, Lincheng, Zhou, Kun and Yu, Xin (2023). HairStyle editing via parametric controllable strokes. IEEE Transactions on Visualization and Computer Graphics, 1-14. doi: 10.1109/TVCG.2023.3241894
Deep hierarchical representation of point cloud videos via spatio-temporal decomposition
Fan, Hehe, Yu, Xin, Yang, Yi and Kankanhalli, Mohan (2022). Deep hierarchical representation of point cloud videos via spatio-temporal decomposition. IEEE Transactions On Pattern Analysis and Machine Intelligence, 44 (12), 9918-9930. doi: 10.1109/TPAMI.2021.3135117
Geometry-guided street-view panorama synthesis from satellite imagery
Shi, Yujiao, Campbell, Dylan, Yu, Xin and Li, Hongdong (2022). Geometry-guided street-view panorama synthesis from satellite imagery. IEEE Transactions On Pattern Analysis and Machine Intelligence, 44 (12), 10009-10022. doi: 10.1109/TPAMI.2022.3140750
Single image based 3D human pose estimation via uncertainty learning
Han, Chuchu, Yu, Xin, Gao, Changxin, Sang, Nong and Yang, Yi (2022). Single image based 3D human pose estimation via uncertainty learning. Pattern Recognition, 132 108934. doi: 10.1016/j.patcog.2022.108934
Recursive copy and paste GAN: face hallucination from shaded thumbnails
Zhang, Yang, Tsang, Ivor W., Luo, Yawei, Hu, Changhui, Lu, Xiaobo and Yu, Xin (2022). Recursive copy and paste GAN: face hallucination from shaded thumbnails. IEEE Transactions On Pattern Analysis and Machine Intelligence, 44 (8), 4321-4338. doi: 10.1109/TPAMI.2021.3061312
High frame rate video reconstruction based on an event camera
Pan, Liyuan, Hartley, Richard, Scheerlinck, Cedric, Liu, Miaomiao, Yu, Xin and Dai, Yuchao (2022). High frame rate video reconstruction based on an event camera. IEEE Transactions On Pattern Analysis and Machine Intelligence, 44 (5), 2519-2533. doi: 10.1109/TPAMI.2020.3036667
Single-image deraining via recurrent residual multiscale networks
Zheng, Yupei, Yu, Xin, Liu, Miaomiao and Zhang, Shunli (2022). Single-image deraining via recurrent residual multiscale networks. IEEE Transactions On Neural Networks and Learning Systems, 33 (3), 1310-1323. doi: 10.1109/TNNLS.2020.3041752
Pro-UIGAN: progressive face hallucination from occluded thumbnails
Zhang, Yang, Yu, Xin, Lu, Xiaobo and Liu, Ping (2022). Pro-UIGAN: progressive face hallucination from occluded thumbnails. IEEE Transactions On Image Processing, 31, 3236-3250. doi: 10.1109/TIP.2022.3167280
Understanding atomic hand-object interaction with human intention
Fan, Hehe, Zhuo, Tao, Yu, Xin, Yang, Yi and Kankanhalli, Mohan (2022). Understanding atomic hand-object interaction with human intention. IEEE Transactions On Circuits and Systems for Video Technology, 32 (1), 275-285. doi: 10.1109/TCSVT.2021.3058688
Xu, Yunqiu, Yu, Xin, Zhang, Jing, Zhu, Linchao and Wang, Dadong (2022). Weakly supervised RGB-D salient object detection with prediction consistency training and active scribble boosting. IEEE Transactions On Image Processing, 31, 2148-2161. doi: 10.1109/TIP.2022.3151999
Learning with noisy labels via self-reweighting from class centroids
Ma, Fan, Wu, Yu, Yu, Xin and Yang, Yi (2021). Learning with noisy labels via self-reweighting from class centroids. IEEE Transactions On Neural Networks and Learning Systems, 33 (11), 6275-6285. doi: 10.1109/TNNLS.2021.3073248
Progressive transfer learning for face anti-spoofing
Quan, Ruijie, Wu, Yu, Yu, Xin and Yang, Yi (2021). Progressive transfer learning for face anti-spoofing. IEEE Transactions on Image Processing, 30, 3946-3955. doi: 10.1109/TIP.2021.3066912
Face hallucination with finishing touches
Zhang, Yang, Tsang, Ivor W., Li, Jun, Liu, Ping, Lu, Xiaobo and Yu, Xin (2021). Face hallucination with finishing touches. IEEE Transactions On Image Processing, 30 9318504, 1728-1743. doi: 10.1109/TIP.2020.3046918
Xu, Yunqiu, Zhou, Chunluan, Yu, Xin, Xiao, Bin and Yang, Yi (2021). Pyramidal multiple instance detection network with mask guided self-correction for weakly supervised object detection. IEEE Transactions On Image Processing, 30, 3029-3040. doi: 10.1109/TIP.2021.3056887
Single Image Portrait Relighting via Explicit Multiple Reflectance Channel Modeling
Wang, Zhibo, Yu, Xin, Lu, Ming, Wang, Quan, Qian, Chen and Xu, Feng (2020). Single Image Portrait Relighting via Explicit Multiple Reflectance Channel Modeling. Acm Transactions On Graphics, 39 (6). doi: 10.1145/3414685.3417824
Yu, Xin, Fernando, Basura, Hartley, Richard and Porikli, Fatih (2020). Semantic Face Hallucination: Super-Resolving Very Low-Resolution Face Images with Supplementary Attributes. IEEE Transactions On Pattern Analysis and Machine Intelligence, 42 (11), 2926-2943. doi: 10.1109/TPAMI.2019.2916881
Can We See More? Joint Frontalization and Hallucination of Unaligned Tiny Faces
Yu, Xin, Shiri, Fatemeh, Ghanem, Bernard and Porikli, Fatih (2020). Can We See More? Joint Frontalization and Hallucination of Unaligned Tiny Faces. Ieee Transactions On Pattern Analysis and Machine Intelligence, 42 (9) 8704962, 2148-2164. doi: 10.1109/TPAMI.2019.2914039
Hallucinating Unaligned Face Images by Multiscale Transformative Discriminative Networks
Yu, Xin, Porikli, Fatih, Fernando, Basura and Hartley, Richard (2019). Hallucinating Unaligned Face Images by Multiscale Transformative Discriminative Networks. International Journal of Computer Vision, 128 (2), 500-526. doi: 10.1007/s11263-019-01254-5
Identity-Preserving Face Recovery from Stylized Portraits
Shiri, Fatemeh, Yu, Xin, Porikli, Fatih, Hartley, Richard and Koniusz, Piotr (2019). Identity-Preserving Face Recovery from Stylized Portraits. International Journal of Computer Vision, 127 (6-7), 863-883. doi: 10.1007/s11263-019-01169-1
Single Image Depth Estimation With Normal Guided Scale Invariant Deep Convolutional Fields
Yan, Han, Yu, Xin, Zhang, Yu, Zhang, Shunli, Zhao, Xiaolin and Zhang, Li (2019). Single Image Depth Estimation With Normal Guided Scale Invariant Deep Convolutional Fields. IEEE Transactions On Circuits and Systems for Video Technology, 29 (1) 8105853, 80-92. doi: 10.1109/TCSVT.2017.2772892
Imagining the Unimaginable Faces by Deconvolutional Networks
Yu, Xin and Porikli, Fatih (2018). Imagining the Unimaginable Faces by Deconvolutional Networks. Ieee Transactions On Image Processing, 27 (6), 2747-2761. doi: 10.1109/TIP.2018.2808840
PMSC: PatchMatch-Based Superpixel Cut for Accurate Stereo Matching
Li, Lincheng, Zhang, Shunli, Yu, Xin and Zhang, Li (2018). PMSC: PatchMatch-Based Superpixel Cut for Accurate Stereo Matching. Ieee Transactions On Circuits and Systems for Video Technology, 28 (3), 679-692. doi: 10.1109/TCSVT.2016.2628782
3D cost aggregation with multiple minimum spanning trees for stereo matching
Li, Lincheng, Yu, Xin, Zhang, Shunli, Zhao, Xiaolin and Zhang, Li (2017). 3D cost aggregation with multiple minimum spanning trees for stereo matching. Applied Optics, 56 (12), 3411-3420. doi: 10.1364/AO.56.003411
Multi-local-task learning with global regularization for object tracking
Zhang, Shunli, Sui, Yao, Zhao, Sicong, Yu, Xin and Zhang, Li (2015). Multi-local-task learning with global regularization for object tracking. Pattern Recognition, 48 (12), 3881-3894. doi: 10.1016/j.patcog.2015.06.005
Sui, Yao, Zhao, Xiaolin, Zhang, Shunli, Yu, Xin, Zhao, Sicong and Zhang, Li (2015). Self-expressive tracking. Pattern Recognition, 48 (9), 2872-2884. doi: 10.1016/j.patcog.2015.03.007
Hybrid support vector machines for robust object tracking
Zhang, Shunli, Sui, Yao, Yu, Xin, Zhao, Sicong and Zhang, Li (2015). Hybrid support vector machines for robust object tracking. Pattern Recognition, 48 (8), 2474-2488. doi: 10.1016/j.patcog.2015.02.008
Object Tracking With Multi-View Support Vector Machines
Zhang, Shunli, Yu, Xin, Sui, Yao, Zhao, Sicong and Zhang, Li (2015). Object Tracking With Multi-View Support Vector Machines. Ieee Transactions On Multimedia, 17 (3), 265-278. doi: 10.1109/TMM.2015.2390044
Removing blur kernel noise via a hybrid l(p) norm
Yu, Xin, Zhang, Shunli, Zhao, Xiaolin and Zhang, Li (2015). Removing blur kernel noise via a hybrid l(p) norm. Journal of Electronic Imaging, 24 (1). doi: 10.1117/1.JEI.24.1.013011
Handling noise in single image defocus map estimation by using directional filters
Yu, Xin, Zhao, Xiaolin, Sui, Yao and Zhang, Li (2014). Handling noise in single image defocus map estimation by using directional filters. Optics Letters, 39 (21), 6281-6284. doi: 10.1364/OL.39.006281
Efficient Patch-Wise Non-Uniform Deblurring for a Single Image
Yu, Xin, Xu, Feng, Zhang, Shunli and Zhang, Li (2014). Efficient Patch-Wise Non-Uniform Deblurring for a Single Image. Ieee Transactions On Multimedia, 16 (6), 1510-1524. doi: 10.1109/TMM.2014.2321734
Non-rigid Object Tracking as Salient Region Segmentation and Association
Zhao, Xiaolin, Yu, Xin, Sun, Liguo, Hu, Kangqiao, Wang, Guijin and Zhang, Li (2011). Non-rigid Object Tracking as Salient Region Segmentation and Association. Ieice Transactions On Information and Systems, E94D (4), 934-937. doi: 10.1587/transinf.E94.D.934
Learning efficient unsupervised satellite image-based building damage detection
Zhang, Yiyun, Wang, Zijian, Luo, Yadan, Yu, Xin and Huang, Zi (2023). Learning efficient unsupervised satellite image-based building damage detection. 2023 IEEE International Conference on Data Mining (ICDM), Shanghai, China, 1-4 December 2023. Piscataway, NJ, United States: IEEE. doi: 10.1109/icdm58522.2023.00206
A new perspective of weakly supervised 3D instance segmentation via bounding boxes
Yu, Qingtao, Du, Heming and Yu, Xin (2023). A new perspective of weakly supervised 3D instance segmentation via bounding boxes. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November –1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8388-9_9
Context-based masking for spontaneous venous pulsations detection
Sheng, Hongwei, Yu, Xin, Li, Xue and Golzan, Mojtaba (2023). Context-based masking for spontaneous venous pulsations detection. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November –1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8388-9_42
Toward a unified framework for RGB and RGB-D visual navigation
Du, Heming, Huang, Zi, Chapman, Scott and Yu, Xin (2023). Toward a unified framework for RGB and RGB-D visual navigation. 36th Australasian Joint Conference on Artificial Intelligence, AJCAI 2023, Brisbane, QLD Australia, 28 November –1 December 2023. Singapore: Springer. doi: 10.1007/978-981-99-8391-9_29
Towards reliable and efficient vegetation segmentation for Australian wheat data analysis
Yuan, Bowen, Wang, Zijian and Yu, Xin (2023). Towards reliable and efficient vegetation segmentation for Australian wheat data analysis. 34th Australasian Database Conference (ADC), Melbourne, NSW Australia, 1-3 November 2023. Cham, Switzerland: Springer Cham. doi: 10.1007/978-3-031-47843-7_9
Audio-visual segmentation by exploring cross-modal mutual semantics
Liu, Chen, Li, Peike Patrick, Qi, Xingqun, Zhang, Hu, Li, Lincheng, Wang, Dadong and Yu, Xin (2023). Audio-visual segmentation by exploring cross-modal mutual semantics. MM '23: The 31st ACM International Conference on Multimedia, Ottawa, ON Canada, 29 October - 3 November 2023. New York, NY United States: Association for Computing Machinery. doi: 10.1145/3581783.3612373
DyGait: Exploiting Dynamic Representations for High-performance Gait Recognition
Wang, Ming, Guo, Xianda, Lin, Beibei, Yang, Tian, Zhu, Zheng, Li, Lincheng, Zhang, Shunli and Yu, Xin (2023). DyGait: Exploiting Dynamic Representations for High-performance Gait Recognition. IEEE. doi: 10.1109/iccv51070.2023.01235
Lee, Ho Hin, Liu, Quan, Bao, Shunxing, Yang, Qi, Yu, Xin, Cai, Leon Y., Li, Thomas Z., Huo, Yuankai, Koutsoukos, Xenofon and Landman, Bennett A. (2023). Scaling up 3D Kernels with Bayesian frequency re-parameterization for medical image segmentation. MICCAI 2023 26th International Conference, Vancouver, BC, Canada, 8–12 October 2023. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-43901-8_60
Gait Recognition with Mask-based Regularization
Shen, Chuanfu, Lin, Beibei, Zhang, Shunli, Yu, Xin, Huang, George Q. and Yu, Shiqi (2023). Gait Recognition with Mask-based Regularization. IEEE. doi: 10.1109/ijcb57857.2023.10449112
Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand Disentanglement
Qi, Xingqun, Liu, Chen, Sun, Muyi, Li, Lincheng, Fan, Changjie and Yu, Xin (2023). Diverse 3D Hand Gesture Prediction from Body Dynamics by Bilateral Hand Disentanglement. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 17-24 June 2023. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvpr52729.2023.00448
Sheng, Hongwei, Yu, Xin, Wang, Feiyu, Khan, MD Wahiduzzaman, Weng, Hexuan, Shariflou, Sahar and Golzan, S. Mojtaba (2023). Autonomous Stabilization of Retinal Videos for Streamlining Assessment of Spontaneous Venous Pulsations. 45th Annual International Conference of the IEEE-Engineering-in-Medicine-and-Biology-Society (EMBC), Sydney Australia, Jul 24-27, 2023. NEW YORK: IEEE. doi: 10.1109/embc40787.2023.10341088
NeFII: Inverse Rendering for Reflectance Decomposition with Near-Field Indirect Illumination
Wu, Haoqian, Hu, Zhipeng, Li, Lincheng, Zhang, Yongqiang, Fan, Changjie and Yu, Xin (2023). NeFII: Inverse Rendering for Reflectance Decomposition with Near-Field Indirect Illumination. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, Canada, 17-24 June 2023. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvpr52729.2023.00418
Du, Heming, Li, Lincheng, Huang, Zi and Yu, Xin (2023). Object-goal visual navigation via effective exploration of relations among historical navigation states. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 17-24 June 2023. Piscataway, NJ, United States: IEEE. doi: 10.1109/cvpr52729.2023.00252
Getting Away with More Network Pruning: From Sparsity to Geometry and Linear Regions
Cai, Junyang, Nguyen, Khai-Nguyen, Shrestha, Nishant, Good, Aidan, Tu, Ruisen, Yu, Xin, Zhe, Shandian and Serra, Thiago (2023). Getting Away with More Network Pruning: From Sparsity to Geometry and Linear Regions. 20th International Conference on the Integration of Constraint Programming, Artificial Intelligence, and Operations Research, Nice, France, 29 May – 1 June 2023. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-33271-5_14
Exploring active 3D object detection from a generalization perspective
Luo, Yadan, Chen, Zhuoxiao, Wang, Zijian, Yu, Xin, Huang, Zi and Baktashmotlagh, Mahsa (2023). Exploring active 3D object detection from a generalization perspective. 11th International Conference on Learning Representations (ICLR), Kigali, Rwanda, 1 - 5 May 2023. New York, NY, United States: Cornell Tech. doi: 10.48550/arXiv.2301.09249
Topological-preserving membrane skeleton segmentation in multiplex immunofluorescence imaging
Bao, Shunxing, Cui, Can, Li, Jia, Tang, Yucheng, Lee, Ho Hin, Deng, Ruining, Remedios, Lucas W., Yu, Xin, Yang, Qi, Chiron, Sophie, Patterson, Nathan H., Lau, Ken S., Liu, Qi, Roland, Joseph T., Coburn, Lori A., Wilson, Keith T., Landman, Bennett A. and Huo, Yuankai (2023). Topological-preserving membrane skeleton segmentation in multiplex immunofluorescence imaging. Conference on Medical Imaging - Digital and Computational Pathology, San Diego, CA, United States, 19-23 February 2023. Bellingham, WA, United States: SPIE - International Society for Optical Engineering. doi: 10.1117/12.2654087
A divide-and-conquer solution to 3D human motion estimation from raw MoCap data
Tang, Jilin, Li, Lincheng, Hou, Jie, Xin, Haoran and Yu, Xin (2023). A divide-and-conquer solution to 3D human motion estimation from raw MoCap data. 30th IEEE Conference Virtual Reality and 3D User Interfaces (IEEE VR), Shanghai, China, 25-29 March 2023. Piscataway, NJ United States: IEEE. doi: 10.1109/vrw58643.2023.00226
Sign Spotting via Multi-modal Fusion and Testing Time Transferring
Fu, Hongyu, Liu, Chen, Qi, Xingqun, Lin, Beibei, Li, Lincheng, Zhang, Li and Yu, Xin (2023). Sign Spotting via Multi-modal Fusion and Testing Time Transferring. European Conference on Computer Vision (ECCV 2022), Tel Aviv, Israel, 23–27 October 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-25085-9_16
Deep whole brain segmentation of 7T structural MRI
Ramadass, Karthik, Yu, Xin, Cai, Leon Y., Tang, Yucheng, Bao, Shunxing, Kerley, Cailey, D'Archangel, Micah, Barquero, Laura A., Newton, Allen T., Gauthier, Isabel, McGugin, Rankin Williams, Dawant, Benoit M., Cutting, Laurie E., Huo, Yuankai and Landman, Bennett A. (2023). Deep whole brain segmentation of 7T structural MRI. SPIE Medical Imaging 2023: Image Processing, San Diego, CA United States, 19–23 February 2023. Bellingham, WA United States: SPIE. doi: 10.1117/12.2654108
Longitudinal Variability Analysis on Low-dose Abdominal CT with Deep Learning-based Segmentation
Yu, Xin, Tang, Yucheng, Yang, Qi, Lee, Ho Hin, Gao, Riqiang, Bao, Shunxing, Moore, Ann Zenobia, Ferrucci, Luigi and Landman, Bennett A (2023). Longitudinal Variability Analysis on Low-dose Abdominal CT with Deep Learning-based Segmentation. SPIE Medical Imaging 2023: Image Processing, San Diego, CA United States, 19–23 February 2023. Bellingham, WA United States: SPIE. doi: 10.1117/12.2653762
Proactive deepfake defence via identity watermarking
Zhao, Yuan, Liu, Bo, Ding, Ming, Liu, Baoping, Zhu, Tianqing and Yu, Xin (2023). Proactive deepfake defence via identity watermarking. 23rd IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 3-7 January 2023. Piscataway, NJ United States: IEEE. doi: 10.1109/wacv56688.2023.00458
TI2Net: Temporal identity inconsistency network for deepfake detection
Liu, Baoping, Liu, Bo, Ding, Ming, Zhu, Tianqing and Yu, Xin (2023). TI2Net: Temporal identity inconsistency network for deepfake detection. 23rd IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 3-7 January 2023. Piscataway, NJ United States: IEEE. doi: 10.1109/wacv56688.2023.00467
Weakly-supervised Point Cloud Instance Segmentation with Geometric Priors
Du, Heming, Yu, Xin, Hussain, Farookh, Armin, Mohammad Ali, Petersson, Lars and Li, Weihao (2023). Weakly-supervised Point Cloud Instance Segmentation with Geometric Priors. 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 2-7 January 2023. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/wacv56688.2023.00425
CVLNet: Cross-view Semantic Correspondence Learning for Video-Based Camera Localization
Shi, Yujiao, Yu, Xin, Wang, Shan and Li, Hongdong (2023). CVLNet: Cross-view Semantic Correspondence Learning for Video-Based Camera Localization. 16th Asian Conference on Computer Vision, Macao, China, 4–8 December 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-26319-4_8
Wang, Ming, Lin, Beibei, Guo, Xianda, Li, Lincheng, Zhu, Zheng, Sun, Jiande, Zhang, Shunli, Liu, Yu and Yu, Xin (2023). GaitStrip: Gait Recognition via Effective Strip-Based Feature Representations and Multi-level Framework. 16th Asian Conference on Computer Vision, Macao, China, 4–8 December 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-26316-3_42
Sim2RealVS: A new benchmark for video stabilization with a strong baseline
Rao, Qi, Yu, Xin, Navasardyan, Shant and Shi, Humphrey (2023). Sim2RealVS: A new benchmark for video stabilization with a strong baseline. 23rd IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 3-7 January 2023. Piscataway, NJ United States: IEEE. doi: 10.1109/wacv56688.2023.00537
MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views
Zeng, Haitian, Yu, Xin, Miao, Jiaxu and Yang, Yi (2022). MHR-Net: Multiple-Hypothesis Reconstruction of Non-Rigid Shapes from 2D Views. European Conference on Computer Vision (ECCV 2022), Tel Aviv, Israel, 23-27 October 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-20086-1_1
Instance as identity: a generic online paradigm for video instance segmentation
Zhu, Feng, Yang, Zongxin, Yu, Xin, Yang, Yi and Wei, Yunchao (2022). Instance as identity: a generic online paradigm for video instance segmentation. Computer Vision – ECCV 2022 17th European Conference, Tel Aviv, Israel, 23–27 October 2022. Cham, Switzerland: Springer. doi: 10.1007/978-3-031-19818-2_30
Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields
Yao, Guangming, Wu, Hongzhi, Yuan, Yi, Li, Lincheng, Zhou, Kun and Yu, Xin (2022). Learning Implicit Body Representations from Double Diffusion Based Neural Radiance Fields. Thirty-First International Joint Conference on Artificial Intelligence IJCAI-ECAI 2022, Vienna, Austria, 23-29 July 2022. Los Angeles, CA United States: International Joint Conferences on Artificial Intelligence Organization. doi: 10.24963/ijcai.2022/218
Tang, Tianqi, Du, Heming, Yu, Xin and Yang, Yi (2022). Monocular Camera-Based Point-Goal Navigation by Learning Depth Channel and Cross-Modality Pyramid Fusion. Thirty-Sixth AAAI Conference on Artificial Intelligence, Online, 22 February - 1 March 2022. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v36i5.20480
Tang, Tianqi, Du, Heming, Yu, Xin and Yang, Yi (2022). Monocular camera-based point-goal navigation by learning depth channel and cross-modality pyramid fusion. 36th AAAI Conference on Artificial Intelligence / 34th Conference on Innovative Applications of Artificial Intelligence / 12th Symposium on Educational Advances in Artificial Intelligence, Online, 22 February –1 March 2022. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v36i5.20480
One-Shot Talking Face Generation from Single-Speaker Audio-Visual Correlation Learning
Wang, Suzhen, Li, Lincheng, Ding, Yu and Yu, Xin (2022). One-Shot Talking Face Generation from Single-Speaker Audio-Visual Correlation Learning. Thirty-Sixth AAAI Conference on Artificial Intelligence, Online, 22 February - 1 March 2022. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v36i3.20154
One-shot talking face generation from single-speaker audio-visual correlation learning
Wang, Suzhen, Li, Lincheng, Ding, Yu and Yu, Xin (2022). One-shot talking face generation from single-speaker audio-visual correlation learning. 36th AAAI Conference on Artificial Intelligence / 34th Conference on Innovative Applications of Artificial Intelligence / 12th Symposium on Educational Advances in Artificial Intelligence, Online, 22 February –1 March 2022. Palo Alto, CA United States: ASSOC. doi: 10.1609/aaai.v36i3.20154
Batch Multi-Fidelity Active Learning with Budget Constraints
Li, Shibo, Phillips, Jeff M., Yu, Xin, Kirby, Robert M. and Zhe, Shandian (2022). Batch Multi-Fidelity Active Learning with Budget Constraints. 36th Conference on Neural Information Processing Systems (NeurIPS 2022), Online, 28 November - 9 December 2022. Maryland Heights, MO United States: Morgan Kaufmann Publishers.
End-to-end multi-instance robotic reaching from monocular vision
Zhuang, Zheyu, Yu, Xin and Mahony, Robert (2021). End-to-end multi-instance robotic reaching from monocular vision. IEEE International Conference on Robotics and Automation (ICRA), Xian, China, 30 May - 5 June 2021. Washington, DC United States: IEEE Computer Society. doi: 10.1109/ICRA48506.2021.9561518
Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion
Wang, Suzhen, Li, Lincheng, Ding, Yu, Fan, Changjie and Yu, Xin (2021). Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion. Thirtieth International Joint Conference on Artificial Intelligence, Montreal, Canada, 19-27 August 2021. Los Angeles, CA United States: International Joint Conferences on Artificial Intelligence Organization. doi: 10.24963/ijcai.2021/152
Auto-navigator: decoupled neural architecture search for visual navigation
Tang, Tianqi, Yu, Xin, Dong, Xuanyi and Yang, Yi (2021). Auto-navigator: decoupled neural architecture search for visual navigation. IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 5-9 January 2021. Piscataway, NJ United States: IEEE. doi: 10.1109/WACV48630.2021.00379
The IKEA ASM dataset: understanding people assembling furniture through actions, objects and pose
Ben-Shabat, Yizhak, Yu, Xin, Saleh, Fatemeh, Campbell, Dylan, Rodriguez-Opazo, Cristian, Li, Hongdong and Gould, Stephen (2021). The IKEA ASM dataset: understanding people assembling furniture through actions, objects and pose. IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI United States, 5-9 January 2021. Piscataway, NJ United States: IEEE Computer Society. doi: 10.1109/WACV48630.2021.00089
Modeling the probabilistic distribution of unlabeled data for one-shot medical image segmentation
Ding, Yuhang, Yu, Xin and Yang, Yi (2021). Modeling the probabilistic distribution of unlabeled data for one-shot medical image segmentation. 35th AAAI Conference on Artificial Intelligence / 33rd Conference on Innovative Applications of Artificial Intelligence / 11th Symposium on Educational Advances in Artificial Intelligence, Online, 2–9 February 2021. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v35i2.16212
Write-a-speaker: Text-based emotional and rhythmic talking-head generation
Li, Lincheng, Wang, Suzhen, Zhang, Zhimeng, Ding, Yu, Zheng, Yixing, Yu, Xin and Changjie Fan (2021). Write-a-speaker: Text-based emotional and rhythmic talking-head generation. 35th AAAI Conference on Artificial Intelligence / 33rd Conference on Innovative Applications of Artificial Intelligence / 11th Symposium on Educational Advances in Artificial Intelligence, Online, 2–9 February 2021. Palo Alto, CA United States: Association for the Advancement of Artificial Intelligence. doi: 10.1609/aaai.v35i3.16286
VTNET: visual transformer network for object goal navigation
Du, Heming, Yu, Xin and Zheng, Liang (2021). VTNET: visual transformer network for object goal navigation. 9th International Conference on Learning Representations, Virtual, 3-7 May 2021. Appleton WI USA: International Conference on Learning Representations.
A general approach to state refinement
Kennedy, Gerard, Gao, Jin, Zhuang, Zheyu, Yu, Xin and Mahony, Robert (2021). A general approach to state refinement. IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Virtual, 27 September - 1 October 2021. Piscataway, NJ, United States: IEEE. doi: 10.1109/IROS51168.2021.9636400
ARVo: learning all-range volumetric correspondence for video deblurring
Li, Dongxu, Xu, Chenchen, Zhang, Kaihao, Yu, Xin, Zhong, Yiran, Ren, Wenqi, Suominen, Hanna and Li, Hongdong (2021). ARVo: learning all-range volumetric correspondence for video deblurring. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 19-25 June 2021. Washington, DC, United States: I E E E Computer Society. doi: 10.1109/CVPR46437.2021.00763
DSC-PoseNet: learning 6DoF object pose estimation via dual-scale consistency
Yang, Zongxin, Yu, Xin and Yang, Yi (2021). DSC-PoseNet: learning 6DoF object pose estimation via dual-scale consistency. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 19-25 June 2021. Washington, DC, United States: I E E E Computer Society. doi: 10.1109/CVPR46437.2021.00390
Few-shot Weighted Style Matching for Glaucoma Detection
Liu, Jinhui and Yu, Xin (2021). Few-shot Weighted Style Matching for Glaucoma Detection. First CAAI International Conference, CICAI 2021, Hangzhou, China, 5–6 June 2021. Cham, Switzerland: Springer. doi: 10.1007/978-3-030-93046-2_25
Gait recognition via effective global-local feature representation and local temporal aggregation
Lin, Beibei, Zhang, Shunli and Yu, Xin (2021). Gait recognition via effective global-local feature representation and local temporal aggregation. 18th IEEE/CVF International Conference on Computer Vision (ICCV), Virtual, 11-17 October 2021. New York, NY, United States: IEEE. doi: 10.1109/ICCV48922.2021.01438
Joint 3D human shape recovery and pose estimation from a single image with bilayer graph
Yu, Xin, van Baar, Jeroen and Chen, Siheng (2021). Joint 3D human shape recovery and pose estimation from a single image with bilayer graph. 9th International Conference on 3D Vision (3DV), London, United Kingdom, 1-3 December 2021. Piscataway, NJ United States: IEEE Computer Society. doi: 10.1109/3DV53792.2021.00060
PR-RRN: pairwise-regularized residual-recursive networks for non-rigid structure-from-motion
Zeng, Haitian, Dai, Yuchao, Yu, Xin, Wang, Xiaohan and Yang, Yi (2021). PR-RRN: pairwise-regularized residual-recursive networks for non-rigid structure-from-motion. 18th IEEE/CVF International Conference on Computer Vision (ICCV), Virtual, 11-17 October 2021. New York, NY, United States: IEEE. doi: 10.1109/ICCV48922.2021.00555
PSTNET: POINT SPATIO-TEMPORAL CONVOLUTION ON POINT CLOUD SEQUENCES
Fan, Hehe, Yu, Xin, Ding, Yuhang, Yang, Yi and Kankanhalli, Mohan (2021). PSTNET: POINT SPATIO-TEMPORAL CONVOLUTION ON POINT CLOUD SEQUENCES. International Conference on Learning Representations, ICLR.
RFNet: region-aware fusion network for incomplete multi-modal brain tumor segmentation
Ding, Yuhang, Yu, Xin and Yang, Yi (2021). RFNet: region-aware fusion network for incomplete multi-modal brain tumor segmentation. 18th IEEE/CVF International Conference on Computer Vision (ICCV), Virtual, 11-17 October 2021. New York, NY, United States: IEEE. doi: 10.1109/ICCV48922.2021.00394
RGB-D saliency detection via cascaded mutual information minimization
Zhang, Jing, Fan, Deng-Ping, Dai, Yuchao, Yu, Xin, Zhong, Yiran, Barnes, Nick and Shao, Ling (2021). RGB-D saliency detection via cascaded mutual information minimization. 18th IEEE/CVF International Conference on Computer Vision (ICCV), Virtual, 11-17 October 2021. New York, NY, United States: IEEE. doi: 10.1109/ICCV48922.2021.00430
Removing raindrops and rain streaks in one go
Quan, Ruijie, Yu, Xin, Liang, Yuanzhi and Yang, Yi (2021). Removing raindrops and rain streaks in one go. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 19-25 June 2021. Washington, DC, United States: IEEE COMPUTER SOC. doi: 10.1109/CVPR46437.2021.00903
Self-supervised visibility learning for novel view synthesis
Shi, Yujiao, Li, Hongdong and Yu, Xin (2021). Self-supervised visibility learning for novel view synthesis. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 19-25 June 2021. Washington, DC, United States: I E E E Computer Society. doi: 10.1109/CVPR46437.2021.00955
Super-resolving cross-domain face miniatures by peeking at one-shot exemplar
Li, Peike, Yu, Xin and Yang, Yi (2021). Super-resolving cross-domain face miniatures by peeking at one-shot exemplar. 18th IEEE/CVF International Conference on Computer Vision (ICCV), Virtual, 11-17 October 2021. New York, NY, United States: IEEE. doi: 10.1109/ICCV48922.2021.00443
TSPNet: hierarchical feature learning via temporal semantic pyramid for sign language translation
Li, Dongxu, Xu, Chenchen, Yu, Xin, Zhang, Kaihao, Swift, Ben, Suominen, Hanna and Li, Hongdong (2020). TSPNet: hierarchical feature learning via temporal semantic pyramid for sign language translation. 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada, 6-12 December 2020. Maryland Heights, MO USA: Morgan Kaufmann Publishers.
6DoF object pose estimation via differentiable proxy voting regularizer
Yu, Xin, Zhuang, Zheyu, Koniusz, Piotr and Li, Hongdong (2020). 6DoF object pose estimation via differentiable proxy voting regularizer. 31st British Machine Vision Conference, Virtual, 7-10 September 2020. Bath, United Kingdom: British Machine Vision Association, BMVA.
Copy and Paste GAN: Face Hallucination From Shaded Thumbnails
Zhang, Yang, Tsang, Ivor W., Luo, Yawei, Hu, Chang-Hui, Lu, Xiaobo and Yu, Xin (2020). Copy and Paste GAN: Face Hallucination From Shaded Thumbnails. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA United States, 13-19 June 2020. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvpr42600.2020.00738
Weakly-Supervised Salient Object Detection via Scribble Annotations
Zhang, Jing, Yu, Xin, Li, Aixuan, Song, Peipei, Liu, Bowen and Dai, Yuchao (2020). Weakly-Supervised Salient Object Detection via Scribble Annotations. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA United States, 13-19 June 2020. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/cvpr42600.2020.01256
Going Beyond Real Data: A Robust Visual Representation for Vehicle Re-identification
Zheng, Zhedong, Jiang, Minyue, Wang, Zhigang, Wang, Jian, Bai, Zechen, Zhang, Xuanmeng, Yu, Xin, Tan, Xiao, Yang, Yi, Wen, Shilei and Ding, Errui (2020). Going Beyond Real Data: A Robust Visual Representation for Vehicle Re-identification. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Electr Network, Jun 14-19, 2020. LOS ALAMITOS: IEEE COMPUTER SOC. doi: 10.1109/CVPRW50498.2020.00307
LyRN (Lyapunov Reaching Network): A Real-Time Closed Loop approach from Monocular Vision
Zhuang, Zheyu, Yu, Xin and Mahony, Robert (2020). LyRN (Lyapunov Reaching Network): A Real-Time Closed Loop approach from Monocular Vision. 2020 IEEE International Conference on Robotics and Automation (ICRA), Online, 31 May - 15 June 2020. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/icra40945.2020.9196781
Optimal Feature Transport for Cross-View Image Geo-Localization
Shi, Yujiao, Yu, Xin, Liu, Liu, Zhang, Tong and Li, Hongdong (2020). Optimal Feature Transport for Cross-View Image Geo-Localization. 34th AAAI Conference on Artificial Intelligence / 32nd Innovative Applications of Artificial Intelligence Conference / 10th AAAI Symposium on Educational Advances in Artificial Intelligence, New York Ny, Feb 07-12, 2020. PALO ALTO: ASSOC ADVANCEMENT ARTIFICIAL INTELLIGENCE.
Transferring Cross-domain Knowledge for Video Sign Language Recognition
Li, Dongxu, Yu, Xin, Xu, Chenchen, Petersson, Lars and Li, Hongdong (2020). Transferring Cross-domain Knowledge for Video Sign Language Recognition. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Electr Network, Jun 14-19, 2020. NEW YORK: IEEE. doi: 10.1109/CVPR42600.2020.00624
When Humans Meet Machines: Towards Efficient Segmentation Networks
Li, Peike, Dong, Xuanyi, Yu, Xin and Yang, Yi (2020). When Humans Meet Machines: Towards Efficient Segmentation Networks. British Machine Vision Association, BMVA.
Where am I looking at? Joint Location and Orientation Estimation by Cross-View Matching
Shi, Yujiao, Yu, Xin, Campbell, Dylan and Li, Hongdong (2020). Where am I looking at? Joint Location and Orientation Estimation by Cross-View Matching. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Online, 14-19 June 2020. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/CVPR42600.2020.00412
Li, Dongxu, Opazo, Cristian Rodriguez, Yu, Xin and Li, Hongdong (2020). Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison. IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Snowmass Co, Mar 01-05, 2020. LOS ALAMITOS: IEEE COMPUTER SOC.
Recovering Faces from Portraits with Auxiliary Facial Attributes
Shiri, Fatemeh, Yu, Xin, Porikli, Fatih, Hartley, Richard and Koniusz, Piotr (2019). Recovering Faces from Portraits with Auxiliary Facial Attributes. 19th IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa Village, HI United States, 7-11 January 2019. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/WACV.2019.00049
Bringing a Blurry Frame Alive at High Frame-Rate with an Event Camera
Pan, Liyuan, Scheerlinck, Cedric, Yu, Xin, Hartley, Richard, Liu, Miaomiao and Dai, Yuchao (2019). Bringing a Blurry Frame Alive at High Frame-Rate with an Event Camera. 32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA United States, 16-20 June 2019. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/CVPR.2019.00698
SOSNet: Second Order Similarity Regularization for Local Descriptor Learning
Tian, Yurun, Yu, Xin, Fan, Bin, Wu, Fuchao, Heijnen, Huub and Balntas, Vassileios (2019). SOSNet: Second Order Similarity Regularization for Local Descriptor Learning. 32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach Ca, Jun 16-20, 2019. NEW YORK: IEEE. doi: 10.1109/CVPR.2019.01127
Spatial-Aware Feature Aggregation for Cross-View Image based Geo-Localization
Shi, Yujiao, Liu, Liu, Yu, Xin and Li, Hongdong (2019). Spatial-Aware Feature Aggregation for Cross-View Image based Geo-Localization. 33rd Conference on Neural Information Processing Systems (NeurIPS), Vancouver Canada, Dec 08-14, 2019. LA JOLLA: NEURAL INFORMATION PROCESSING SYSTEMS (NIPS).
Unsupervised Extraction of Local Image Descriptors via Relative Distance Ranking Loss
Yu, Xin, Tian, Yurun, Porikli, Fatih, Hartley, Richard, Li, Hongdong, Heijnen, Huub and Balntas, Vassileios (2019). Unsupervised Extraction of Local Image Descriptors via Relative Distance Ranking Loss. IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea, 27 October - 2 November 2019. Piscataway, NJ United States: Institute of Electrical and Electronics Engineers. doi: 10.1109/ICCVW.2019.00351
Face Super-Resolution Guided by Facial Component Heatmaps
Yu, Xin, Fernando, Basura, Ghanem, Bernard, Porikli, Fatih and Hartley, Richard (2018). Face Super-Resolution Guided by Facial Component Heatmaps. 15th European Conference on Computer Vision (ECCV), Munich Germany, Sep 08-14, 2018. CHAM: SPRINGER INTERNATIONAL PUBLISHING AG. doi: 10.1007/978-3-030-01240-3_14
Identity-preserving Face Recovery from Portraits
Shiri, Fatemeh, Yu, Xin, Porikli, Fatih, Hartley, Richard and Koniusz, Piotr (2018). Identity-preserving Face Recovery from Portraits. 18th IEEE Winter Conference on Applications of Computer Vision (WACV), Nv, Mar 12-15, 2018. NEW YORK: IEEE. doi: 10.1109/WACV.2018.00018
Learning Strict Identity Mappings in Deep Residual Networks
Yu, Xin, Yu, Zhiding and Ramalingam, Srikumar (2018). Learning Strict Identity Mappings in Deep Residual Networks. 31st IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City Ut, Jun 18-23, 2018. NEW YORK: IEEE. doi: 10.1109/CVPR.2018.00466
Super-Resolving Very Low-Resolution Face Images with Supplementary Attributes
Yu, Xin, Fernando, Basura, Hartley, Richard and Porikli, Fatih (2018). Super-Resolving Very Low-Resolution Face Images with Supplementary Attributes. 31st IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City Ut, Jun 18-23, 2018. NEW YORK: IEEE. doi: 10.1109/CVPR.2018.00101
Shiri, Fatemeh, Yu, Xin, Koniusz, Piotr and Porikli, Fatih (2017). Face Destylization. International Conference on Digital Image Computing - Techniques and Applications (DICTA), Sydney Australia, Nov 29-Dec 01, 2017. NEW YORK: IEEE.
Face Hallucination with Tiny Unaligned Images by Transformative Discriminative Neural Networks
Yu, Xin and Porikli, Fatih (2017). Face Hallucination with Tiny Unaligned Images by Transformative Discriminative Neural Networks. 31st AAAI Conference on Artificial Intelligence, San Francisco Ca, Feb 04-09, 2017. PALO ALTO: ASSOC ADVANCEMENT ARTIFICIAL INTELLIGENCE.
Yu, Xin and Porikli, Fatih (2017). Hallucinating Very Low-Resolution Unaligned and Noisy Face Images by Transformative Discriminative Autoencoders. 30th IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu Hi, Jul 21-26, 2017. NEW YORK: IEEE. doi: 10.1109/CVPR.2017.570
Ultra-Resolving Face Images by Discriminative Generative Networks
Yu, Xin and Porikli, Fatih (2016). Ultra-Resolving Face Images by Discriminative Generative Networks. 14th European Conference on Computer Vision (ECCV), Amsterdam, Netherlands, 8-16 October 2016. Heidelberg, Germany: Springer. doi: 10.1007/978-3-319-46454-1_20
(2024) Google Inc
(2023–2028) Google Asia Pacific Pte Ltd
Analytics for the Australian Grains Industry (AAGI)
(2023–2027) Grains Research & Development Corporation
Advancing Human Perception: Countering Evolving Malicious Fake Visual Data
(2023–2026) ARC Discovery Early Career Researcher Award
(2023–2024) Griffith University
Two way Auslan Translation
Doctor Philosophy — Principal Advisor
Other advisors:
The prediction, diagnosis, and severity estimation models for plant disease
Doctor Philosophy — Principal Advisor
Other advisors:
Digital Asset IP Protection
Doctor Philosophy — Principal Advisor
Other advisors:
Enhancing Building Fire Safety by Utilising Machine Learning Techniques
Doctor Philosophy — Principal Advisor
Other advisors:
Towards Efficient Pest Detection in Agriculture
Doctor Philosophy — Principal Advisor
Other advisors:
Advancing Human Perception: Countering Evolving Malicious Fake Visual Data
Doctor Philosophy — Principal Advisor
Other advisors:
Combating evolving deceptive fake visual information through deepfake detection
Doctor Philosophy — Principal Advisor
Other advisors:
Two way Auslan Translation
Doctor Philosophy — Principal Advisor
Other advisors:
Data driven approaches for smart farming
Doctor Philosophy — Associate Advisor
Other advisors:
Remote Sensing Analysis in computer vision
Doctor Philosophy — Associate Advisor
Other advisors: