Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation
Y Zhu, Y Wu, K Olszewski, J Ren, S Tulyakov, Y Yan
International Conference on Learning Representations (ICLR), 2023
Cited by: 46; Year: 2023

Quantized GAN for Complex Music Generation from Dance Videos
Y Zhu, K Olszewski, Y Wu, P Achlioptas, M Chai, Y Yan, S Tulyakov
European Conference on Computer Vision (ECCV), 2022
Cited by: 24; Year: 2022

Learning Audio-Visual Correlations from Variational Cross-Modal Generation
Y Zhu, Y Wu, H Latapie, Y Yang, Y Yan
The International Conference on Acoustics, Speech, & Signal Processing (ICASSP), 2021
Cited by: 21; Year: 2021

Skeleton Sequence and RGB Frame Based Multi-Modality Feature Fusion Network for Action Recognition
X Zhu, Y Zhu, H Wang, H Wen, Y Yan, P Liu
ACM Transactions on Multimedia Computing, Communications, and Applications …, 2022
Cited by: 20; Year: 2022

Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents
Y Zhu, Y Wu, Y Yang, Y Yan
European Conference on Computer Vision (ECCV), 2020
Cited by: 11; Year: 2020

Hierarchical HMM for Eye Movement Classification
Y Zhu, Y Yan, O Komogortsev
European Conference on Computer Vision Workshop (ECCV Workshop), 2020
Cited by: 10; Year: 2020

Saying the Unseen: Video Descriptions via Dialog Agents
Y Zhu, Y Wu, Y Yang, Y Yan
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021
Cited by: 8; Year: 2021

Boundary Guided Learning-Free Semantic Control with Diffusion Models
Y Zhu, Y Wu, Z Deng, O Russakovsky, Y Yan
Conference on Neural Information Processing Systems (NeurIPS), 2023
Cited by: 7; Year: 2023

Vision + X: A Survey on Multimodal Learning in the Light of Data
Y Zhu, Y Wu, N Sebe, Y Yan
arXiv preprint arXiv:2210.02884, 2022
Cited by: 5; Year: 2022

Denoising Diffusion Probabilistic Models to Predict the Density of Molecular Clouds
D Xu, J Tan, CJ Hsu, Y Zhu
The Astrophysical Journal (ApJ), 2023
Cited by: 4; Year: 2023

Multiview based 3D scene understanding on partial point sets
Y Zhu, SE Shepstone, P Martínez-Nuevo, MS Kristoffersen, F Moutarde, ...
arXiv preprint arXiv:1812.01712, 2018
Cited by: 4; Year: 2018

Diffusion in Diffusion: Cyclic One-Way Diffusion for Text-Vision-Conditioned Generation
Y Yang, R Wang, Z Qian, Y Zhu, Y Wu
International Conference on Learning Representations (ICLR), 2024
Cited by: 3; Year: 2024

Discrete Diffusion Reward Guidance Methods for Offline Reinforcement Learning
M Coleman, O Russakovsky, C Allen-Blanchette, Y Zhu
International Conference on Machine Learning Workshop (ICML Workshop), 2023
Cited by: 2; Year: 2023

Supplementing Missing Visions via Dialog for Scene Graph Generations
Y Zhu, X Zhu, Y Shang, Z Zhao, Y Yan
The International Conference on Acoustics, Speech, & Signal Processing (ICASSP), 2024
Cited by: 1; Year: 2024

D³: Scaling Up Deepfake Detection by Learning from Discrepancy
Y Yang, Z Qian, Y Zhu, Y Wu
arXiv preprint arXiv:2404.04584, 2024
Year: 2024

Mining and Unifying Heterogeneous Contrastive Relations for Weakly-Supervised Actor-Action Segmentation
B Duan, H Tang, C Sun, Y Zhu, Y Yan
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024
Year: 2024

DETER: Detecting Edited Regions for Deterring Generative Manipulations
S Wang, Y Zhu, R Wang, A Dharmasiri, O Russakovsky, Y Wu
arXiv preprint arXiv:2312.10539, 2023
Year: 2023

Unseen Image Synthesis with Diffusion Models
Y Zhu, Y Wu, Z Deng, O Russakovsky, Y Yan
arXiv preprint arXiv:2310.09213, 2023
Year: 2023

Multimodal Learning and Generation Toward a Multisensory and Creative AI System
Y Zhu
Illinois Institute of Technology, 2023
Year: 2023

Denoising Diffusion Probabilistic Models to Predict the Number Density of Molecular Clouds in Astronomy
D Xu, J Tan, CJ Hsu, Y Zhu
ICLR 2023 Workshop on Physics for Machine Learning (ICLR Workshop), 2023
Year: 2023