Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis W Feng, X He, TJ Fu, V Jampani, A Akula, P Narayana, S Basu, XE Wang, ... ICLR 2023, 2023 | 152 | 2023 |
Layoutgpt: Compositional visual planning and generation with large language models W Feng, W Zhu, T Fu, V Jampani, A Akula, X He, S Basu, XE Wang, ... Advances in Neural Information Processing Systems 36, 2024 | 58 | 2024 |
Neuro-Symbolic Procedural Planning with Commonsense Prompting Y Lu, W Feng, W Zhu, W Xu, XE Wang, M Eckstein, WY Wang ICLR 2023, 2023 | 26* | 2023 |
Velma: Verbalization embodiment of llm agents for vision and language navigation in street view R Schumann, W Zhu, W Feng, TJ Fu, S Riezler, WY Wang Proceedings of the AAAI Conference on Artificial Intelligence 38 (17), 18924 …, 2024 | 18 | 2024 |
CPL: Counterfactual Prompt Learning for Vision and Language Models X He, D Yang, W Feng, TJ Fu, A Akula, V Jampani, P Narayana, S Basu, ... EMNLP 2022, 2022 | 18 | 2022 |
Discriminative diffusion models as few-shot vision and language learners X He, W Feng, TJ Fu, V Jampani, A Akula, P Narayana, S Basu, WY Wang, ... arXiv preprint arXiv:2305.10722, 2023 | 2 | 2023 |
ULN: Towards Underspecified Vision-and-Language Navigation W Feng, TJ Fu, Y Lu, WY Wang EMNLP 2022, 2022 | 2 | 2022 |
EDIS: Entity-Driven Image Search over Multimodal Web Content S Liu, W Feng, W Chen, WY Wang arXiv preprint arXiv:2305.13631, 2023 | 1 | 2023 |
Reward Guided Latent Consistency Distillation J Li, W Feng, W Chen, WY Wang arXiv preprint arXiv:2403.11027, 2024 | | 2024 |
Anticipating the Unseen Discrepancy for Vision and Language Navigation Y Lu, H Zhang, P Nie, W Feng, W Xu, XE Wang, WY Wang arXiv preprint arXiv:2209.04725, 2022 | | 2022 |