Sicong Leng
Nanyang Technological University & Alibaba DAMO Academy
Verified email at e.ntu.edu.sg - Homepage
Title
Cited by
Year
Interventional video grounding with dual contrastive learning
G Nan, R Qiao, Y Xiao, J Liu, S Leng, H Zhang, W Lu
CVPR 2021, 2021
Cited by: 165, Year: 2021
Mitigating object hallucinations in large vision-language models through visual contrastive decoding
S Leng, H Zhang, G Chen, X Li, S Lu, C Miao, L Bing
CVPR 2024, 2024
Cited by: 110, Year: 2024
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
Z Cheng, S Leng, H Zhang, Y Xin, X Li, G Chen, Y Zhu, W Zhang, Z Luo, ...
arXiv preprint arXiv:2406.07476, 2024
Cited by: 66, Year: 2024
Speaker-oriented latent structures for dialogue-based relation extraction
G Nan, G Luo, S Leng, Y Xiao, W Lu
arXiv preprint arXiv:2109.05182, 2021
Cited by: 9, Year: 2021
Uncovering What, Why and How: A Comprehensive Benchmark for Causation Understanding of Video Anomaly
XT Hang Du, Sicheng Zhang, Binzhu Xie, Guoshun Nan, Jiayang Zhang, Junrui ...
CVPR 2024, 2024
Cited by: 8*, Year: 2024
AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
W An, F Tian, S Leng, J Nie, H Lin, QY Wang, G Dai, P Chen, S Lu
arXiv preprint arXiv:2406.12718, 2024
Cited by: 6, Year: 2024
Tell2Design: A Dataset for Language-Guided Floor Plan Generation
S Leng, Y Zhou, MH Dupty, WS Lee, SC Joyce, W Lu
ACL 2023, Area Chair Award, 2023
Cited by: 6, Year: 2023
Videollama 2: Advancing spatial-temporal modeling and audio understanding in video-llms (2024)
Z Cheng, S Leng, H Zhang, Y Xin, X Li, G Chen, Y Zhu, W Zhang, Z Luo, ...
URL https://arxiv.org/abs/2406.07476
Cited by: 5
Constrained Layout Generation with Factor Graphs
MH Dupty, Y Dong, S Leng, G Fu, YL Goh, W Lu, WS Lee
CVPR 2024, 2024
Cited by: 3, Year: 2024
The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
S Leng, Y Xing, Z Cheng, Y Zhou, H Zhang, X Li, D Zhao, S Lu, C Miao, ...
arXiv preprint arXiv:2410.12787, 2024
Cited by: 1, Year: 2024
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss
Z Cheng, H Zhang, K Li, S Leng, Z Hu, F Wu, D Zhao, X Li, L Bing
arXiv preprint arXiv:2410.17243, 2024
Year: 2024
BenchX: A Unified Benchmark Framework for Medical Vision-Language Pretraining on Chest X-Rays
Y Zhou, T Faith, Y Xu, S Leng, X Xu, Y Liu, RSM Goh
NeurIPS 2024, 2024
Year: 2024