Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023 | 1605 | 2023 |
Palm 2 technical report R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ... arXiv preprint arXiv:2305.10403, 2023 | 1257 | 2023 |
Detectors: Detecting objects with recursive feature pyramid and switchable atrous convolution S Qiao, LC Chen, A Yuille Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 925 | 2021 |
Few-shot image recognition by predicting parameters from activations S Qiao, C Liu, W Shen, AL Yuille Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 660 | 2018 |
Deep Co-Training for Semi-Supervised Image Recognition S Qiao, W Shen, Z Zhang, B Wang, A Yuille The European Conference on Computer Vision (ECCV), 2018 | 548 | 2018 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024 | 411 | 2024 |
Micro-batch training with batch-channel normalization and weight standardization S Qiao, H Wang, C Liu, W Shen, A Yuille arXiv preprint arXiv:1903.10520, 2019 | 325 | 2019 |
Single-shot object detection with enriched semantics Z Zhang, S Qiao, C Xie, W Shen, B Wang, AL Yuille Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 255 | 2018 |
Unrealcv: Virtual worlds for computer vision W Qiu, F Zhong, Y Zhang, S Qiao, Z Xiao, TS Kim, Y Wang Proceedings of the 25th ACM international conference on Multimedia, 1221-1224, 2017 | 253 | 2017 |
Training Deep Neural Networks in Generations: A More Tolerant Teacher Educates Better Students C Yang, L Xie, S Qiao, A Yuille Proceedings of the AAAI Conference on Artificial Intelligence, 2019 | 233* | 2019 |
Vip-deeplab: Learning visual perception with depth-aware video panoptic segmentation S Qiao, Y Zhu, H Adam, A Yuille, LC Chen Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 156 | 2021 |
Cmt-deeplab: Clustering mask transformers for panoptic segmentation Q Yu, H Wang, D Kim, S Qiao, M Collins, Y Zhu, H Adam, A Yuille, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 93 | 2022 |
Sort: Second-order response transform for visual recognition Y Wang, L Xie, C Liu, S Qiao, Y Zhang, W Zhang, Q Tian, A Yuille Proceedings of the IEEE International Conference on Computer Vision, 1359-1368, 2017 | 66 | 2017 |
Robust face detection via learning small faces on hard images Z Zhang, W Shen, S Qiao, Y Wang, B Wang, A Yuille Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2020 | 59 | 2020 |
Scalenet: Guiding object proposal generation in supermarkets and beyond S Qiao, W Shen, W Qiu, C Liu, A Yuille Proceedings of the IEEE International Conference on Computer Vision, 1791-1800, 2017 | 59 | 2017 |
Deeplab2: A tensorflow library for deep labeling M Weber, H Wang, S Qiao, J Xie, MD Collins, Y Zhu, L Yuan, D Kim, Q Yu, ... arXiv preprint arXiv:2106.09748, 2021 | 55 | 2021 |
Scaling wide residual networks for panoptic segmentation LC Chen, H Wang, S Qiao arXiv preprint arXiv:2011.11675, 2020 | 55 | 2020 |
Moat: Alternating mobile convolution and attention brings strong vision models C Yang, S Qiao, Q Yu, X Yuan, Y Zhu, A Yuille, H Adam, LC Chen The Eleventh International Conference on Learning Representations, 2022 | 54 | 2022 |
Tubeformer-deeplab: Video mask transformer D Kim, J Xie, H Wang, S Qiao, Q Yu, HS Kim, H Adam, IS Kweon, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 46 | 2022 |
Waymo open dataset: Panoramic video panoptic segmentation J Mei, AZ Zhu, X Yan, H Yan, S Qiao, LC Chen, H Kretzschmar European Conference on Computer Vision, 53-72, 2022 | 45 | 2022 |