Turning a clip model into a scene text detector W Yu, Y Liu, W Hua, D Jiang, B Ren, X Bai Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 71 | 2023 |
Perceiving stroke-semantic context: Hierarchical contrastive learning for robust scene text recognition H Liu, B Wang, Z Bao, M Xue, S Kang, D Jiang, Y Liu, B Ren Proceedings of the AAAI Conference on Artificial Intelligence 36 (2), 1702-1710, 2022 | 50 | 2022 |
Neural collaborative graph machines for table structure recognition H Liu, X Li, B Liu, D Jiang, Y Liu, B Ren Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 40 | 2022 |
Visual information extraction in the wild: practical dataset and end-to-end solution J Kuang, W Hua, D Liang, M Yang, D Jiang, B Ren, X Bai International Conference on Document Analysis and Recognition, 36-53, 2023 | 37 | 2023 |
Show, read and reason: Table structure recognition with flexible context aggregator H Liu, X Li, B Liu, D Jiang, Y Liu, B Ren, R Ji Proceedings of the 29th ACM International Conference on Multimedia, 1084-1092, 2021 | 36 | 2021 |
The devil is in the frequency: Geminated gestalt autoencoder for self-supervised visual pre-training H Liu, X Jiang, X Li, A Guo, Y Hu, D Jiang, B Ren Proceedings of the AAAI Conference on Artificial Intelligence 37 (2), 1649-1656, 2023 | 30 | 2023 |
Hierarchical multi-label text classification with horizontal and vertical category correlations L Xu, S Teng, R Zhao, J Guo, C Xiao, D Jiang, B Ren Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021 | 27 | 2021 |
Sequence-to-action: Grammatical error correction with action guided sequence generation J Li, J Guo, Y Zhu, X Sheng, D Jiang, B Ren, L Xu Proceedings of the AAAI Conference on Artificial Intelligence 36 (10), 10974 …, 2022 | 24 | 2022 |
Hrvda: High-resolution visual document assistant C Liu, K Yin, H Cao, X Jiang, X Li, Y Liu, D Jiang, X Sun, L Xu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 16 | 2024 |
Nommer: Nominate synergistic context in vision transformer for visual recognition H Liu, X Jiang, X Li, Z Bao, D Jiang, B Ren Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 16 | 2022 |
Attention where it matters: Rethinking visual document understanding with selective region concentration H Cao, C Bao, C Liu, H Chen, K Yin, H Liu, Y Liu, D Jiang, X Sun Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 13 | 2023 |
Query-driven generative network for document information extraction in the wild H Cao, X Li, J Ma, D Jiang, A Guo, Y Hu, H Liu, Y Liu, B Ren Proceedings of the 30th ACM International Conference on Multimedia, 4261-4271, 2022 | 13 | 2022 |
GMN: generative multi-modal network for practical document information extraction H Cao, J Ma, A Guo, Y Hu, H Liu, D Jiang, Y Liu, B Ren arXiv preprint arXiv:2207.04713, 2022 | 12 | 2022 |
Enhancing visual document understanding with contrastive learning in large visual-language models X Li, Y Wu, X Jiang, Z Guo, M Gong, H Cao, Y Liu, D Jiang, X Sun Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 10 | 2024 |
Puzzlenet: scene text detection by segment context graph learning H Liu, A Guo, D Jiang, Y Hu, B Ren arXiv preprint arXiv:2002.11371, 2020 | 10 | 2020 |
Os-msl: One stage multimodal sequential link framework for scene segmentation and classification Y Liu, L Qiao, D Yin, Z Jiang, X Jiang, D Jiang, B Ren Proceedings of the 30th ACM International Conference on Multimedia, 6269-6277, 2022 | 8 | 2022 |
Relational representation learning in visually-rich documents X Li, Y Zheng, Y Hu, H Cao, Y Wu, D Jiang, Y Liu, B Ren Proceedings of the 30th ACM International Conference on Multimedia, 4614-4624, 2022 | 7 | 2022 |
Accurate structured-text spotting for arithmetical exercise correction Y Hu, Y Zheng, H Liu, D Jiang, Y Liu, B Ren Proceedings of the AAAI conference on artificial intelligence 34 (01), 686-693, 2020 | 7 | 2020 |
Human head detection method, eletronic device and storage medium D Jiang US Patent App. 16/351,093, 2019 | 7 | 2019 |
Grab What You Need: Rethinking Complex Table Structure Recognition with Flexible Components Deliberation H Liu, X Li, M Gong, B Liu, Y Wu, D Jiang, Y Liu, X Sun Proceedings of the AAAI Conference on Artificial Intelligence 38 (4), 3603-3611, 2024 | 6 | 2024 |