Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards A Rame, G Couairon, M Shukor, C Dancette, JB Gaya, L Soulier, M Cord NeurIPS 2023, 2023 | 26 | 2023 |
UnIVAL: Unified Model for Image, Video, Audio and Language Tasks M Shukor, C Dancette, A Rame, M Cord Transactions on Machine Learning Research (TMLR), 2023, 2023 | 15* | 2023 |
Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment M Shukor, G Couairon, M Cord BMVC 2022, 2022 | 15 | 2022 |
Synthetic training data generation for deep learning based quality inspection P Gutierrez, M Luschkova, A Cordier, M Shukor, M Schappert, T Dahmen International Conference on Quality Control by Artificial Vision (QCAV 2021 …, 2021 | 15 | 2021 |
eP-ALM: Efficient Perceptual Augmentation of Language Models M Shukor, C Dancette, M Cord ICCV 2023, 2023 | 12 | 2023 |
Transformer decoders with multimodal regularization for cross-modal food retrieval M Shukor, G Couairon, A Grechka, M Cord Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 12 | 2022 |
Semantic unfolding of stylegan latent space M Shukor, X Yao, BB Damodaran, P Hellier 2022 IEEE International Conference on Image Processing (ICIP), 221-225, 2022 | 10* | 2022 |
Beyond task performance: Evaluating and reducing the flaws of large multimodal models with in-context learning M Shukor, A Rame, C Dancette, M Cord ICLR 2024, 2023 | 8 | 2023 |
Vision and Structured-Language Pretraining for Cross-Modal Food Retrieval M Shukor, N Thome, M Cord arXiv preprint arXiv:2212.04267, 2022 | 6* | 2022 |
Video coding using learned latent gan compression M Shukor, BB Damodaran, X Yao, P Hellier Proceedings of the 30th ACM International Conference on Multimedia, 2239-2248, 2022 | 5 | 2022 |
Improved baselines for data-efficient perceptual augmentation of llms T Vallaeys, M Shukor, M Cord, J Verbeek arXiv preprint arXiv:2403.13499, 2024 | 2 | 2024 |
What Makes Multimodal In-Context Learning Work? FB Baldassini, M Shukor, M Cord, L Soulier, B Piwowarski arXiv preprint arXiv:2404.15736, 2024 | | 2024 |
FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models BT Corradini, M Shukor, P Couairon, G Couairon, F Scarselli, M Cord arXiv preprint arXiv:2403.20105, 2024 | | 2024 |
Supplementary material for eP-ALM: Efficient Perceptual Augmentation of Language Models M Shukor, C Dancette, M Cord | | |