Dhruv Batra
Georgia Tech and Facebook AI Research
Verified email at gatech.edu
Title
Cited by
Year
Grad-CAM: Visual explanations from deep networks via gradient-based localization
RR Selvaraju, M Cogswell, A Das, R Vedantam, D Parikh, D Batra
Proceedings of the IEEE international conference on computer vision, 618-626, 2017
Cited by: 11419, Year: 2017
VQA: Visual Question Answering
S Antol, A Agrawal, J Lu, M Mitchell, D Batra, C Lawrence Zitnick, ...
Proceedings of the IEEE International Conference on Computer Vision, 2425-2433, 2015
Cited by: 4079, Year: 2015
ViLBERT: Pretraining task-agnostic visiolinguistic representations for vision-and-language tasks
J Lu, D Batra, D Parikh, S Lee
Advances in neural information processing systems 32, 2019
Cited by: 1785, Year: 2019
Making the V in VQA matter: Elevating the role of image understanding in visual question answering
Y Goyal, T Khot, D Summers-Stay, D Batra, D Parikh
Proceedings of the IEEE conference on computer vision and pattern …, 2017
Cited by: 1564, Year: 2017
Hierarchical question-image co-attention for visual question answering
J Lu, J Yang, D Batra, D Parikh
Advances in neural information processing systems 29, 2016
Cited by: 1533, Year: 2016
Visual dialog
A Das, S Kottur, K Gupta, A Singh, D Yadav, JMF Moura, D Parikh, ...
Proceedings of the IEEE conference on computer vision and pattern …, 2017
Cited by: 859, Year: 2017
Joint unsupervised learning of deep representations and image clusters
J Yang, D Parikh, D Batra
Proceedings of the IEEE conference on computer vision and pattern …, 2016
Cited by: 719, Year: 2016
Habitat: A platform for embodied AI research
M Savva, A Kadian, O Maksymets, Y Zhao, E Wijmans, B Jain, J Straub, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
Cited by: 686, Year: 2019
Graph R-CNN for scene graph generation
J Yang, J Lu, S Lee, D Batra, D Parikh
Proceedings of the European conference on computer vision (ECCV), 670-685, 2018
Cited by: 642, Year: 2018
iCoseg: Interactive co-segmentation with intelligent scribble guidance
D Batra, A Kowdle, D Parikh, J Luo, T Chen
2010 IEEE computer society conference on computer vision and pattern …, 2010
Cited by: 578, Year: 2010
Embodied question answering
A Das, S Datta, G Gkioxari, S Lee, D Parikh, D Batra
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
Cited by: 490, Year: 2018
A corpus and cloze evaluation for deeper understanding of commonsense stories
N Mostafazadeh, N Chambers, X He, D Parikh, D Batra, L Vanderwende, ...
Proceedings of the 2016 Conference of the North American Chapter of the …, 2016
Cited by: 475, Year: 2016
Don't just assume; look and answer: Overcoming priors for visual question answering
A Agrawal, D Batra, D Parikh, A Kembhavi
Proceedings of the IEEE conference on computer vision and pattern …, 2018
Cited by: 445, Year: 2018
Human attention in visual question answering: Do humans and deep networks look at the same regions?
A Das, H Agrawal, L Zitnick, D Parikh, D Batra
Computer Vision and Image Understanding 163, 90-100, 2017
Cited by: 425, Year: 2017
Neural baby talk
J Lu, J Yang, D Batra, D Parikh
Proceedings of the IEEE conference on computer vision and pattern …, 2018
Cited by: 422, Year: 2018
A comparative study of modern inference techniques for structured discrete energy minimization problems
JH Kappes, B Andres, FA Hamprecht, C Schnörr, S Nowozin, D Batra, ...
International Journal of Computer Vision 115 (2), 155-184, 2015
Cited by: 409*, Year: 2015
Learning cooperative visual dialog agents with deep reinforcement learning
A Das, S Kottur, JMF Moura, S Lee, D Batra
Proceedings of the IEEE international conference on computer vision, 2951-2960, 2017
Cited by: 402, Year: 2017
Reducing overfitting in deep networks by decorrelating representations
M Cogswell, F Ahmed, R Girshick, L Zitnick, D Batra
arXiv preprint arXiv:1511.06068, 2015
Cited by: 373, Year: 2015
Grad-CAM: Why did you say that?
RR Selvaraju, A Das, R Vedantam, M Cogswell, D Parikh, D Batra
arXiv preprint arXiv:1611.07450, 2016
Cited by: 359, Year: 2016
Visual Storytelling
THK Huang, F Ferraro, N Mostafazadeh, I Misra, A Agrawal, J Devlin, ...
NAACL, 2016
Cited by: 342, Year: 2016
Articles 1–20