Overcoming catastrophic forgetting in neural networks J Kirkpatrick, R Pascanu, N Rabinowitz, J Veness, G Desjardins, AA Rusu, ... Proceedings of the national academy of sciences 114 (13), 3521-3526, 2017 | 4677 | 2017 |

Progressive neural networks AA Rusu, NC Rabinowitz, G Desjardins, H Soyer, J Kirkpatrick, ... arXiv preprint arXiv:1606.04671, 2016 | 2090 | 2016 |

Theano: a CPU and GPU math expression compiler J Bergstra, O Breuleux, F Bastien, P Lamblin, R Pascanu, G Desjardins, ... Proceedings of the Python for scientific computing conference (SciPy) 4 (3), 1-7, 2010 | 1982 | 2010 |

Understanding disentangling in -VAE CP Burgess, I Higgins, A Pal, L Matthey, N Watters, G Desjardins, ... arXiv preprint arXiv:1804.03599, 2018 | 909 | 2018 |

Theano: A Python framework for fast computation of mathematical expressions R Al-Rfou, G Alain, A Almahairi, C Angermueller, D Bahdanau, N Ballas, ... arXiv e-prints, arXiv: 1605.02688, 2016 | 883 | 2016 |

Theano: A CPU and GPU math compiler in Python J Bergstra, O Breuleux, F Bastien, P Lamblin, R Pascanu, G Desjardins, ... Proc. 9th python in science conf 1, 3-10, 2010 | 808 | 2010 |

Policy distillation AA Rusu, SG Colmenarejo, C Gulcehre, G Desjardins, J Kirkpatrick, ... arXiv preprint arXiv:1511.06295, 2015 | 596 | 2015 |

Combining modality specific deep neural networks for emotion recognition in video SE Kahou, C Pal, X Bouthillier, P Froumenty, Ç Gülçehre, R Memisevic, ... Proceedings of the 15th ACM on International conference on multimodal …, 2013 | 396 | 2013 |

Theano: Deep learning on gpus with python J Bergstra, F Bastien, O Breuleux, P Lamblin, R Pascanu, O Delalleau, ... NIPS 2011, BigLearning Workshop, Granada, Spain 3 (0), 2011 | 343 | 2011 |

Unsupervised and transfer learning challenge: a deep learning approach G Mesnil, Y Dauphin, X Glorot, S Rifai, Y Bengio, I Goodfellow, E Lavoie, ... Proceedings of ICML Workshop on Unsupervised and Transfer Learning, 97-110, 2012 | 264 | 2012 |

Natural neural networks G Desjardins, K Simonyan, R Pascanu Advances in neural information processing systems 28, 2015 | 201 | 2015 |

Theano: A Python framework for fast computation of mathematical expressions TTD Team, R Al-Rfou, G Alain, A Almahairi, C Angermueller, D Bahdanau, ... arXiv preprint arXiv:1605.02688, 2016 | 193 | 2016 |

Tempered Markov chain Monte Carlo for training of restricted Boltzmann machines G Desjardins, A Courville, Y Bengio, P Vincent, O Delalleau Proceedings of the thirteenth international conference on artificial …, 2010 | 141 | 2010 |

Parallel tempering for training of restricted Boltzmann machines G Desjardins, A Courville, Y Bengio, P Vincent, O Delalleau Proceedings of the thirteenth international conference on artificial …, 2010 | 111 | 2010 |

Disentangling factors of variation via generative entangling G Desjardins, A Courville, Y Bengio arXiv preprint arXiv:1210.5474, 2012 | 103 | 2012 |

Steerable Playlist Generation by Learning Song Similarity from Radio Station Playlists. F Maillet, D Eck, G Desjardins, P Lamere ISMIR, 345-350, 2009 | 94 | 2009 |

Information asymmetry in KL-regularized RL A Galashov, SM Jayakumar, L Hasenclever, D Tirumala, J Schwarz, ... arXiv preprint arXiv:1905.01240, 2019 | 87 | 2019 |

Quadratic polynomials learn better image features J Bergstra, G Desjardins, P Lamblin, Y Bengio Technical report, 1337, 2009 | 85 | 2009 |

Empirical evaluation of convolutional RBMs for vision G Desjardins, Y Bengio Technical Report 1327, Département d¢Informatique et de Recherche …, 2008 | 72 | 2008 |

Progressive neural networks. arXiv 2016 AA Rusu, NC Rabinowitz, G Desjardins, H Soyer, J Kirkpatrick, ... arXiv preprint arXiv:1606.04671, 2016 | 54 | 2016 |