Παρακολούθηση
Shawn Tan
Shawn Tan
Montreal Institute of Learning Algorithms
Η διεύθυνση ηλεκτρονικού ταχυδρομείου έχει επαληθευτεί στον τομέα mila.quebec - Αρχική σελίδα
Τίτλος
Παρατίθεται από
Παρατίθεται από
Έτος
Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks
Y Shen, S Tan, A Sordoni, A Courville
International Conference on Learning Representations (ICLR), 2019, 2019
3952019
Improving the interpretability of deep neural networks with stimulated learning
S Tan, KC Sim, M Gales
2015 ieee workshop on automatic speech recognition and understanding (asru …, 2015
652015
Improving explorability in variational inference with annealed variational objectives
CW Huang, S Tan, A Lacoste, AC Courville
Advances in neural information processing systems 31, 2018
642018
Icentia11k: An unsupervised representation learning dataset for arrhythmia subtype discovery
S Tan, G Androz, A Chamseddine, P Fecteau, A Courville, Y Bengio, ...
arXiv preprint arXiv:1910.09570, 2019
302019
Ordered memory
Y Shen, S Tan, A Hosseini, Z Lin, A Sordoni, AC Courville
Advances in Neural Information Processing Systems 32, 2019
282019
Learning utterance-level normalisation using Variational Autoencoders for robust automatic speech recognition
S Tan, KC Sim
Spoken Language Technology Workshop (SLT), 2016 IEEE, 2016
252016
Moduleformer: Learning modular large language models from uncurated data
Y Shen, Z Zhang, T Cao, S Tan, Z Chen, C Gan
arXiv preprint arXiv:2306.04640, 2023
172023
Towards implicit complexity control using variable-depth deep neural networks for automatic speech recognition
S Tan, KC Sim
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
122016
Explicitly modeling syntax in language models with incremental parsing and a dynamic oracle
Y Shen, S Tan, A Sordoni, S Reddy, A Courville
arXiv preprint arXiv:2011.07960, 2020
9*2020
Sparse universal transformer
S Tan, Y Shen, Z Chen, A Courville, C Gan
arXiv preprint arXiv:2310.07096, 2023
82023
Investigating biases in textual entailment datasets
S Tan, Y Shen, C Huang, A Courville
arXiv preprint arXiv:1906.09635, 2019
82019
Self-organized hierarchical softmax
Y Shen, S Tan, C Pal, A Courville
arXiv preprint arXiv:1707.08588, 2017
82017
Unsupervised dependency graph network
Y Shen, S Tan, A Sordoni, P Li, J Zhou, A Courville
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
72022
Scattered Mixture-of-Experts Implementation
S Tan, Y Shen, R Panda, A Courville
arXiv preprint arXiv:2403.08245, 2024
52024
Generating contradictory, neutral, and entailing sentences
Y Shen, S Tan, CW Huang, A Courville
arXiv preprint arXiv:1803.02710, 2018
52018
Learning to dequantise with truncated flows
S Tan, CW Huang, A Sordoni, A Courville
International Conference on Learning Representations, 2021
42021
Recursive top-down production for sentence generation with latent trees
S Tan, Y Shen, TJ O'Donnell, A Sordoni, A Courville
arXiv preprint arXiv:2010.04704, 2020
42020
Inferring identity factors for grouped examples
S Tan, CJ Pal, A Courville
12018
Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler
Y Shen, M Stallone, M Mishra, G Zhang, S Tan, A Prasad, AM Soria, ...
arXiv preprint arXiv:2408.13359, 2024
2024
Latent variable language models
S Tan
2019
Δεν είναι δυνατή η εκτέλεση της ενέργειας από το σύστημα αυτή τη στιγμή. Προσπαθήστε ξανά αργότερα.
Άρθρα 1–20