Tong Geng
Title
Cited by
Cited by
Year
FPDeep: Acceleration and load balancing of CNN training on FPGA clusters
T Geng, T Wang, A Sanaullah, C Yang, R Xu, R Patel, M Herbordt
2018 IEEE 26th Annual International Symposium on Field-Programmable Custom …, 2018
482018
A framework for acceleration of CNN training on deeply-pipelined FPGA clusters with work and weight load balancing
T Geng, T Wang, A Sanaullah, C Yang, R Patel, M Herbordt
2018 28th International Conference on Field Programmable Logic and …, 2018
27*2018
Fully integrated FPGA molecular dynamics simulations
C Yang, T Geng, T Wang, R Patel, Q Xiong, A Sanaullah, C Wu, J Sheng, ...
Proceedings of the International Conference for High Performance Computing …, 2019
172019
FPDeep: Scalable Acceleration of CNN Training on Deeply-Pipelined FPGA Clusters
T Geng, T Wang, A Li, X Jin, M Herbordt
IEEE Transactions on Computers, 2020
16*2020
LP-BNN: Ultra-low-latency BNN inference with layer parallelism
T Geng, T Wang, C Wu, C Yang, SL Song, A Li, M Herbordt
2019 IEEE 30th International Conference on Application-specific Systems …, 2019
152019
O3BNN: an out-of-order architecture for high-performance binarized neural network inference with fine-grained pruning
T Geng, T Wang, C Wu, C Yang, W Wu, A Li, MC Herbordt
Proceedings of the ACM International Conference on Supercomputing (ICS), 461-472, 2019
142019
Ghostsz: A transparent fpga-accelerated lossy compression framework
Q Xiong, R Patel, C Yang, T Geng, A Skjellum, MC Herbordt
2019 IEEE 27th Annual International Symposium on Field-Programmable Custom …, 2019
132019
BSTC: A novel binarized-soft-tensor-core design for accelerating bit-based approximated neural nets
A Li, T Geng, T Wang, M Herbordt, SL Song, K Barker
Proceedings of the International Conference for High Performance Computing …, 2019
122019
Molecular dynamics range-limited force evaluation optimized for FPGAs
C Yang, T Geng, T Wang, C Lin, J Sheng, V Sachdeva, W Sherman, ...
2019 IEEE 30th International Conference on Application-specific Systems …, 2019
122019
AWB-GCN: A graph convolutional network accelerator with runtime workload rebalancing
T Geng, A Li, R Shi, C Wu, T Wang, Y Li, P Haghi, A Tumeo, S Che, ...
2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020
102020
Accelerating AP3M-Based Computational Astrophysics Simulations with Reconfigurable Clusters
T Wang, T Geng, X Jin, M Herbordt
2019 IEEE 30th International Conference on Application-specific Systems …, 2019
72019
FP-AMR: A Reconfigurable Fabric Framework for Adaptive Mesh Refinement Applications
T Wang, T Geng, X Jin, M Herbordt
2019 IEEE 27th Annual International Symposium on Field-Programmable Custom …, 2019
72019
O3BNN-R: An out-of-order architecture for high-performance and regularized BNN inference
T Geng, A Li, T Wang, C Wu, Y Li, R Shi, W Wu, M Herbordt
IEEE Transactions on Parallel and Distributed Systems 32 (1), 199-213, 2020
62020
CSB-RNN: A Faster-than-Realtime RNN Acceleration Framework with Compressed Structured Blocks
R Shi, P Dong, T Geng, Y Ding, X Ma, HKH So, M Herbordt, A Li, Y Wang
Proceedings of the ACM International Conference on Supercomputing (ICS), 2020
52020
MacSim: a MAC-enabled high-performance low-power SIMD architecture
T Geng, L Waeijen, M Peemen, H Corporaal, Y He
2016 Euromicro Conference on Digital System Design (DSD), 160-167, 2016
52016
FP-AMG: FPGA-Based Acceleration Framework for Algebraic Multigrid Solvers
P Haghi, T Geng, A Guo, T Wang, M Herbordt
2020 IEEE 28th Annual International Symposium on Field-Programmable Custom …, 2020
42020
Soft-Core. Multiple-Lane, FPGA-based ADCs for a Liquid Helium Environment
Z Xiang, T Wang, T Geng, T Xiang, X Jin, M Herbordt
2018 IEEE High Performance extreme Computing Conference (HPEC), 1-6, 2018
42018
A configurable SIMD architecture with explicit datapath for intelligent learning
Y He, M Peemen, L Waeijen, E Diken, M Fiumara, G Rauwerda, ...
2016 International Conference on Embedded Computer Systems: Architectures …, 2016
42016
An access-pattern-aware on-chip vector memory system with automatic loading for SIMD architectures
T Geng, E Diken, T Wang, L Jozwiak, M Herbordt
2018 IEEE High Performance extreme Computing Conference (HPEC), 1-7, 2018
32018
CQNN: a CGRA-based QNN Framework
T Geng, C Wu, C Tan, B Fang, A Li, M Herbordt
2020 IEEE High Performance Extreme Computing Conference (HPEC), 1-7, 2020
22020
The system can't perform the operation now. Try again later.
Articles 1–20