Chen D, Lim W, Bakhshalipour M, Gibbons P, Hoe J and Parno B. HerQules: securing programs via hardware-enforced message queues. Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems. (773-788).
Gemieux M, Li M, Savaria Y, David J and Zhu G. A Hybrid Architecture With Low Latency Interfaces Enabling Dynamic Cache Management. IEEE Access. 10.1109/ACCESS.2018.2876597. 6. (62826-62839).
Pham-Quoc C, Ashraf I, Al-Ars Z and Bertels K. Heterogeneous Hardware Accelerators with Hybrid Interconnect. Proceedings of the 2015 International Conference on Advanced Computing and Applications (ACOMP). (59-66).
Yang H, Fleming K, Adler M and Emer J.
(2014). LEAP Shared Memories: Automating the Construction of FPGA Coherent Memories 2014 IEEE 22nd Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM). 10.1109/FCCM.2014.43. 978-1-4799-5111-6. (117-124).
Papakonstantinou A, Gururaj K, Stratton J, Chen D, Cong J and Hwu W.
(2013). Efficient compilation of CUDA kernels for high-performance computing on FPGAs. ACM Transactions on Embedded Computing Systems. 13:2. (1-26). Online publication date: 1-Sep-2013.
Sun S, Monga M, Jones P and Zambreno J. An I/O Bandwidth-Sensitive Sparse Matrix-Vector Multiplication Engine on FPGAs. IEEE Transactions on Circuits and Systems I: Regular Papers. 10.1109/TCSI.2011.2161389. 59:1. (113-123).
Yan L, Wu B, Wen Y, Zhang S and Chen T. A Reconfigurable Processor Architecture Combining Multi-core and Reconfigurable Processing Unit. Proceedings of the 2010 10th IEEE International Conference on Computer and Information Technology. (2897-2902).