Parallelism-Aware Batch Scheduling: Enhancing both Performance and Fairness of Shared DRAM Systems Thomas Moscibroda, Onur Mutlu
|
MISE: Providing Performance Predictability and Improving Fairness in Shared Main Memory Systems Onur Mutlu, Ben Jaiyen, Yoongu Kim, Vivek Seshadri, Lavanya Subramanian
|
AƩrgia: Exploiting Packet Latency Slack in On-Chip Networks Chita R Das, Thomas Moscibroda, Onur Mutlu, Reetuparna Das
|
Fairness via Source Throttling: A Configurable and High-Performance Fairness Substrate for Multi-Core Memory Systems Yale N Patt, Onur Mutlu, Chang Joo Lee, Eiman Ebrahimi
|
A Case for Bufferless Routing in On-Chip Networks Onur Mutlu, Thomas Moscibroda
|
CHIPPER: A Low-Complexity Bufferless Deflection Router Onur Mutlu, Chris Craik, Chris Fallin
|
NVIDIA Tesla: A Unified Graphics and Computing Architecture John Montrym, Stuart Oberman, John Nickolls, Erik Lindholm
|
A closer look at GPUs Mike Houston, Kayvon Fatahalian
|
Improving GPU performance via large warps and two-level warp scheduling Yale N Patt, Onur Mutlu, Rustam Miftakhutdinov, Chang Joo Lee, Michael Shebanow, Veynu Narasiman
|
A preliminary architecture for a basic data flow processor David Glasco, Michael Garland, Brucek Khailany, William J Dally, Stephen W Keckler
|