Selected Papers: (for complete list please
check my CV )
[Book] Mohamed Zahran, Heterogeneous Computing: Hardware and Software Perspectives, ACM Books, 2019 [ISBN: 9781450362337 | PDF ISBN: 9781450361002
 Nick Greenquist, Doruk Kilitcioglu, Mohamed Zahran and Anasse Bari, GPU Accelerated Matrix Factorization for Recommender Systems, the 6th IEEE International Conference on Big Data Analytics (ICBDA 2021), March 2021. (Best Presentation Award)
 Antonio Mallia, Michał Siedlaczek, Torsten Suel, and Mohamed Zahran ,GPU-Accelerated Decoding of Integer Lists, in The 28th ACM International Conference on Information and Knowledge Management (CIKM), Beijing, China, November 2019.
 Tulsi Jain, Nitish Agarwal, and Mohamed Zahran, Performance Prediction for Multi-threaded Applications in The 2nd International Workshop on AI-assisted Design for Architecture (AIDArc), held in conjunction with the International Symposium on Computer Architecture (ISCA), June 2019.
 Mohamed Zahran and Marsha Berger, "Parallel Computing At The Undergraduate Level: Lessons Learned and Insights", in Workshop on Computer Architecture Education Held in conjunction with 46th International Symposium on Computer Architecture (ISCA), June 2019.
 Mahmoud Khairy, Amr Wassal, and Mohamed Zahran, A survey of architectural approaches for improving GPGPU performance, programmability and heterogeneity, Elsevier Journal of Parallel and Distributed Computing, Volume 127, May 2019, Pages 65-88.
 Chris Quackenbush and Mohamed Zahran, Beyond Profiling, in The 1st International Workshop on AI-assisted Design for Architecture (AIDArc),
 Chris Quackenbush and Mohamed Zahran, Beyond Profiling: Scaling Profiling Data Usage to Multiple Applications, arXiv:1711.01654 , 2017.
 Mahmoud Khairy, Mohamed Zahran, and Amr Wassal, SACAT: Streaming-Aware Conflict-Avoiding Thrashing-Resistant GPGPU Cache Management Scheme, IEEE Transactions on Parallel and Distributed Systems, vol 28, issue 6, June 2017.
 Numair Khan and Mohamed Zahran, Space-efficient Pointwise Computation of the Distance Transform on GPUs, in 7th IEEE Workshop Parallel / Distributed Computing and Optimization
 Chris Rohlfs and Mohamed Zahran, Optimal Bandwidth Selection for Kernel Regression Using a Fast Grid Search and a GPU, in 7th IEEE Workshop Parallel / Distributed Computing and Optimization (PDCO 2017), in conjunction with 31st IEEE International Parallel & Distributed Processing Symposium (IPDPS), May 2017.
 Mohamed Zahran, Heterogeneous Computing: Here to Stay, ACM Queue, vol 14, No. 6, Nov/Dec 2016, and Communications of the ACM, March 2017.
 Mohamed Zahran, Brain-Inspired Machines: What, Exactly, Are We Looking for?, IEEE Pulse, Mar 2016.
 Mahmoud Khairy, Mohamed Zahran, and Amr G. Wassal, Efficient utilization of GPGPU cache hierarchy, in the 8th Workshop on General-purpose processing using
 M. Zahran, Multicore processors: Status quo and future directions, in 10th International Computer Engineering Conference (ICENCO), Dec 2014 (Invited Paper) (pdf).
 J. Rajendran, A. K. Kanuparthi, M. Zahran, S. Addepalli,
G. Ormazabal, and R. Karri, Securing processors against insider attacks:
a circuit-microarchitecture co-design approach, IEEE Design
and Test of C2mputers, Vol 30, issue 2, Mar/Apr, 2013
 H. Chtioui, S. Niar Lamih, R. Ben-Atitallah, M. Zahran, Jl. Dekeyser, andM. Abid, A Dynamic Hybrid Cache Coherency Protocol for Shared-Memory MPSoC Architectures,
 Corey Malone, Mohamed Zahran, and Ramesh Karri, Are Hardware Performance Counters a Cost Effective Way for Integrity Checking of Programs?, The Sixth ACM Workshop on Scalable Trusted Computing, October 2011. (pdf)
 Mohamed Salah Souahi, Smail Niar, Mohamed Zahran, Mohamed Benmohamed, Towards Dynamic Cache Block Placement for Multi-processor NUCA, IEEE International Conference on Microelectronics, December 2011.
 Artem Durytskyy, Mohamed Zahran, and Ramesh Karri, Improving Robustness of GPUs by Making Use of Faulty Parts, Proc. International Conference on Computer Design (ICCD11), October 2011. (pdf)
 Arun K. Kanuparthi, Mohamed Zahran, and Ramesh Karri, Feasibility Study of Dynamic Trusted Platform Module, Proc. International Conference on Computer Design (ICCD10),
 Ahmed Youssef, Mohamed Zahran, Mohab Anis, and Mohamed Elmasry,
On the Power Management of Simultaneous
Multithreading Processors, IEEE Transactions on VLSI ,
 Mohamed Zahran and Sally A. McKee, Global Management of Cache Hierarchies , The ACM International Conference on Computing Frontiers (CF'10), Italy, May 2010. (pdf)
 Yufu Zhang , Ankur Srivastava and Mohamed Zahran, On-Chip Sensor Driven Efficient Thermal Profile Estimation Algorithms, ACM Transactions on Design Automation of Electronic Systems, Vol 15, issue 3, May 2010.
 Kim Hazelwood and Mohamed Zahran. Challenges and Opportunities at All Levels: Interactions Among Operating Systems, Compilers, and Multicore Processors, ACM SIGOPS Operating System Review. Volume 43, Issue 2. April 2009.
 Najla Alfaraj, H. Jonathan Chao, and Mohamed Zahran, NBC: Network-based Cache Coherence Protocol for Multistage NoCs, in The International SoC Design Conference (ISOCC), 2009.
 Bushra Ahsan and Mohamed Zahran, Managing Off-Chip Bandwidth: A Case for Bandwidth-Friendly Replacement Policy, in The 2nd Workshop on Managed Multi-Core Systems (MMCS'09), held in conjunction with ASPLOS 2009. (pdf)
 Mohamed Zahran and Sally A. McKee, Adaptive Block Placement Policy for Cache Hierarchies,in SMART'09:3rd Workshop on Statistical and Machine learning approaches to ARchitectures and compilaTion, held in conjunction with HiPEAC 2009. (pdf)
 Bushra Ahsan and Mohamed Zahran, Cache Performance, System Performance, and Off-Chip Bandwidth... Pick any Two , in 3rd workshop Interconnection Network Architectures: On-Chip, Multi-Chip (INA-OCMC), held in conjunction with HiPEAC 2009. (pdf)
Zahran, Kursad Albayraktaroglu, and Manoj Franklin,
Non-Inclusion Property in multi-level Caches Revisited, in the International
Journal of Computers and Their Applications Special Issue on
Techniques and Architectures for High Performance and Energy
Efficient Computing Systems, Vol 14, Num 2, June 2007. ( bib,
 Mohamed Zahran and Anasua Bhowmik, Hybrid Compiler and Microarchitecture Technique for Cache Traffic Optimization, in 9th Workshop on Interaction between Compilers and Computer Architectures (INTERACT 9), held in Conjunction with the 11th International Symposium on High-Performance Computer Architecture (HPCA-11), 2005. (bib, pdf)
 Francois Cantonnet, Yiyi Yao, Mohamed Zahran and Tarek El-Ghazawi, Productivity Analysis of the UPC Language, in 3rd International Workshop on Performance Modeling, Evaluation, and Optimization of Parallel and Distributed Systems (PMEO-PDS), to be held in conjunction with the International Parallel and Distributed Processing Symposium (IPDPS 2004).
 Mohamed Zahran and Manoj Franklin, Dynamic Thread Resizing for Speculative Multithreaded Processors, in International Conference on Computer Design (ICCD), San Jose, CA, October, 2003. (ps)(pdf) (Best Paper Award)
 Mohamed Zahran, Manoj Franklin and Renju Thomas, Confidence Estimation for Register Value Communication in Speculative Multithreaded Architectures, in first value prediction workshop (VPW1), held in conjunction with the 30th Annual International Symposium on Computer Architecture (ISCA), San Diego, California, 2003. (ps)(pdf)
 Mohamed Zahran, On Cache Memory Hierarchy for Chip-Multiprocessor, in MEDEA workshop held in conjunction with PACT 2002 Conference, Charlottesville, Virginia, 2002. Also Appeared in ACM Computer Architecture News, Vol 31, No. 1, March 2003.
 Mohamed Zahran and Manoj Franklin, A Feasibility Study of Hierarchical Multithreading, in International Parallel and Distributed Processing Symposium (IPDPS 2002), Marriott Marina, Fort Lauderdale, Florida, 2002. (ps) (pdf)
 Mohamed Zahran and Manoj Franklin, Hierarchical Multi-threading For Exploiting Parallelism at Multiple Granularities, Workshop on MULTITHREADED EXECUTION, ARCHITECTURE and COMPILATION (MTEAC-5), Austin, Texas, 2001. (ps) (pdf)
 Mohamed Zahran, Ashraf Abdel-Wahab and Samir Shaheen, Adaptive Genetic Algorithm for Multiprocessor Scheduling, poster presentation at the Genetic and Evolutionary Computation Conference (GECCO), Orlando, 1999.
Selected Presentations & Talks (for complete list please check my CV ): Toward Exascale Machine: Challenges and Opportunities, IBM T. J.Watson lab , April 2017.
 Architecture Support for Big Data, Bloomberg, November 2016.
 Panel at IBM Research Workshop on Architectures for Cognitive Computing and Datacenters, IBM T. J. Watson lab , October 2016.
 Heterogeneous Computing: Hardware and Software Perspective, ACM Applicative, June 2016.
 "Off-Chip Bandwidth: The New Wall in The Multicore Era", in CS Departmental seminar series, University of Delaware, March 2009.
[6 ] "Attacking The Von-Neumann Bottleneck: Smart and Scalable Cache Hierarchy in The Chip Multiprocessor Era",
IBM T. J. Watson, Feb 2007.