  Assignment #1, Due Tuesday, April 11:  PDF,   Word97
     (Files for question 9:    Master Program: psum.c,  Slave Program: spsum.c,   rand_data.txt)

  Assignment #2, Due Tuesday, May 2:  PDF,   Word97


Each of the following lectures can be downloaded or viewed as an Acrobat PDF file or as a Microsoft PowerPoint 97 file:

3-7-2K The Need and Feasibility of Parallel Computing, Technology Trends, Microprocessor Performance Attributes, Goal of Parallel Computing. Computing Elements, Programming Models, Flynn's Classification, Multiprocessors Vs. Multicomputers. Current Trends In Parallel Architectures, Communication Architecture.

3-9-2K Parallel Architectures Convergence: Naming, Operations, Ordering, Replication. Communication Cost with Respect to Various Programming Models, Communication Cost Model.

3-14-2K Parallel Programs: Conditions of Parallelism. Asymptotic Notations for Algorithm Analysis, PRAM. Levels of Parallelism, Hardware Vs. Software Concurrency. Amdahl’s Law, DOP, Concurrency Profile. Steps in Creating Parallel Programs: Decomposition, Assignment, Orchestration, Mapping.

3-16-2K Parallelization of an Example Program: Ocean Simulation Iterative Equation Solver.

3-21-2K Parallel Programming for Performance.

3-28-2K Message-Passing Programming: Parallel Virtual Machine (PVM).

3-30-2K Message-Passing Computing Examples: Image Transformations, Mandelbrot Set. Divide-and-Conquer Problem Partitioning: Parallel Bucket Sort, Numerical Integration.

4-6-2K Synchronous Iteration: Barriers, Iterative Solution of Linear Equations. Dynamic Load Balancing: Centralized, Distributed, Moore's Shortest Path Algorithm.

4-11-2K Network Requirements For Parallel Computing. Static Point-to-point Connection Network Topologies. Network Embeddings. Dynamic Connection Networks.

4-13-2K Parallel System Performance: Evaluation & Scalability. Workload Selection. Parallel Performance Metrics Revisited. Application Models of Parallel Computers. Parallel System Scalability.

4-18-2K Shared Memory Multiprocessors. The Cache Coherence Problem. Memory Access Consistency Models. Cache Coherence Approaches. Snoopy Bus Protocols: Write-Invalidate: MSI, MESI; Write-Update: Dragon.

4-25-2K Scalable Distributed Memory Machines. MPPs Scalability Issues: Node, Network, Communication Assist, OS, Cost. Message Processing Issues: Functionality of CA, Physical DMA, System-Level Vs. User-Level Ports. MPP Physical Scaling Examples: nCUBE/2, CM-5, IBM SP-2, iWARP, Intel Paragon.

4-27-2K Cache Coherence in Scalable Distributed Memory Machines: Hierarchical Snooping, Directory-based cache coherence.

5-4-2K Exam Review.


This course covers issues involved in the design and use of high-performance parallel computing systems, including parallel computer models, the concept of scalable performance, the memory hierarchy, cache coherence, parallel and scalable architectures, and parallel programming concepts. A number of current parallel machines will be studied.


Advanced Computer Architecture EECC-722.



Parallel Computer Architecture: A Hardware/Software Approach, David E. Culler, Jaswinder P. Singh, Morgan Kaufmann Publishers, 1999.

Parallel Programming: Techniques and Applications Using Networked Workstations and Parallel Computers, Barry Wilkinson, Michael Allen, Prentice Hall, 1998.  ISBN: 0-13-671710-1


Designing and Building Parallel Programs, Ian Foster, Addison-Wesley, 1995, complete textbook online.

PVM (Parallel Virtual Machine)

PVM: Parallel Virtual Machine: A Users' Guide and Tutorial for Networked Parallel Computing, Al Geist (Editor), et al., MIT Press, 1994, complete online version.

Advanced Tutorial on PVM 3.4 New Features and Capabilities, Al Geist, presented at EuroPVM-MPI'97, 1997.

Scalable Parallel Computing, Kai Hwang, Zhiwei Xu, McGraw-Hill, 1998.

Advanced Computer Architecture: Parallelism, Scalability, Programmability, Kai Hwang, McGraw-Hill, 1993.

Selected papers.

