RIT Computer Engineering


Tuesday, 23-Jul-2019 18:39:55 EDT

5205 Visitors Since 12-March-2002


  Assignment #1, Due Tuesday, April 16:    PDF,   Word97

  Assignment #2, Due Thursday, May 9:  PDF,   Word97

Files for question 1:
PVM Master Program: psum.c PVM Slave Program:spsum.c Numbers file: rand_data.txt
Files for question 3:
problem3-region-color.bmp problem3-geometry.txt problem3-colormap.txt


For the following lecture notes you can download or view a lecture as an Acrobat PDF file, or as a Microsoft Powerpoint97 file:

3-12-2K2 The Need and Feasibility of Parallel Computing, Technology Trends, Microprocessor Performance Attributes, Goal of Parallel Computing. Computing Elements, Programming Models, Flynn's Classification, Multiprocessors Vs. Multicomputers. Current Trends In Parallel Architectures, Communication Architecture.

3-14-2K2 Parallel Architectures Convergence: Naming, Operations, Ordering, Replication. Communication Cost with respect to various programming models, Communication Cost Model.

3-19-2K2 Parallel Programs: Conditions of Parallelism. Asymptotic Notations for Algorithm Analysis, PRAM. Levels of Parallelism, Hardware Vs. Software Concurrency. Amdahl’s Law, DOP, Concurrency Profile. Steps in Creating Parallel Programs: Decomposition, Assignment, Orchestration, Mapping.

3-21-2K2 Parallelization of An Example Program: Ocean simulation Iterative equation solver.

3-28-2K2 Parallel Programming for Performance.

4-4-2K2 Message-Passing Programming: Parallel Virtual Machine (PVM).

4-11-2K2 Message Passing Interface (MPI).

4-16-2K2 Parallel Programming Techniques and Examples.   Massively Parallel Computations: Image Transformations, Mandelbrot Set.   Divide-and-conquer Problem Partitioning: Parallel Bucket Sort, Numerical Integration, Gravitational N-Body Problem.   Pipelined Computations: Addition, Insertion Sort, Solving Upper-triangular System of Linear Equations.

4-18-2K2 Synchronous Iteration: Barriers, Iterative Solution of Linear Equations. Dynamic Load Balancing: Centralized, Distributed, Moore's Shortest Path Algorithm.

4-23-2K2 Network Requirements For Parallel Computing. Static Point-to-point Connection Network Topologies. Network Embeddings. Dynamic Connection Networks.

4-25-2K2 Shared Memory Multiprocessors. The Cache Coherence Problem. Memory Access Consistency Models. Cache Coherence Approaches. Snoopy Bus Protocols: Write-invalidate: MSI, MESI, Write-Update: Dragon.

4-30-2K2 Parallel System Performance: Evaluation & Scalability. Workload Selection. Parallel Performance Metrics Revisited. Application Models of Parallel Computers. Parallel System Scalability.

5-2-2K2 Scalable Distributed Memory Machines. MPPs Scalability Issues: Node, Network, Communication Assist, OS, Cost. Message Processing Issues: Functionality of CA, Physical DMA, System-Level Vs. User-Level Ports. MPP Physical Scaling Examples: nCUBE/2, CM-5, IBM SP-2, iWARP, Intel Paragon.

5-9-2K2 Cache Coherence in Scalable Distributed Memory Machines: Hierarchical Snooping, Directory-based cache coherence.


Tuesday and Thursday: 4:00-5:50 PM, 9/3149


Dr. Muhammad Shaaban
e-mail: meseec@osfmail.isc.rit.edu
Office: 17-2507 X2373

Office Hours:
My Spring 2002 schedule


Current:    http://www.rit.edu/~meseec/eecc756-spring2002/
Spring 2001: http://www.rit.edu/~meseec/eecc756-spring2001/
Spring 2000: http://www.rit.edu/~meseec/eecc756-spring2000/
Spring 99: http://www.rit.edu/~meseec/eecc756-spring99/
Spring 98: http://www.rit.edu/~meseec/eecc756-spring98/
Spring 97: http://www.rit.edu/~meseec/eecc756/


This course covers a number of issues involved in the design and utilization of high performance parallel computing systems. This includes: parallel computer models, the concept of scalable performance, the memory hierarchy, cache coherence issues, parallel and scalable architectures, parallel programming concepts. A number of current parallel machines will be studied.


Computer Architecture EECC-551.



Parallel Computer Architecture: A Hardware/Software Approach, David E. Culler, Jaswinder P. Singh, Morgan Kaufmann Publishers, 1999.

Parallel Programming: Techniques and Applications Using Networked Workstations and Parallel Computers, Barry Wilkinson, Micheal Allen, Prentice Hall, 1998.  ISBN: 0-13-671710-1  Buy online from Amazon.com


Designing and Building Parallel Programs, Ian Foster, Addison-Wesley, 1995, complete textbook online, ( includes a chapter on MPI).

PVM (Parallel Virtual Machine) Home Page

PVM: Parallel Virtual Machine: A Users' Guide and Tutorial for Networked Parallel Computing, Al Geist(Editor), et al, MIT Press, 1994, complete online version , PDF version , postscript version.

Advanced Tutorial on PVM 3.4 New Features and CapabilitiesAl Geist, Presented at EuroPVM-MPI'97, 1997.

MPI Home Page.

MPICH Home Page A Portable Implementation of MPI.

Beginner's Guide to MPI , University of Delaware.

MPI: The Complete Reference , Marc Snir et al., First Edition, 1995 (html version of book).

Scalable Parallel Computing, Kai Hwang, Zhiwei, McGraw-Hill, 1998.

Advanced Computer Architecture: Parallelism, Scalability, Programmability, Kai Hwang, McGraw-Hill, 1993.

Selected papers.

EECC756 Spring 1999 Class Projects.


Participation and class presence: 5%
Homework assignments: 30%
Exam: 35%
Class project: 30%


Attending all lecture sessions is expected.


Week1: Motivation for Parallel Computing, Parallel Programming Models, Parallel Architectures. Communication Cost Model
Week2: Parallel Program characteristics and creation Steps.
Program Parallelization Example: Ocean simulation Iterative equation solver.
Week3: Parallel Programming for Performance.
Week4: Message-Passing Programming: Parallel Virtual Machine (PVM).
Week5: Message-Passing Computing Examples.
Week6: Network Requirements For Parallel Computing. Static Point-to-point Connection Network Topologies. Dynamic Connection Networks.
Week7: Parallel System Performance: Evaluation & Scalability Metrics.
Week8: Shared Memory Multiprocessors. The Cache Coherence Problem. Scalable Distributed Memory Machines.
Week9: Cache Coherence in Scalable Distributed Memory Machines.
Week10: Exam/Project Presentations.

  This page is 24Kbytes long and was last modified on:   Monday, 25-Feb-2008 17:22:59 EST.

Made with at least 30% post-consumer recycled bits