24小时热门版块排行榜    

查看: 3377  |  回复: 56
【奖励】 本帖被评价53次,作者sciencejoy增加金币 41.8

sciencejoy

新虫 (著名写手)


[资源] Algorithms and parallel computing 算法与并行计算

Algorithms and Parallel Computing
Fayez Gebali


Table of Contents
Preface.

List of Acronyms.

1 Introduction.

1.1 Introduction.

1.2 Toward Automating Parallel Programming.

1.3 Algorithms.

1.4 Parallel Computing Design Considerations.

1.5 Parallel Algorithms and Parallel Architectures.

1.6 Relating Parallel Algorithm and Parallel Architecture.

1.7 Implementation of Algorithms: A Two-Sided Problem.

1.8 Measuring Benefi ts of Parallel Computing.

1.9 Amdahl’s Law for Multiprocessor Systems.

1.10 Gustafson–Barsis's Law.

1.11 Applications of Parallel Computing.

2 Enhancing Uniprocessor Performance.

2.1 Introduction.

2.2 Increasing Processor Clock Frequency.

2.3 Parallelizing ALU Structure.

2.4 Using Memory Hierarchy.

2.5 Pipelining.

2.6 Very Long Instruction Word (VLIW) Processors.

2.7 Instruction-Level Parallelism (ILP) and Superscalar Processors.

2.8 Multithreaded Processor.

3 Parallel Computers.

3.1 Introduction.

3.2 Parallel Computing.

3.3 Shared-Memory Multiprocessors (Uniform Memory Access [UMA]).

3.4 Distributed-Memory Multiprocessor (Nonuniform Memory Access [NUMA]).

3.5 SIMD Processors.

3.6 Systolic Processors.

3.7 Cluster Computing.

3.8 Grid (Cloud) Computing.

3.9 Multicore Systems.

3.10 SM.

3.11 Communication Between Parallel Processors.

3.12 Summary of Parallel Architectures.

4 Shared-Memory Multiprocessors.

4.1 Introduction.

4.2 Cache Coherence and Memory Consistency.

4.3 Synchronization and Mutual Exclusion.

5 Interconnection Networks.

5.1 Introduction.

5.2 Classification of Interconnection Networks by Logical Topologies.

5.3 Interconnection Network Switch Architecture.

6 Concurrency Platforms.

6.1 Introduction.

6.2 Concurrency Platforms.

6.3 Cilk++.

6.4 OpenMP.

6.5 Compute Unifi ed Device Architecture (CUDA).

7 Ad Hoc Techniques for Parallel Algorithms.

7.1 Introduction.

7.2 Defining Algorithm Variables.

7.3 Independent Loop Scheduling.

7.4 Dependent Loops.

7.5 Loop Spreading for Simple Dependent Loops.

7.6 Loop Unrolling.

7.7 Problem Partitioning.

7.8 Divide-and-Conquer (Recursive Partitioning) Strategies.

7.9 Pipelining.

8 Nonserial–Parallel Algorithms.

8.1 Introduction.

8.2 Comparing DAG and DCG Algorithms.

8.3 Parallelizing NSPA Algorithms Represented by a DAG.

8.4 Formal Technique for Analyzing NSPAs.

8.5 Detecting Cycles in the Algorithm.

8.6 Extracting Serial and Parallel Algorithm Performance Parameters.

8.7 Useful Theorems.

8.8 Performance of Serial and Parallel Algorithms on Parallel Computers.

9 z-Transform Analysis.

9.1 Introduction.

9.2 Definition of z-Transform.

9.3 The 1-D FIR Digital Filter Algorithm.

9.4 Software and Hardware Implementations of the z-Transform.

9.5 Design 1: Using Horner’s Rule for Broadcast Input and Pipelined Output.

9.6 Design 2: Pipelined Input and Broadcast Output.

9.7 Design 3: Pipelined Input and Output.

10 Dependence Graph Analysis.

10.1 Introduction.

10.2 The 1-D FIR Digital Filter Algorithm.

10.3 The Dependence Graph of an Algorithm.

10.4 Deriving the Dependence Graph for an Algorithm.

10.5 The Scheduling Function for the 1-D FIR Filter.

10.6 Node Projection Operation.

10.7 Nonlinear Projection Operation.

10.8 Software and Hardware Implementations of the DAG Technique.

11 Computational Geometry Analysis.

11.1 Introduction.

11.2 Matrix Multiplication Algorithm.

11.3 The 3-D Dependence Graph and Computation Domain D.

11.4 The Facets and Vertices of D.

11.5 The Dependence Matrices of the Algorithm Variables.

11.6 Nullspace of Dependence Matrix: The Broadcast Subdomain B.

11.7 Design Space Exploration: Choice of Broadcasting versus Pipelining Variables.

11.8 Data Scheduling.

11.9 Projection Operation Using the Linear Projection Operator.

11.10 Effect of Projection Operation on Data.

11.11 The Resulting Multithreaded/Multiprocessor Architecture.

11.12 Summary of Work Done in this Chapter.

12 Case Study: One-Dimensional IIR Digital Filters.

12.1 Introduction.

12.2 The 1-D IIR Digital Filter Algorithm.

12.3 The IIR Filter Dependence Graph.

12.4 z-Domain Analysis of 1-D IIR Digital Filter Algorithm.

13 Case Study: Two- and Three-Dimensional Digital Filters.

13.1 Introduction.

13.2 Line and Frame Wraparound Problems.

13.3 2-D Recursive Filters.

13.4 3-D Digital Filters.

14 Case Study: Multirate Decimators and Interpolators.

14.1 Introduction.

14.2 Decimator Structures.

14.3 Decimator Dependence Graph.

14.4 Decimator Scheduling.

14.5 Decimator DAG for s1 = [1 0].

14.6 Decimator DAG for s2 = [1 -1].

14.7 Decimator DAG for s3 = [1 1].

14.8 Polyphase Decimator Implementations.

14.9 Interpolator Structures.

14.10 Interpolator Dependence Graph.

14.11 Interpolator Scheduling.

14.12 Interpolator DAG for s1 = [1 0].

14.13 Interpolator DAG for s2 = [1 -1].

14.14 Interpolator DAG for s3 = [1 1].

14.15 Polyphase Interpolator Implementations.

15 Case Study: Pattern Matching.

15.1 Introduction.

15.2 Expressing the Algorithm as a Regular Iterative Algorithm (RIA).

15.3 Obtaining the Algorithm Dependence Graph.

15.4 Data Scheduling.

15.5 DAG Node Projection.

15.6 DESIGN 1: Design Space Exploration When s = [1 1]t.

15.7 DESIGN 2: Design Space Exploration When s = [1 -1]t.

15.8 DESIGN 3: Design Space Exploration When s = [1 0]t.

16 Case Study: Motion Estimation for Video Compression.

16.1 Introduction.

16.2 FBMAs.

16.3 Data Buffering Requirements.

16.4 Formulation of the FBMA.

16.5 Hierarchical Formulation of Motion Estimation.

16.6 Hardware Design of the Hierarchy Blocks.

17 Case Study: Multiplication over GF(2m).

17.1 Introduction.

17.2 The Multiplication Algorithm in GF(2m).

17.3 Expressing Field Multiplication as an RIA.

17.4 Field Multiplication Dependence Graph.

17.5 Data Scheduling.

17.6 DAG Node Projection.

17.7 Design 1: Using d1 = [1 0]t.

17.8 Design 2: Using d2 = [1 1]t.

17.9 Design 3: Using d3 = [1 -1]t.

17.10 Applications of Finite Field Multipliers.

18 Case Study: Polynomial Division over GF(2).

18.1 Introduction.

18.2 The Polynomial Division Algorithm.

18.3 The LFSR Dependence Graph.

18.4 Data Scheduling.

18.5 DAG Node Projection.

18.6 Design 1: Design Space Exploration When s1 = [1 -1].

18.7 Design 2: Design Space Exploration When s2 = [1 0].

18.8 Design 3: Design Space Exploration When s3 = [1 -0.5].

18.9 Comparing the Three Designs.

19 The Fast Fourier Transform.

19.1 Introduction.

19.2 Decimation-in-Time FFT.

19.3 Pipeline Radix-2 Decimation-in-Time FFT Processor.

19.4 Decimation-in-Frequency FFT.

19.5 Pipeline Radix-2 Decimation-in-Frequency FFT Processor.

20 Solving Systems of Linear Equations.

20.1 Introduction.

20.2 Special Matrix Structures.

20.3 Forward Substitution (Direct Technique).

20.4 Back Substitution.

20.5 Matrix Triangularization Algorithm.

20.6 Successive over Relaxation (SOR) (Iterative Technique).

20.7 Problems.

21 Solving Partial Differential Equations Using Finite Difference Method.

21.1 Introduction.

21.2 FDM for 1-D Systems.

References.

Index.
回复此楼

» 本帖附件资源列表

  • 欢迎监督和反馈:小木虫仅提供交流平台,不对该内容负责。
    本内容由用户自主发布,如果其内容涉及到知识产权问题,其责任在于用户本人,如对版权有异议,请联系邮箱:xiaomuchong@tal.com
  • 附件 1 : Algorithms_and_parallel_computing_2011.pdf
  • 2015-08-13 07:47:34, 8.14 M

» 收录本帖的淘帖专辑推荐

书籍下载网站 专业书籍(外文版)WM 专业书籍之数学力学WM BIAOXYZ的电子书专辑~
MATLAB文档 电子书 科研必备 信息安全

» 本帖已获得的红花(最新10朵)

» 猜你喜欢

已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

kkdd00

铁杆木虫 (著名写手)


送红花一朵
谢谢楼主
9楼2015-08-15 18:09:27
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖

seatar

木虫 (著名写手)


★★★★★ 五星级,优秀推荐

41楼2015-12-18 19:25:01
已阅   回复此楼   关注TA 给TA发消息 送TA红花 TA的回帖
简单回复
lin4172楼
2015-08-13 08:13   回复  
五星好评  顶一下,感谢分享!
一维UA3楼
2015-08-13 08:44   回复  
五星好评  顶一下,感谢分享!
xmc1411184楼
2015-08-13 17:15   回复  
五星好评  顶一下,感谢分享!
luom5楼
2015-08-14 06:55   回复  
五星好评  顶一下,感谢分享!
bcbooo6楼
2015-08-14 17:40   回复  
五星好评  顶一下,感谢分享!
2015-08-14 18:02   回复  
五星好评  顶一下,感谢分享!
kkdd008楼
2015-08-15 18:09   回复  
五星好评  顶一下,感谢分享!
lwk20510楼
2015-08-16 07:19   回复  
五星好评  顶一下,感谢分享!
wmnick11楼
2015-08-16 08:50   回复  
五星好评  顶一下,感谢分享!
wyf_199912楼
2015-08-16 09:34   回复  
五星好评  顶一下,感谢分享!
efoxxx13楼
2015-08-16 11:16   回复  
五星好评  顶一下,感谢分享!
2015-08-16 11:25   回复  
五星好评  顶一下,感谢分享!
PLDJINDUI15楼
2015-08-16 13:20   回复  
五星好评  顶一下,感谢分享!
freaglee16楼
2015-08-16 15:09   回复  
五星好评  顶一下,感谢分享!
吠陀17楼
2015-08-16 20:21   回复  
五星好评  顶一下,感谢分享!
everie18楼
2015-08-17 00:22   回复  
五星好评  感谢分享!
muxinjin19楼
2015-08-17 08:13   回复  
五星好评  顶一下,感谢分享!
zym100320楼
2015-08-17 10:13   回复  
五星好评  顶一下,感谢分享!
guo-ch21楼
2015-08-17 10:14   回复  
五星好评  顶一下,感谢分享!
gate_open22楼
2015-08-17 11:00   回复  
五星好评  顶一下,感谢分享!
bingoabs23楼
2015-08-17 12:55   回复  
五星好评  顶一下,感谢分享!
Jeffreyak24楼
2015-08-17 14:43   回复  
五星好评  顶一下,感谢分享!
2015-08-17 16:03   回复  
五星好评  顶一下,感谢分享!
fingerlake26楼
2015-08-17 17:14   回复  
五星好评  顶一下,感谢分享!
2015-08-17 23:43   回复  
五星好评  顶一下,感谢分享!
2015-08-18 07:42   回复  
五星好评  顶一下,感谢分享!
licro29楼
2015-08-18 07:55   回复  
五星好评  顶一下,感谢分享!
dhl2011dj30楼
2015-08-20 13:35   回复  
五星好评  顶一下,感谢分享!
tanqqd16331楼
2015-08-20 17:40   回复  
五星好评  顶一下,感谢分享!
alpha32楼
2015-08-22 09:09   回复  
五星好评  顶一下,感谢分享!
lbcxt33楼
2015-10-16 23:50   回复  
五星好评  顶一下,感谢分享!
张强101634楼
2015-11-04 10:40   回复  
五星好评  顶一下,感谢分享!
张强101635楼
2015-11-04 10:41   回复  
ybiao36楼
2015-11-05 09:59   回复  
五星好评  顶一下,感谢分享!
一终子37楼
2015-11-05 15:58   回复  
五星好评  顶一下,感谢分享!
qeterqeter38楼
2015-11-21 20:21   回复  
五星好评  顶一下,感谢分享!
沧浪浦39楼
2015-12-13 14:48   回复  
一般  顶一下,感谢分享!
chyanting40楼
2015-12-14 17:08   回复  
五星好评  顶一下,感谢分享!
Nanobee42楼
2015-12-30 16:14   回复  
五星好评  顶一下,感谢分享! [ 发自手机版 http://muchong.com/3g ]
mcwu43楼
2016-01-10 22:56   回复  
五星好评  顶一下,感谢分享!
123456mxq44楼
2016-01-18 11:05   回复  
五星好评  顶一下,感谢分享!
zhang-c-j45楼
2016-03-25 10:52   回复  
五星好评  顶一下,感谢分享!
walker203046楼
2016-07-21 22:15   回复  
三星好评  顶一下,感谢分享!
sxau201347楼
2016-12-17 23:49   回复  
五星好评  顶一下,感谢分享!
2017-03-10 16:36   回复  
五星好评  顶一下,感谢分享!
xiaobozou49楼
2017-04-26 15:30   回复  
五星好评  顶一下,感谢分享!
zndx200650楼
2017-09-11 20:44   回复  
五星好评  顶一下,感谢分享!
相关版块跳转 我要订阅楼主 sciencejoy 的主题更新
☆ 无星级 ★ 一星级 ★★★ 三星级 ★★★★★ 五星级
普通表情 高级回复(可上传附件)
信息提示
请填处理意见