计算机组成与设计

出版社:机械工业出版社
出版日期:2013-2
ISBN:9787111412373
作者:(美)David A. Patterson,(美)John L. Hennessy
页数:688页

章节摘录

版权页:   插图:   Part of the power of the Intel x86 is, the prefixes that can modify the execution ofthe following instruction. One prefix can repeat the following instruction untila counter counts down to 0. Thus, to move data in memory, it would seem thatthe natural instruction sequence is to use move with the repeat prefix to perform 32-bit memory-to-memory moves. An alternative method, which uses the standard instructions found in all computers, is to load the data into the registers and then store the registers back tomemory. This second version of this program, with the code replicated to reduceloop overhead, copies at about 1.5 times faster. A third version, which uses thelarger floating-point registers instead of the integer registers of the x86, copies atabout 2.0 times faster than the complex move instruction. Fallacy: Write in assembly language to obtain the highest performance.At one time compilers for programming languages produced naive instructionsequences; the increasing sophistication of compilers means the gap betweencompiled code and code produced by hand is closing fast. In fact, to competewith current compilers, the assembly language programmer needs to understandthe concepts in Chapters 4 and 5 thoroughly (processor pipelining and memoryhierarchy).

内容概要

David A. Patterson 加州大学伯克利分校计算机科学系教授,美国国家工程研究院院士,IEEE和ACM会士,曾因成功的启发式教育方法被IEEE授予James H. Mulligan,Jr教育奖章。他因为对RISC技术的贡献而荣获1995年IEEE技术成就奖,而在RAID技术方面的成就为他赢得了1999年IEEE Reynold Johnson信息存储奖。2000年他和John L. Hennessy分享了John von Neumann奖。
John L. Hennessy 斯坦福大学校长,IEEE和ACM会士,美国国家工程研究院院士及美国科学艺术研究院院士。Hennessy教授因为在RISC技术方面做出了突出贡献而荣获2001年的Eckert-Mauchly奖章,他也是2001年Seymour Cray计算机工程奖得主,并且和David A. Patterson分享了2000年John von Neumann奖。

书籍目录

1 Computer Abstractions and Technology 1.1 Introduction 1.2 Below Your Program 1.3 Under the Covers 1.4 Performance 1.5 The Power Wall 1.6 The Sea Change: The Switch from Uniprocessors to Multiprocessors 1.7 Real Stuff. Ma:nufacturing and Benchmarking the AMD Opteron X4 1.8 Fallacies and Pitfalls 1.9 Concluding Remarks 1.10 Historical Perspective and Further Reading 1.11 Exercises 2 Instructions:l.anguage of the Computer 2.1 Introduction 2.2 0perations of the Computer Hardware 2.3 0perands of the Computer Hardware 2.4 Signed and Unsigned Numbers 2.5 Representing Instructions in the Computer 2.6 Logical Operations 2.7 Instructions for Making Decisions 2.8 Supporting Procedures in Computer Hardware 2.9 Communicating with People 2.10 MIPS Addressing for 32-Bit Immediates and Addresses 2.11 Parallelism and Instructions: Synchronization 2.12 Translating and Starting a Program 2.13 A C Sort Example to Put It All Together2.14 Arrays versus Pointers 2.15 Advanced Material: Compiling C and Interpreting Java 2.16 Real Stuff:ARM Instructions 2.17 Real Stuff: x86 Instructions 2.18 Fallacies and Pitfalls 2.19 Concluding Remarks 2.20 Historical Perspective and Further Reading 2.21 Exerases 3 Arithmetic for Computers 3.1 Introduction 3.2 Addition and Subtraction 3.3 Multiplication 3.4 Division 3.5 Floating Point 3.6 Parallelism and Computer Arithmetic: Associativity 3.7 Real Stuff: Floating Point in the x86 3.8 Fallacies and Pitfalls 3.9 Concluding Remarks 3.10 Historical Perspective and Further Reading 3.11 Exerases 4 The Processor 4.1 Introduction 4.2 Logic Design Conventions 4.3 Building a Datapath 4.4 A Simple Implementation Scheme 4.5 An Overview of Pipelining 4.6 Pipelined Datapath and Control 4.7 Data Hazards: Forwarding versus Stalling 4.8 Control Hazards 4.9 Exceptions 4.10 Parallelism and Advanced Instruction-Level Parallelism 4.11 Real Stuff the AMD Opteron X4 (Barcelona) Pipeline 4.12 Advanced Topic: an Introduction to Digital Design Using a Hardware Design Language to Describe and Model a Pipeline and More Pipelining Illustrations 4.13 Fallciaes and Pitfalls 4.14 Concluding Remarks 4.15 Historical Perspective and Further Reading 4.16 Exerases 5 Large and Fast: Exploiting Memory Hierarchy 6 Storage and Other I/O Topics 7 Multicores, Multiprocessors, and Clusters A Graphics and Computing GPUs A-2 B Assemblers, Linkers, and the SPIM Simulator B-2 C The Basics of Loglc Design C-2 D Mapping Control to Hardware D-2 e A Survey of RISC Architectures for Desktop, Server, and Embedded Computers E-2 G Glossary G-1 F Further Reading FR-1

编辑推荐

《计算机组成与设计:硬件/软件接口(英文版•第4版)》作者展示了软硬件部件(如算法、编程语言、编译器、指令集系统结构以及处理器的实现)如何影响程序的性能。另外,本版对软硬件的讨论更加深入,并在光盘中为侧重硬件和侧重软件的读者分别提供了相关资料。随书光盘的内容非常丰富,不仅包括第9章、附录、《计算机组成与设计:硬件/软件接口(英文版•第4版)》网站内容、附加习题、术语表、参考文献、索引等,而且还提供了HDL模拟器、MIPS模拟器以及FPGA设计工具等软件。

作者简介

这本最畅销的计算机组成书籍经过全面更新,关注现今发生在计算机体系结构领域的革命性变革:从单处理器发展到多核微处理器,从串行发展到并行。与前几版一样,本书采用了MIPS处理器来展示计算机硬件技术、汇编语言、计算机算术、流水线、存储器层次结构以及I/O等基本功能。此外,本书还包括一些关于ARM和x86体系结构的介绍。
本书特色

涵盖从串行计算到并行计算的革命性变革,新增了关于并行化的一章,并且每章中还有一些强调并行硬件和软件主题的小节。

新增一个由NVIDIA的首席科学家和架构主管撰写的附录,介绍了现代GPU的出现和重要性,首次详细描述了这个针对可视计算进行了优化的高度并行化、多线程、多核的处理器。

描述一种度量多核性能的独特方法——Roofline模型,自带AMD Opteron X4、Intel Xeon 5000、Sun UltraSPARC T2和 IBM Cell的基准测试和分析。

涵盖一些关于闪存和虚拟机的新内容。

提供了大量富有启发性的练习题。

将AMD Opteron X4和Intel Nehalem作为贯穿本书的实例。

用SPEC CPU2006组件更新了所有处理器性能实例。


 计算机组成与设计下载



发布书评

 
 


精彩短评 (总计6条)

  •     内容不多说印刷,纸张都很好。虽然是黑白的不过阅读感觉也很好
  •     这本书呢...还是不错的,就是课程有点麻烦....
  •     书的印刷还好,没有破损,没有像其他人说的那么差,书里的插图丰富,每页都留有足够空白位置做笔记,但为什么不像CSAPP那样双色印刷呢?就算贵一点也没关系
  •     书页太薄了,每一页都能看到反面的内容,看起来太费劲了。
  •     从封面中间开始,有深达30多页的压痕。书送来时有气袋保护,推断不是运输造成;也就是说发货的书就有问题。不厚道!
  •     第一、定位目前已有很多书可以替代本书。例如:Computer systems: A programmer's perspective Introduction to Computing Systems: From Bits and Gates to C and Beyond Computer... 阅读更多
 

外国儿童文学,篆刻,百科,生物科学,科普,初中通用,育儿亲子,美容护肤PDF图书下载,。 零度图书网 

零度图书网 @ 2024