fbpx
Wikipedia

Memory bandwidth

Memory bandwidth is the rate at which data can be read from or stored into a semiconductor memory by a processor. Memory bandwidth is usually expressed in units of bytes/second, though this can vary for systems with natural data sizes that are not a multiple of the commonly used 8-bit bytes.

Memory bandwidth that is advertised for a given memory or system is usually the maximum theoretical bandwidth. In practice the observed memory bandwidth will be less than (and is guaranteed not to exceed) the advertised bandwidth. A variety of computer benchmarks exist to measure sustained memory bandwidth using a variety of access patterns. These are intended to provide insight into the memory bandwidth that a system should sustain on various classes of real applications.

Measurement conventions Edit

There are three different conventions for defining the quantity of data transferred in the numerator of "bytes/second":

  1. The bcopy convention: counts the amount of data copied from one location in memory to another location per unit time. For example, copying 1 million bytes from one location in memory to another location in memory in one second would be counted as 1 million bytes per second. The bcopy convention is self-consistent, but is not easily extended to cover cases with more complex access patterns, for example three reads and one write.
  2. The Stream convention: sums the amount of data that the application code explicitly reads plus the amount of data that the application code explicitly writes.[1] Using the previous 1 million byte copy example, the STREAM bandwidth would be counted as 1 million bytes read plus 1 million bytes written in one second, for a total of 2 million bytes per second. The STREAM convention is most directly tied to the user code, but may not count all the data traffic that the hardware is actually required to perform.
  3. The hardware convention: counts the actual amount of data read or written by the hardware, whether the data motion was explicitly requested by the user code or not. Using the same 1 million byte copy example, the hardware bandwidth on computer systems with a write allocate cache policy would include an additional 1 million bytes of traffic because the hardware reads the target array from memory into cache before performing the stores. This gives a total of 3 million bytes per second actually transferred by the hardware. The hardware convention is most directly tied to the hardware, but may not represent the minimum amount of data traffic required to implement the user's code.
For example, some computer systems have the ability to avoid write allocate traffic using special instructions, leading to the possibility of misleading comparisons of bandwidth based on different amounts of data traffic performed.

Bandwidth computation and nomenclature Edit

The nomenclature differs across memory technologies, but for commodity DDR SDRAM, DDR2 SDRAM, and DDR3 SDRAM memory, the total bandwidth is the product of:

  • Base DRAM clock frequency
  • Number of data transfers per clock: Two, in the case of "double data rate" (DDR, DDR2, DDR3, DDR4) memory.
  • Memory bus (interface) width: Each DDR, DDR2, or DDR3 memory interface is 64 bits wide. Those 64 bits are sometimes referred to as a "line."
  • Number of interfaces: Modern personal computers typically use two memory interfaces (dual-channel mode) for an effective 128-bit bus width.

For example, a computer with dual-channel memory and one DDR2-800 module per channel running at 400 MHz would have a theoretical maximum memory bandwidth of:

400,000,000 clocks per second × 2 lines per clock × 64 bits per line × 2 interfaces =
102,400,000,000 (102.4 billion) bits per second (in bytes, 12,800 MB/s or 12.8 GB/s)

This theoretical maximum memory bandwidth is referred to as the "burst rate," which may not be sustainable.

The naming convention for DDR, DDR2 and DDR3 modules specifies either a maximum speed (e.g., DDR2-800) or a maximum bandwidth (e.g., PC2-6400). The speed rating (800) is not the maximum clock speed, but twice that (because of the doubled data rate). The specified bandwidth (6400) is the maximum megabytes transferred per second using a 64-bit width. In a dual-channel mode configuration, this is effectively a 128-bit width. Thus, the memory configuration in the example can be simplified as: two DDR2-800 modules running in dual-channel mode.

Two memory interfaces per module is a common configuration for PC system memory, but single-channel configurations are common in older, low-end, or low-power devices. Some personal computers and most modern graphics cards use more than two memory interfaces (e.g., four for Intel's LGA 2011 platform and the NVIDIA GeForce GTX 980). High-performance graphics cards running many interfaces in parallel can attain very high total memory bus width (e.g., 384 bits in the NVIDIA GeForce GTX TITAN and 512 bits in the AMD Radeon R9 290X using six and eight 64-bit interfaces respectively).

ECC bits Edit

In systems with error-correcting memory (ECC), the additional width of the interfaces (typically 72 rather than 64 bits) is not counted in bandwidth specifications because the extra bits are unavailable to store user data. ECC bits are better thought of as part of the memory hardware rather than as information stored in that hardware.

See also Edit

References Edit

  1. ^ STREAM Benchmark FAQ: Counting Bytes and FLOPS: http://www.cs.virginia.edu/stream/ref.html#counting

General Edit

  • BSS Random Access Benchmark Performance Evaluation and Optimization of Random Memory Access on Multicores with High Productivity at ACM/IEEE HiPC 2010

External links Edit

  • STREAM Benchmark
  • Memory Buy the Numbers

memory, bandwidth, this, article, needs, additional, citations, verification, please, help, improve, this, article, adding, citations, reliable, sources, unsourced, material, challenged, removed, find, sources, news, newspapers, books, scholar, jstor, february. This article needs additional citations for verification Please help improve this article by adding citations to reliable sources Unsourced material may be challenged and removed Find sources Memory bandwidth news newspapers books scholar JSTOR February 2018 Learn how and when to remove this template message Memory bandwidth is the rate at which data can be read from or stored into a semiconductor memory by a processor Memory bandwidth is usually expressed in units of bytes second though this can vary for systems with natural data sizes that are not a multiple of the commonly used 8 bit bytes Memory bandwidth that is advertised for a given memory or system is usually the maximum theoretical bandwidth In practice the observed memory bandwidth will be less than and is guaranteed not to exceed the advertised bandwidth A variety of computer benchmarks exist to measure sustained memory bandwidth using a variety of access patterns These are intended to provide insight into the memory bandwidth that a system should sustain on various classes of real applications Contents 1 Measurement conventions 2 Bandwidth computation and nomenclature 3 ECC bits 4 See also 5 References 5 1 General 6 External linksMeasurement conventions EditThere are three different conventions for defining the quantity of data transferred in the numerator of bytes second The bcopy convention counts the amount of data copied from one location in memory to another location per unit time For example copying 1 million bytes from one location in memory to another location in memory in one second would be counted as 1 million bytes per second The bcopy convention is self consistent but is not easily extended to cover cases with more complex access patterns for example three reads and one write The Stream convention sums the amount of data that the application code explicitly reads plus the amount of data that the application code explicitly writes 1 Using the previous 1 million byte copy example the STREAM bandwidth would be counted as 1 million bytes read plus 1 million bytes written in one second for a total of 2 million bytes per second The STREAM convention is most directly tied to the user code but may not count all the data traffic that the hardware is actually required to perform The hardware convention counts the actual amount of data read or written by the hardware whether the data motion was explicitly requested by the user code or not Using the same 1 million byte copy example the hardware bandwidth on computer systems with a write allocate cache policy would include an additional 1 million bytes of traffic because the hardware reads the target array from memory into cache before performing the stores This gives a total of 3 million bytes per second actually transferred by the hardware The hardware convention is most directly tied to the hardware but may not represent the minimum amount of data traffic required to implement the user s code For example some computer systems have the ability to avoid write allocate traffic using special instructions leading to the possibility of misleading comparisons of bandwidth based on different amounts of data traffic performed dd Bandwidth computation and nomenclature EditThe nomenclature differs across memory technologies but for commodity DDR SDRAM DDR2 SDRAM and DDR3 SDRAM memory the total bandwidth is the product of Base DRAM clock frequency Number of data transfers per clock Two in the case of double data rate DDR DDR2 DDR3 DDR4 memory Memory bus interface width Each DDR DDR2 or DDR3 memory interface is 64 bits wide Those 64 bits are sometimes referred to as a line Number of interfaces Modern personal computers typically use two memory interfaces dual channel mode for an effective 128 bit bus width For example a computer with dual channel memory and one DDR2 800 module per channel running at 400 MHz would have a theoretical maximum memory bandwidth of 400 000 000 clocks per second 2 lines per clock 64 bits per line 2 interfaces 102 400 000 000 102 4 billion bits per second in bytes 12 800 MB s or 12 8 GB s This theoretical maximum memory bandwidth is referred to as the burst rate which may not be sustainable The naming convention for DDR DDR2 and DDR3 modules specifies either a maximum speed e g DDR2 800 or a maximum bandwidth e g PC2 6400 The speed rating 800 is not the maximum clock speed but twice that because of the doubled data rate The specified bandwidth 6400 is the maximum megabytes transferred per second using a 64 bit width In a dual channel mode configuration this is effectively a 128 bit width Thus the memory configuration in the example can be simplified as two DDR2 800 modules running in dual channel mode Two memory interfaces per module is a common configuration for PC system memory but single channel configurations are common in older low end or low power devices Some personal computers and most modern graphics cards use more than two memory interfaces e g four for Intel s LGA 2011 platform and the NVIDIA GeForce GTX 980 High performance graphics cards running many interfaces in parallel can attain very high total memory bus width e g 384 bits in the NVIDIA GeForce GTX TITAN and 512 bits in the AMD Radeon R9 290X using six and eight 64 bit interfaces respectively ECC bits EditMain article ECC memory In systems with error correcting memory ECC the additional width of the interfaces typically 72 rather than 64 bits is not counted in bandwidth specifications because the extra bits are unavailable to store user data ECC bits are better thought of as part of the memory hardware rather than as information stored in that hardware See also EditCAS latency Dynamic random access memory List of device bandwidths Memory latency Memory timings Random access memoryReferences Edit STREAM Benchmark FAQ Counting Bytes and FLOPS http www cs virginia edu stream ref html counting General Edit BSS Random Access Benchmark Performance Evaluation and Optimization of Random Memory Access on Multicores with High Productivity at ACM IEEE HiPC 2010External links EditSTREAM Benchmark Memory Buy the Numbers Retrieved from https en wikipedia org w index php title Memory bandwidth amp oldid 1175270738, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.