Loading [MathJax]/extensions/MathMenu.js
Embedding Read-Only Memory in Spin-Transfer Torque MRAM-Based On-Chip Caches | IEEE Journals & Magazine | IEEE Xplore

Embedding Read-Only Memory in Spin-Transfer Torque MRAM-Based On-Chip Caches


Abstract:

We propose a design technique for embedding read-only memory (ROM) in spin-transfer torque MRAM (STT-MRAM) arrays by adding an extra bit-line in every column of the array...Show More

Abstract:

We propose a design technique for embedding read-only memory (ROM) in spin-transfer torque MRAM (STT-MRAM) arrays by adding an extra bit-line in every column of the array. RAM and ROM data, which can be different, are stored in the same bitcell and the ROM capacity may be as large as the RAM capacity. Furthermore, our proposed ROM-embedding technique is applicable to any resistive memory technology in which the bit-cell topology is identical to that of the STT-MRAM bit-cell. An additional sense amplifier is required in the peripheral circuitry, hence we propose an area-optimized peripheral circuitry to minimize the total area penalty of embedding ROM. Our analysis reveals that the ROM may be embedded in the STT-MRAM array without area overhead and without any penalty in the performance of the memory as RAM. Furthermore, our simulations show that the embedded ROM may be used to accelerate applications that use lookup tables with as much as 30% improvement in instructions per cycle of a processor using ROM-embedded STT-MRAM for its L2 cache.
Page(s): 992 - 1002
Date of Publication: 26 June 2015

ISSN Information:

Funding Agency:

References is not available for this document.

I. Introduction

Many applications, such as digital signal processing, math libraries, and on-chip built-in self-test, use data that are determined at design time and stay constant during runtime (which we refer to as static data). Static data may be stored as lookup tables in on-chip read-only memory (ROM). However, storing large amounts of static data in on-chip ROM incurs significant area and power overheads. An alternative method is used to store static data off-chip. In this case, the processor has to fetch the required static data into on-chip cache during program execution, which leads to performance degradation. The problem is further exacerbated when data for each program running concurrently are mapped to the same cache location (also called cache thrashing). The static data used by a program may need to be evicted from cache as a result and the processor has to fetch the evicted data from off-chip memory again in order to continue program execution. Hence, realizing an on-chip ROM with minimal overhead allows static data to be stored closer to the processor, and may be used to accelerate the execution of applications.

Select All
1.
D. Lee and K. Roy, "Area efficient ROM-embedded SRAM cache", IEEE Trans. Very Large Scale Integr. (VLSI) Syst., vol. 21, no. 9, pp. 1583-1595, Sep. 2013.
2.
Y. Huai, "Spin-transfer torque MRAM (STT-MRAM): Challenges and prospects", AAPPS Bull., vol. 18, no. 6, pp. 33-40, 2008.
3.
H. Yoda et al., "Progress of STT-MRAM technology and the effect on normally-off computing systems", Proc. IEEE Int. Electron Devices Meeting, pp. 11.3.1-11.3.4, Dec. 2012.
4.
D. Lee, X. Fong and K. Roy, "R-MRAM: A ROM-embedded STT MRAM cache", IEEE Electron Device Lett., vol. 34, no. 10, pp. 1256-1258, Oct. 2013.
5.
S. P. Park, S. Gupta, N. Mojumder, A. Raghunathan and K. Roy, "Future cache design using STT MRAMs for improved energy efficiency: Devices circuits and architecture", Proc. 49th ACM/EDAC/IEEE Design Autom. Conf. (DAC), pp. 492-497, Jun. 2012.
6.
X. Fong and K. Roy, "Complimentary polarizers STT-MRAM (CPSTT) for on-chip caches", IEEE Electron Device Lett., vol. 34, no. 2, pp. 232-234, Feb. 2013.
7.
X. Fong, S. K. Gupta, N. N. Mojumder, S. H. Choday, C. Augustine and K. Roy, "KNACK: A hybrid spin-charge mixed-mode simulator for evaluating different genres of spin-transfer torque MRAM bit-cells", Proc. Int. Conf. Simulation Semiconductor Process. Devices, pp. 51-54, Sep. 2011.
8.
C. J. Lin et al., "45 nm low power CMOS logic compatible embedded STT MRAM utilizing a reverse-connection 1 T/1 MTJ cell", Proc. IEEE Int. Electron Devices Meeting (IEDM), pp. 1-4, Dec. 2009.
9.
T. Kishi et al., "Lower-current and fast switching of a perpendicular TMR for high speed and high density spin-transfer-torque MRAM", Proc. IEEE Int. Electron Devices Meeting, pp. 1-4, Dec. 2008.
10.
Object-Oriented MicroMagnetic Framework (OOMMF), Aug. 2014, [online] Available: http://math.nist.gov/oommf.
11.
HSPICE, Aug. 2014, [online] Available: http://www.synopsys.com/Tools/Verification/AMSVerification/CircuitSimulation/HSPICE/.
12.
N. Muralimanohar, R. Balasubramonian and N. Jouppi, "Optimizing NUCA organizations and wiring alternatives for large caches with CACTI 6.0", Proc. 40th Annu. IEEE/ACM Int. Symp. Microarchitecture (MICRO), pp. 3-14, Dec. 2007.
13.
T. Austin, E. Larson and D. Ernst, "SimpleScalar: An infrastructure for computer system modeling", Computer, vol. 35, no. 2, pp. 59-67, Feb. 2002.
14.
J. Harrison, T. Kubaska, S. Story and P. T. Tang, "The computation of transcendental functions on the IA-64 architecture", Intel Technol. J., vol. 4, pp. 234-251, Nov. 1999.
15.
L. Chen, "Pattern classification by assembling small neural networks", Proc. IEEE Int. Joint Conf. Neural Netw., vol. 3, pp. 1947-1952, Jul./Aug. 2005.
16.
H. Qiao, J. Peng, Z.-B. Xu and B. Zhang, "A reference model approach to stability analysis of neural networks", IEEE Trans. Syst. Man Cybern. B Cybern., vol. 33, no. 6, pp. 925-936, Jan. 2003.
17.
S. Razavi and B. A. Tolson, "A new formulation for feedforward neural networks", IEEE Trans. Neural Netw., vol. 22, no. 10, pp. 1588-1598, Oct. 2011.
18.
C. Mead and L. Conway, Introduction to VLSI Systems, Reading, MA, USA:Addison-Wesley, 1980.

Contact IEEE to Subscribe

References

References is not available for this document.