US9588881B2 - Stack processor using a ferroelectric random access memory (F-RAM) for code space and a portion of the stack memory space having an instruction set optimized to minimize processor stack accesses

Info

Publication number
US9588881B2
Authority
US
United States
Prior art keywords
stack
data
processor
top portion
return
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
US13/467,874
Other versions
US20120297120A1 (en)
Inventor
Franck Fillere
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Morgan Stanley Senior Funding Inc
Original Assignee
Cypress Semiconductor Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Cypress Semiconductor Corp
Priority to US13/467,874
Assigned to RAMTRON INTERNATIONAL CORPORATION. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: FILLERE, FRANCK
Priority to CN201210154918.2A (CN102880446B)
Publication of US20120297120A1
Assigned to CYPRESS SEMICONDUCTOR CORPORATION. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: RAMTRON INTERNATIONAL CORPORATION
Assigned to MORGAN STANLEY SENIOR FUNDING, INC. SECURITY INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: CYPRESS SEMICONDUCTOR CORPORATION, SPANSION LLC
Publication of US9588881B2
Application granted
Assigned to MUFG UNION BANK, N.A. ASSIGNMENT AND ASSUMPTION OF SECURITY INTEREST IN INTELLECTUAL PROPERTY. Assignors: MORGAN STANLEY SENIOR FUNDING, INC.
Assigned to MORGAN STANLEY SENIOR FUNDING, INC. CORRECTIVE ASSIGNMENT TO CORRECT THE 8647899 PREVIOUSLY RECORDED ON REEL 035240 FRAME 0429. ASSIGNOR(S) HEREBY CONFIRMS THE SECURITY INTERST. Assignors: CYPRESS SEMICONDUCTOR CORPORATION, SPANSION LLC
Assigned to CYPRESS SEMICONDUCTOR CORPORATION, SPANSION LLC. RELEASE BY SECURED PARTY (SEE DOCUMENT FOR DETAILS). Assignors: MUFG UNION BANK, N.A.

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F12/00 Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02 Addressing or allocation; Relocation
    • G06F12/0223 User address space allocation, e.g. contiguous or non contiguous base addressing
    • G06F12/023 Free address space management
    • G06F12/0238 Memory management in non-volatile memory, e.g. resistive RAM or ferroelectric memory
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F12/00 Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02 Addressing or allocation; Relocation
    • G06F12/0223 User address space allocation, e.g. contiguous or non contiguous base addressing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F2212/00 Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
    • G06F2212/10 Providing a specific technical effect
    • G06F2212/1016 Performance improvement
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F2212/00 Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
    • G06F2212/10 Providing a specific technical effect
    • G06F2212/1028 Power efficiency
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F2212/00 Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
    • G06F2212/17 Embedded application
    • G06F2212/175 Industrial control system
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F2212/00 Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
    • G06F2212/17 Embedded application
    • G06F2212/178 Electronic token or RFID
    • Y02B60/1225
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00 Energy efficient computing, e.g. low power processors, power management or thermal management

Definitions

  • the present invention relates, in general, to the field of customizable integrated circuit devices incorporating non-volatile memory. More particularly, the present invention relates to a stack processor and method implemented using a ferroelectric random access memory (F-RAM) for code and a portion of the stack memory space having an instruction set optimized to minimize processor stack accesses.
  • F-RAM ferroelectric random access memory
  • Nonvolatile memory technologies include, among others, electrically erasable programmable read only memory (EEPROM) and Flash memory.
  • EEPROM electrically erasable programmable read only memory
  • Flash memory Flash memory
  • the endurance rate of Flash memory is still multiple orders of magnitude below the endurance rate of F-RAM. Therefore, for applications using Flash memory that require high endurance, some products will actually include a large Flash memory array with the associated user/program ensuring that data is stored in specific memory locations (e.g. banks of memory). Once a memory bank approaches its endurance limit, the user/program would then enable the movement of all data to a new memory bank, marking the previous memory bank as worn out and indicating it should not be used again.
  • the typical endurance of such floating gate devices is approximately between 100,000 to 1 million write cycles.
  • a currently implemented architecture for a stack processor is the b16 Processor described in Paysan, B., “A Forth Processor in an FPGA”, Feb. 2, 2003; Paysan, B., “b16—small—Less is More”, Jul. 9, 2006; and Paysan, B., “b16: Modern Processor Core”, Apr. 29, 2005 and disclosed at http://www.jwdt.com/~paysan/b16.html.
  • the b16 stack based processor has the top of the stacks maintained in volatile registers and the bottom of the stacks in two complementary metal oxide semiconductor (CMOS) memories.
  • CMOS complementary metal oxide semiconductor
  • a stack processor architecture which provides for maintaining the stacks in volatile memory would cause it to suffer from very long and power demanding power-down times as the contents of a relatively large number of registers would have to be saved to nonvolatile memory on power-down. Placing some of the registers in nonvolatile Flash memory in an attempt to ameliorate this situation would, of course, lead to the endurance issues inherent in Flash. Still further, a stack processor architecture which utilizes a different memory type for code and the stacks would suffer from high power consumption peaks since all of the memories are likely to be accessed simultaneously in normal operation.
  • each 16 bit word is mapped as three, 5 bit instructions and one extra 1 bit instruction which can only be a “no operation” (NOP) or CALL.
  • NOP no operation
  • CALL CALL
  • the fourth instruction will generally be a NOP and the instruction set is, therefore, wasting one bit per word along with a clock cycle (needed to execute the NOP) every three instructions.
  • the b16 stack processor does not share code and data space so its architecture is even more power demanding as it can access code space and the data and return stacks all simultaneously.
  • Customizable integrated circuit devices will generally require the provision of some sort of processing unit on-chip to enable the device to execute different functions depending on a user defined program.
  • One example market for customizable devices is in the field of metering functions which require higher endurance cycles than can reasonably be supplied by nonvolatile memory technologies such as Flash.
  • Another possible market is radio frequency identification (RFID) in which low power memory cells are required.
  • RFID radio frequency identification
  • the stack-based processor has all of its stacks stored in F-RAM memory.
  • the time for saving all critical registers to memory is also minimized in that all registers are already maintained in non-volatile F-RAM per se.
  • the fast write times and low power access inherent in F-RAM provide many advantages while the use of a single, unique memory type on-chip affords easier hardware management, less intellectual property (IP) costs and also results in a smaller on-chip die area.
  • IP intellectual property
  • the stack-based processor has some, but not all, of its stacks stored in non-volatile F-RAM memory and the remainder in volatile memory.
  • read/write operations to only F-RAM would be obviated.
  • F-RAM memory requires relatively low power compared to other non-volatile memory technologies, an access to volatile memory instead could provide at least some power advantages.
  • read/write accesses to F-RAM memory only can somewhat slow down program execution and being able to access the stack memories in parallel to code execution would result in certain speed advantages. This is particularly relevant to RFID applications wherein power is only available for a relatively short, finite time when the card enters the RF field of the associated controller.
  • processor implementation utilizing an instruction set optimized to minimize memory fetches and thus minimize program execution time.
  • This provides a number of distinct advantages including various low power applications and those in which the power supply is only available for a finite period of time such as RFID implementations.
  • the use of a processor with such an optimized instruction set also allows the architecture of the present invention to be instantiated across multiple devices to support a wide range of possible operations and differing numbers of peripherals by supporting interrupts as will be more fully described hereinafter.
  • This particular embodiment of the present invention provides an instruction set for a stack processor architecture that is relatively small so that the associated logic is itself not too large thereby reducing current leakage and dynamic power requirements. Also disclosed is a complete set of instructions so that a multitude of possible applications can be supported with a program execution time that is not too long. Further, the instruction set disclosed is neither too complex nor too light. That is, there are not too many F-RAM memory fetches and nor is there too much logic leakage. As disclosed, the instruction set of the present invention supports the basic Forth commands as well as other operation code for generic operations while also supporting interrupts.
  • a method for operating a stack processor in an integrated circuit device which comprises coupling a non-volatile memory to the stack processor for storing program code and at least a portion of the stack memory space and also coupling a volatile memory to the stack processor for storing another portion of the stack memory space.
  • the stack memory space comprises data and return stacks and the bottom portions of the data and return stacks are maintained in the non-volatile memory and the top portions are maintained in the volatile memory.
  • FIG. 1 is a functional block diagram of a stack processor using a ferroelectric random access memory (F-RAM) for both code and data space in accordance with a particular representative embodiment of the present invention
  • FIG. 2 is a follow-on functional block diagram of a portion of a stack processor utilizing both F-RAM and CMOS memory in accordance with another particular representative embodiment of the present invention
  • FIGS. 3A through 3D inclusive are exemplary illustrations of an instruction for the copying of the top of the data stack (“T”) into the top of a return stack (“R”);
  • FIG. 4 is an exemplary illustration of a method in accordance with the representative embodiment of the present invention illustrated in FIG. 2 wherein the CMOS registers are only updated when required and unused portions of the CMOS stack are not updated to conserve power;
  • FIG. 5 is an exemplary illustration of a 16 bit instruction set in accordance with a particular embodiment of the present invention which has been optimized to minimize memory fetches;
  • FIG. 6 is an exemplary table of possible operating code mappings for the Forth core of the preceding FIGS. 1 and 2 .
  • In FIG. 1, a functional block diagram of a stack processor 100 using a ferroelectric random access memory (F-RAM) for both code and data space in accordance with a particular representative embodiment of the present invention is shown.
  • the stack processor 100 comprises, in pertinent part, a F-RAM memory array 102 and associated processor 104 .
  • the F-RAM memory array 102 may be of the type provided by Ramtron International Corporation, Colorado Springs, Colo., assignee of the present invention.
  • the processor 104 as shown, may comprise a Forth core 106 developed by Ramtron International Corporation, assignee of the present invention.
  • An associated interrupt controller 108 forms a portion of the processor 104 which also operates in conjunction with a clock reset circuit 110 as shown.
  • eight modules respectively labeled as 112₁ through 112₈ inclusive are associated with the stack processor 100 .
  • a single code/data bus couples the Forth core 106 to the F-RAM memory array 102 and comprises memory read and write lines (mem_rd, mem_wr) and 16 bit memory address, memory write data and memory read data buses (mem_address, mem_wr_data, mem_rd_data) as illustrated.
  • the clock reset circuit 110 provides a clock interrupt signal (clk_int) to the interrupt controller 108 as well as a reset n (rst_n) signal to both the Forth core 106 and interrupt controller 108 .
  • the clock reset circuit 110 also provides a core clock signal (clk_core) to the Forth Core 106 and receives a core_need_clock therefrom.
  • the interrupt controller 108 provides an interrupt signal (int) and a 3 bit int_nb signal to the Forth core 106 and receives an interrupt clear (int_clr) and ongoing interrupt (int_ongoing) signals therefrom.
  • In FIG. 2, a follow-on functional block diagram of a portion of a stack processor 200 utilizing both F-RAM and CMOS memory in accordance with another particular representative embodiment of the present invention is shown. As indicated, there is only one processor code/data bus, and it goes to the F-RAM memory array 202 only.
  • the stack processor 200 comprises a Forth Core 204 having a data stack 210 and a return stack 212 while the F-RAM memory array 202 has a corresponding data stack bottom 206 and a return stack bottom 208 .
  • the data stack 210 has its top part in the processor core 204 in volatile CMOS registers in this case.
  • the bottom of the data stack 206 is in the nonvolatile F-RAM memory array 202 .
  • the top of the return stack 212 is in the processor core 204 in volatile CMOS registers 212 and the bottom of the return stack 208 is in the F-RAM memory array 202 . Therefore, if a program only needs to modify the top of the data stack with no push/pop to the F-RAM memory array 202 stack, only CMOS register accesses will be involved. This would result in overall lower power consumption. If a power down were to occur, there would then only be the contents of a very limited number of CMOS registers to save before a power loss occurs.
  • In FIGS. 3A through 3D inclusive, exemplary illustrations of an instruction for the copying of the top of the data stack (“T”) into the top of a return stack (“R”) are shown.
  • the proposed stack access method illustrated keeps track of where the stack pointer is relative to the boundary between the CMOS and F-RAM portions of the stack. As illustrated, a data transfer between the data and return stacks can result in four possible initial configurations. The shaded boxes indicate that there is valid data in this stack memory address.
  • In FIG. 3A, the data AND return stacks have not passed the F-RAM boundary.
  • In FIG. 3B, the data stack ONLY has passed the F-RAM boundary.
  • In FIG. 3C, the return stack ONLY has passed the F-RAM boundary and in FIG. 3D, both the data and return stacks have passed the F-RAM boundary.
  • the proposed algorithm performs or does not perform a stack pop/push and for the specific instruction outlined above, the algorithm would do the following:
  • In FIG. 4, an exemplary illustration of a method in accordance with the representative embodiment of the present invention illustrated in FIG. 2 is shown wherein the CMOS registers are only updated when required and unused portions of the CMOS stack are not updated to conserve power.
  • the third CMOS register bus of the stack has not been updated or reset and since it is now in the unused part of the stack its current value is “don't care”. As can be determined, not updating that register bus allows for the saving of some more power.
  • In FIG. 5, an exemplary illustration of a 16 bit instruction set in accordance with a particular embodiment of the present invention is shown which has been optimized to minimize memory fetches.
  • the disclosed instruction set is based on a 16 bit memory code space.
  • the following possibilities may be examined:
  • each instruction can be provided while minimizing the waste of memory bits.
  • the 16 bit word MSB is a bit used by each instruction, therefore each instruction is 6 bits wide (which gives a total of 64 instructions maximum). As a consequence, the full 16 bits of code are used.
  • In FIG. 6, an exemplary table of possible operating code (“opcode”) mappings for the Forth core of the preceding FIGS. 1 and 2 is shown.
  • the Forth core instructions are grouped by the categories of: Branching, Memory Access, Logic, Arithmetic and Data Stack and those opcodes are shown which share the same five lower significant bits and differ only by the one most significant group bit as shown in the preceding FIG. 5 .
  • the terms “comprises”, “comprising”, or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a recitation of certain elements does not necessarily include only those elements but may include other elements not expressly recited or inherent to such process, method, article or apparatus. None of the description in the present application should be read as implying that any particular element, step, or function is an essential element which must be included in the claim scope and THE SCOPE OF THE PATENTED SUBJECT MATTER IS DEFINED ONLY BY THE CLAIMS AS ALLOWED. Moreover, none of the appended claims are intended to invoke paragraph six of 35 U.S.C. Sect. 112 unless the exact phrase “means for” is employed and is followed by a participle.
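The 16 bit instruction word format described in the definitions above (a most significant bit shared by every instruction in the word, giving three 6 bit opcodes and at most 64 instructions) can be illustrated with a minimal sketch. The function names and the left-to-right ordering of the three 5 bit fields are assumptions made for illustration only; the text states only that the MSB is shared and that opcodes in a word differ in their five lower significant bits.

```python
def decode_word(word):
    """Split a 16 bit word into three 6 bit opcodes that share the word's MSB.

    Assumed layout (hypothetical field order): bit 15 is the shared group
    bit, bits 14..10, 9..5 and 4..0 are the three 5 bit fields.
    """
    if not 0 <= word <= 0xFFFF:
        raise ValueError("expected a 16 bit word")
    group = (word >> 15) & 0x1
    fields = [(word >> shift) & 0x1F for shift in (10, 5, 0)]
    # Each opcode is the group bit followed by its own five lower bits.
    return [(group << 5) | field for field in fields]


def encode_word(opcodes):
    """Pack three 6 bit opcodes into one word; all must share the same MSB."""
    if len(opcodes) != 3 or any(not 0 <= op < 64 for op in opcodes):
        raise ValueError("expected three opcodes in the range 0..63")
    groups = {op >> 5 for op in opcodes}
    if len(groups) != 1:
        raise ValueError("opcodes packed in one word must share the group bit")
    word = groups.pop() << 15
    for shift, op in zip((10, 5, 0), opcodes):
        word |= (op & 0x1F) << shift
    return word
```

Because 1 + 3 × 5 = 16, no bit of the word is left unused, which is the property the text contrasts with a layout that reserves a slot for a NOP.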

Abstract

A stack processor and method implemented using a ferroelectric random access memory (F-RAM) for code and a portion of the stack memory space having an instruction set optimized to minimize processor stack accesses and thus minimize program execution time. This is particularly advantageous in low power applications and those in which the power supply is only available for a finite period of time such as RFID implementations. Disclosed herein is a relatively small but complete set of instructions enabling a multitude of possible applications to be supported with a program execution time that is not too long.

Description

RELATED APPLICATION
The present invention claims priority from U.S. Provisional Application No. 61/486,652, filed May 16, 2011, and is related to U.S. patent application Ser. No. 13/467,831 “Stack Processor Using a Ferroelectric Random Access Memory (F-RAM) for Both Code and Data Space”; Ser. No. 13/467,849 for “Stack Processor Using a Ferroelectric Random Access Memory (F-RAM) for Code Space and a Portion of the Stack Memory Space”; and Ser. No. 13/467,816 for “Stack Processor Using a Ferroelectric Random Access Memory (F-RAM) Having an Instruction Set Optimized to Minimize Memory Fetch Operations” all filed on even date herewith. The disclosures of the aforementioned applications are herein specifically incorporated by reference in their entirety.
BACKGROUND OF THE INVENTION
The present invention relates, in general, to the field of customizable integrated circuit devices incorporating non-volatile memory. More particularly, the present invention relates to a stack processor and method implemented using a ferroelectric random access memory (F-RAM) for code and a portion of the stack memory space having an instruction set optimized to minimize processor stack accesses.
Current nonvolatile memory technologies include, among others, electrically erasable programmable read only memory (EEPROM) and Flash memory. Despite continuing improvements to this technology, the endurance rate of Flash memory is still multiple orders of magnitude below the endurance rate of F-RAM. Therefore, for applications using Flash memory that require high endurance, some products will actually include a large Flash memory array with the associated user/program ensuring that data is stored in specific memory locations (e.g. banks of memory). Once a memory bank approaches its endurance limit, the user/program would then enable the movement of all data to a new memory bank, marking the previous memory bank as worn out and indicating it should not be used again. The typical endurance of such floating gate devices is approximately between 100,000 and 1 million write cycles.
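The bank rotation scheme described above can be sketched as a toy model. The class name, the per-bank write counter, and the simplification that a migration costs a single write cycle on the new bank are all assumptions for illustration; they are not taken from any particular Flash product.

```python
class BankedStore:
    """Toy model of moving data to a fresh bank when the active one wears out."""

    def __init__(self, num_banks, endurance_limit):
        self.endurance_limit = endurance_limit
        self.write_counts = [0] * num_banks   # write cycles seen by each bank
        self.worn_out = [False] * num_banks   # banks that must not be reused
        self.active = 0                       # index of the bank in use
        self.data = {}                        # contents of the active bank

    def write(self, key, value):
        # Retire the active bank once it has reached its endurance limit.
        if self.write_counts[self.active] >= self.endurance_limit:
            self._migrate()
        self.data[key] = value
        self.write_counts[self.active] += 1

    def _migrate(self):
        self.worn_out[self.active] = True
        next_bank = self.active + 1
        if next_bank >= len(self.write_counts):
            raise RuntimeError("all banks are worn out")
        # Simplification: copying the data counts as one write cycle.
        self.write_counts[next_bank] += 1
        self.active = next_bank
```

The sketch makes the cost of the approach visible: the array must be several times larger than the data it holds, and the software has to track wear itself.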
It is also well known that writes to EEPROM and Flash are relatively slow compared to writes to F-RAM. While an F-RAM write cycle is completed almost immediately, EEPROM and Flash writes take meaningfully longer. Still further, writes to an F-RAM memory cell occur at a relatively low voltage and very little current is required to change the data in the cell.
A currently implemented architecture for a stack processor is the b16 Processor described in Paysan, B., “A Forth Processor in an FPGA”, Feb. 2, 2003; Paysan, B., “b16—small—Less is More”, Jul. 9, 2006; and Paysan, B., “b16: Modern Processor Core”, Apr. 29, 2005 and disclosed at http://www.jwdt.com/~paysan/b16.html. The b16 stack based processor has the top of the stacks maintained in volatile registers and the bottom of the stacks in two complementary metal oxide semiconductor (CMOS) memories. Such an architecture will lead to the possibility of the data and return stacks and code space being accessed simultaneously. Moreover, a stack processor architecture which provides for maintaining the stacks in volatile memory would cause it to suffer from very long and power demanding power-down times as the contents of a relatively large number of registers would have to be saved to nonvolatile memory on power-down. Placing some of the registers in nonvolatile Flash memory in an attempt to ameliorate this situation would, of course, lead to the endurance issues inherent in Flash. Still further, a stack processor architecture which utilizes a different memory type for code and the stacks would suffer from high power consumption peaks since all of the memories are likely to be accessed simultaneously in normal operation.
In the b16 stack processor, each 16 bit word is mapped as three 5 bit instructions and one extra 1 bit instruction which can only be a “no operation” (NOP) or CALL. In practice, the fourth instruction will in most cases be a NOP, so the instruction set wastes one bit per word along with a clock cycle (needed to execute the NOP) every three instructions. Still further, the b16 stack processor does not share code and data space so its architecture is even more power demanding as it can access code space and the data and return stacks all simultaneously.
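The arithmetic behind this observation can be sketched as follows. The bit positions of the slots, the encoding of the 1 bit slot (0 for NOP, 1 for CALL), and the one-clock-cycle-per-slot cost are assumptions chosen for illustration; the text above specifies only the slot widths.

```python
def decode_b16_word(word):
    """Split a 16 bit word into three 5 bit slots and one 1 bit slot.

    Assumed layout (hypothetical): bits 15..11, 10..6 and 5..1 hold the
    three 5 bit instructions, and bit 0 holds the 1 bit slot.
    """
    if not 0 <= word <= 0xFFFF:
        raise ValueError("expected a 16 bit word")
    slots = [(word >> shift) & 0x1F for shift in (11, 6, 1)]
    one_bit_slot = "CALL" if word & 0x1 else "NOP"
    return slots, one_bit_slot


def nop_overhead(words):
    """Fraction of slot cycles spent on NOPs in the 1 bit slot.

    Assumes one clock cycle per slot, so four cycles per word.
    """
    total_cycles = 4 * len(words)
    nop_cycles = sum(1 for w in words if decode_b16_word(w)[1] == "NOP")
    return nop_cycles / total_cycles
```

Under these assumptions, a program whose 1 bit slots are all NOPs spends one cycle in four doing no useful work, which is the inefficiency the passage describes.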
SUMMARY OF THE INVENTION
Customizable integrated circuit devices will generally require the provision of some sort of processing unit on-chip to enable the device to execute different functions depending on a user defined program. One example market for customizable devices is in the field of metering functions which require higher endurance cycles than can reasonably be supplied by nonvolatile memory technologies such as Flash. Another possible market is radio frequency identification (RFID) in which low power memory cells are required.
Currently many companies also intend to develop products that operate in an environment where power supply voltage can, at times, be lost. In such applications, a need exists to be able to save current data quickly into non-volatile storage before power-down occurs.
To this end, a need also exists for an integrated circuit architecture that embodies a processor in conjunction with associated F-RAM memory. As power-down time in the event of a supply voltage disruption should be as quick as possible, it is advantageous to have as much data as possible in non-volatile memory. Stack based processors are particularly useful in such an application inasmuch as most of the registers they manipulate are in the stack.
Therefore, in using a stack-based processor, if some or all of the stacks are in F-RAM memory, most of the data is already in non-volatile memory at any given time. Even if some of the data is not already stored in F-RAM memory, the very fast write time of that memory technology also allows for the rapid back-up of the data. Moreover, such an architecture can be readily customized through the changing of the processor program allowing a wide range of potential applications such as metering functions, RFID and the like. Further, inherent in the operation of F-RAM memory is its low leakage and dynamic power requirements. The architecture disclosed herein also results in less routing congestion as there is only one code/data bus employed and it communicates with only one memory in a particular embodiment of the present invention.
In a particular embodiment of the architecture of the present invention disclosed herein the stack-based processor has all of its stacks stored in F-RAM memory. This presents the advantages of easy stack pointer management inasmuch as the stack pointer is itself a memory address. The time for saving all critical registers to memory is also minimized in that all registers are already maintained in non-volatile F-RAM per se. As previously mentioned the fast write times and low power access inherent in F-RAM provide many advantages while the use of a single, unique memory type on-chip affords easier hardware management, less intellectual property (IP) costs and also results in a smaller on-chip die area.
In accordance with another particular embodiment of the architecture of the present invention disclosed herein the stack-based processor has some, but not all, of its stacks stored in non-volatile F-RAM memory and the remainder in volatile memory. By storing some of the associated stacks in complementary metal oxide semiconductor (CMOS) or other volatile memory, the need to direct every stack read/write operation to F-RAM is obviated. Even though F-RAM memory requires relatively low power compared to other non-volatile memory technologies, an access to volatile memory instead could provide at least some power advantages. Further, restricting read/write accesses to F-RAM memory only can somewhat slow down program execution, and being able to access the stack memories in parallel with code execution would result in certain speed advantages. This is particularly relevant to RFID applications wherein power is only available for a relatively short, finite time when the card enters the RF field of the associated controller.
Implementing a stack processor with this type of architecture represents a compromise. An all F-RAM memory implementation provides a quicker power-down time and a smaller on-chip die area but relatively slower, more power consuming program execution, whereas the partial F-RAM implementation has a somewhat longer power-down time and a larger on-chip die area but faster, less power consuming program execution. Having the processor stack only partially maintained in F-RAM memory also has the advantage that, if the program compiler makes efficient use of this particular architecture, it can generate firmware code that will tend to concentrate the more intensive calculations to that part of the stack that is in volatile memory and minimize POP/PUSH operations to the F-RAM portion of the stack. Such an application will result in faster program execution through parallel access to the volatile registers, which also consume less power than F-RAM memory. Moreover, since only the top of the stack is in volatile memory, most of it remains in F-RAM which means the application can still benefit from the high F-RAM endurance and shorter power-down times.
In accordance with yet another embodiment of the architecture of the present invention disclosed herein is a particular processor implementation utilizing an instruction set optimized to minimize memory fetches and thus minimize program execution time. This provides a number of distinct advantages including various low power applications and those in which the power supply is only available for a finite period of time such as RFID implementations. The use of a processor with such an optimized instruction set also allows the architecture of the present invention to be instantiated across multiple devices to support a wide range of possible operations and differing numbers of peripherals by supporting interrupts as will be more fully described hereinafter.
This particular embodiment of the present invention provides an instruction set for a stack processor architecture that is relatively small so that the associated logic is itself not too large, thereby reducing current leakage and dynamic power requirements. Also disclosed is a complete set of instructions so that a multitude of possible applications can be supported with a program execution time that is not too long. Further, the instruction set disclosed is neither too complex nor too light. That is, there are neither too many F-RAM memory fetches nor too much logic leakage. As disclosed, the instruction set of the present invention supports the basic Forth commands as well as other operation codes for generic operations while also supporting interrupts.
In accordance with still another embodiment of the architecture of the present invention disclosed herein is a particular method of operation which minimizes processor stack accesses when the associated stack is only partially implemented in F-RAM memory. The minimization of F-RAM stack accesses in conjunction with the particular processor described and disclosed herein results in less power being consumed as well as overall faster program execution than would be the case were all of the stack held in non-volatile F-RAM memory, in which instance the code could not be read while accessing the F-RAM stack.
Specifically disclosed herein is a method for operating a stack processor in an integrated circuit device which comprises coupling a non-volatile memory to the stack processor for storing program code and at least a portion of the stack memory space and also coupling a volatile memory to the stack processor for storing another portion of the stack memory space. The stack memory space comprises data and return stacks and the bottom portions of the data and return stacks are maintained in the non-volatile memory and the top portions are maintained in the volatile memory.
BRIEF DESCRIPTION OF THE DRAWINGS
The aforementioned and other features and objects of the present invention and the manner of attaining them will become more apparent and the invention itself will be best understood by reference to the following description of a preferred embodiment taken in conjunction with the accompanying drawings, wherein:
FIG. 1 is a functional block diagram of a stack processor using a ferroelectric random access memory (F-RAM) for both code and data space in accordance with a particular representative embodiment of the present invention;
FIG. 2 is a follow-on functional block diagram of a portion of a stack processor utilizing both F-RAM and CMOS memory in accordance with another particular representative embodiment of the present invention;
FIGS. 3A through 3D inclusive are exemplary illustrations of an instruction for the copying of the top of the data stack (“T”) into the top of a return stack (“R”);
FIG. 4 is an exemplary illustration of a method in accordance with the representative embodiment of the present invention illustrated in FIG. 2 wherein the CMOS registers are only updated when required and unused portions of the CMOS stack are not updated to conserve power;
FIG. 5 is an exemplary illustration of a 16 bit instruction set in accordance with a particular embodiment of the present invention which has been optimized to minimize memory fetches; and
FIG. 6 is an exemplary table of possible operating code mappings for the Forth core of the preceding FIGS. 1 and 2.
DESCRIPTION OF A REPRESENTATIVE EMBODIMENT
With reference now to FIG. 1, a functional block diagram of a stack processor 100 using a ferroelectric random access memory (F-RAM) for both code and data space in accordance with a particular representative embodiment of the present invention is shown. The stack processor 100 comprises, in pertinent part, a F-RAM memory array 102 and associated processor 104. The F-RAM memory array 102 may be of the type provided by Ramtron International Corporation, Colorado Springs, Colo., assignee of the present invention. The processor 104, as shown, may comprise a Forth core 106 developed by Ramtron International Corporation, assignee of the present invention.
An associated interrupt controller 108 forms a portion of the processor 104 which also operates in conjunction with a clock reset circuit 110 as shown. In the representative embodiment illustrated, eight modules (Module 1 through Module 8) respectively labeled as 112 1 through 112 8 inclusive are associated with the stack processor 100. A single code/data bus couples the Forth core 106 to the F-RAM memory array 102 and comprises memory read and write lines (mem_rd, mem_wr) and 16 bit memory address, memory write data and memory read data buses (mem_address, mem_wr_data, mem_rd_data) as illustrated.
The clock reset circuit 110 provides a clock interrupt signal (clk_int) to the interrupt controller 108 as well as a reset (rst_n) signal to both the Forth core 106 and interrupt controller 108. The clock reset circuit 110 also provides a core clock signal (clk_core) to the Forth core 106 and receives a core_need_clock signal therefrom. The interrupt controller 108 provides an interrupt signal (int) and a 3 bit int_nb signal to the Forth core 106 and receives interrupt clear (int_clr) and ongoing interrupt (int_ongoing) signals therefrom.
With reference additionally now to FIG. 2, a follow-on functional block diagram of a portion of a stack processor 200 utilizing both F-RAM and CMOS memory in accordance with another particular representative embodiment of the present invention is shown. As indicated, there is only one processor code/data bus, and it goes to the F-RAM memory array 202 only. The stack processor 200 comprises a Forth Core 204 having a data stack 210 and a return stack 212 while the F-RAM memory array 202 has a corresponding data stack bottom 206 and a return stack bottom 208.
In this figure the organization of an embodiment of the stack processor 200 is shown. The data stack 210 has its top part in the processor core 204 in volatile CMOS registers in this case. The bottom of the data stack 206 is in the nonvolatile F-RAM memory array 202. In the same manner, the top of the return stack 212 is in the processor core 204 in volatile CMOS registers 212 and the bottom of the return stack 208 is in the F-RAM memory array 202. Therefore, if a program only needs to modify the top of the data stack with no push/pop to the F-RAM memory array 202 stack, only CMOS register accesses will be involved. This would result in overall lower power consumption. If a power down were to occur, there would then only be the contents of a very limited number of CMOS registers to save before a power loss occurs.
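The effect of this split organization on F-RAM traffic can be illustrated with a small software model. The Python sketch below is illustrative only and is not taken from the patent: the class name `SplitStack`, the register count of four, the access counter, and the spill and refill policy (moving the deepest register to or from F-RAM when the register portion overflows or drains) are all assumptions.

```python
class SplitStack:
    """Illustrative model: top entries in volatile registers, the rest in F-RAM."""

    def __init__(self, num_registers=4):
        self.num_registers = num_registers  # assumed register count
        self.registers = []      # volatile top portion; top of stack is registers[-1]
        self.fram = []           # non-volatile bottom portion
        self.fram_accesses = 0   # counts F-RAM stack reads and writes

    def push(self, value):
        if len(self.registers) == self.num_registers:
            # Registers are full: spill the deepest register to F-RAM.
            self.fram.append(self.registers.pop(0))
            self.fram_accesses += 1
        self.registers.append(value)

    def pop(self):
        if not self.registers:
            raise IndexError("pop from empty stack")
        value = self.registers.pop()
        if self.fram:
            # Refill the deepest register from F-RAM.
            self.registers.insert(0, self.fram.pop())
            self.fram_accesses += 1
        return value


stack = SplitStack(num_registers=4)
for v in (1, 2, 3):
    stack.push(v)
stack.push(stack.pop() + stack.pop())  # an add that stays within the registers
print(stack.registers, stack.fram_accesses)
```

In this model, a computation that stays within the register portion leaves `fram_accesses` at zero, which corresponds to the case described above in which only CMOS register accesses are involved.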
It should be noted that a portion of the performance of this particular stack processor 200 implementation is due to the instruction set. However, some of it also results from the fact that the program is relatively small and can be written to derive benefit from the top of the stacks being in CMOS registers.
With reference additionally now to FIGS. 3A through 3D inclusive, exemplary illustrations of an instruction for the copying of the top of the data stack (“T”) into the top of a return stack (“R”) are shown.
The proposed stack access method illustrated keeps track of where the stack pointer is relative to the boundary between the CMOS and F-RAM portions of the stack. As illustrated, a data transfer between the data and return stacks can result in four possible initial configurations. The shaded boxes indicate that there is valid data in this stack memory address.
In FIG. 3A, neither the data stack nor the return stack has passed the F-RAM boundary. In FIG. 3B, ONLY the data stack has passed the F-RAM boundary. In FIG. 3C, ONLY the return stack has passed the F-RAM boundary and in FIG. 3D, both the data and return stacks have passed the F-RAM boundary.
As can be seen, according to the specific configuration, the proposed algorithm performs or does not perform a stack pop/push and for the specific instruction outlined above, the algorithm would do the following:
As shown in FIG. 3A, copy T into R; as shown in FIG. 3B, POP the data stack and copy T into R; as shown in FIG. 3C, PUSH the return stack and copy T into R; and as shown in FIG. 3D, PUSH the return stack, POP the data stack and copy T into R. In this manner, only a minimum number of F-RAM memory array accesses are required.
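The four cases above can be summarized as a small decision function. The following Python sketch is a hedged illustration, not the patented logic itself: the function name `t_to_r_steps`, the boolean flags, and the step labels are hypothetical, and the ordering of the steps simply follows the order in which the text lists them.

```python
def t_to_r_steps(data_past_boundary, return_past_boundary):
    """Return the steps for copying T into R for one of the four configurations.

    data_past_boundary:   True if the data stack has passed the F-RAM boundary.
    return_past_boundary: True if the return stack has passed the F-RAM boundary.
    """
    steps = []
    if return_past_boundary:
        steps.append("PUSH return stack")  # one F-RAM access
    if data_past_boundary:
        steps.append("POP data stack")     # one F-RAM access
    steps.append("copy T into R")          # register-to-register only
    return steps


for label, flags in (("3A", (False, False)), ("3B", (True, False)),
                     ("3C", (False, True)), ("3D", (True, True))):
    print("FIG.", label, t_to_r_steps(*flags))
```

In this sketch the number of F-RAM accesses is the number of returned steps minus one, so it ranges from zero (FIG. 3A) to two (FIG. 3D).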
With reference additionally now to FIG. 4, an exemplary illustration of a method in accordance with the representative embodiment of the present invention illustrated in FIG. 2 is shown wherein the CMOS registers are only updated when required and unused portions of the CMOS stack are not updated to conserve power. In this implementation of the present invention, the third CMOS register bus of the stack has not been updated or reset and since it is now in the unused part of the stack its current value is “don't care”. As can be determined, not updating that register bus allows for the saving of some more power.
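The idea of leaving unused registers untouched can be sketched in software as follows. This Python model is an assumption made for illustration: the class name `RegisterStack`, the fixed size of four registers, and the write counter are hypothetical, and the model does not handle overflow into F-RAM.

```python
class RegisterStack:
    """Illustrative register stack that never clears registers on pop."""

    def __init__(self, size=4):
        self.regs = [0] * size      # modeled CMOS registers
        self.depth = 0              # number of valid entries
        self.register_writes = 0    # counts register updates

    def push(self, value):
        if self.depth == len(self.regs):
            raise OverflowError("register portion is full in this sketch")
        self.regs[self.depth] = value
        self.depth += 1
        self.register_writes += 1

    def pop(self):
        if self.depth == 0:
            raise IndexError("pop from empty stack")
        self.depth -= 1
        # The register keeps its old value; it is now "don't care".
        return self.regs[self.depth]


rs = RegisterStack()
for v in (10, 20, 30):
    rs.push(v)
rs.pop()
print(rs.regs, rs.depth, rs.register_writes)
```

After the pop, the third register still holds its stale value and no additional register write has been counted, mirroring the power saving described above.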
With reference additionally now to FIG. 5, an exemplary illustration of a 16 bit instruction set in accordance with a particular embodiment of the present invention is shown which has been optimized to minimize memory fetches.
In this particular implementation of the present invention, the disclosed instruction set is based on a 16 bit memory code space. In order to determine the appropriate instruction set, the following possibilities may be examined:
    • any instruction set of 4 bits or fewer would allow at most 16 instructions and so would not be sufficient, as the program would take too long to execute with too many F-RAM memory array fetches;
    • any instruction set of 7 bits or more would provide 128 instructions or more, so too much leakage/dynamic power would be required and the logic would be excessively large;
    • a 6 bit instruction set would provide 64 instructions, which while sufficient, is nonetheless very difficult to map into a 16-bit word without wasting too many bits;
    • a 5 bit instruction set would provide 32 instructions and would appear to be a bit too limiting.
As can be seen in the exemplary instruction set illustrated, 64 instructions can be provided while minimizing the waste of memory bits. The most significant bit (MSB) of the 16 bit word is shared by each of the three 5 bit instructions in the word, so each instruction is effectively 6 bits wide (which gives a total of 64 instructions maximum). As a consequence, the full 16 bits of code are used.
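The bit-budget reasoning above can be checked with a few lines of arithmetic, and the shared-bit packing can be decoded in software. The Python sketch below is illustrative only: the function names `packing` and `decode_word`, the order of the three 5 bit fields within the word, and the example word `0x8443` are assumptions rather than details taken from FIG. 5.

```python
WORD_BITS = 16


def packing(opcode_bits, word_bits=WORD_BITS):
    """Return (opcode count, whole opcodes per word, unused bits) for plain fixed-width packing."""
    slots = word_bits // opcode_bits
    return 2 ** opcode_bits, slots, word_bits - slots * opcode_bits


def decode_word(word):
    """Split a 16 bit word into three 6 bit opcodes that share the word's MSB."""
    if not 0 <= word <= 0xFFFF:
        raise ValueError("word must fit in 16 bits")
    group_bit = (word >> 15) & 1
    # Assumed field order: bits 14-10, then bits 9-5, then bits 4-0.
    fields = [(word >> shift) & 0x1F for shift in (10, 5, 0)]
    return [(group_bit << 5) | field for field in fields]


for bits in (4, 5, 6, 7):
    print(bits, packing(bits))
print(decode_word(0x8443))
```

Under plain packing, 6 bit opcodes leave four of the sixteen bits unused, whereas three 5 bit fields plus one shared bit account for all sixteen bits while still addressing 64 opcodes.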
With reference additionally now to FIG. 6, an exemplary table of possible operating code (“opcode”) mappings for the Forth core of the preceding FIGS. 1 and 2 is shown. As illustrated, the Forth core instructions are grouped by the categories of: Branching, Memory Access, Logic, Arithmetic and Data Stack and those opcodes are shown which share the same five lower significant bits and differ only by the one most significant group bit as shown in the preceding FIG. 5.
While there have been described above the principles of the present invention in conjunction with specific circuitry and technology, it is to be clearly understood that the foregoing description is made only by way of example and not as a limitation to the scope of the invention. Particularly, it is recognized that the teachings of the foregoing disclosure will suggest other modifications to those persons skilled in the relevant art. Such modifications may involve other features which are already known per se and which may be used instead of or in addition to features already described herein. Although claims have been formulated in this application to particular combinations of features, it should be understood that the scope of the disclosure herein also includes any novel feature or any novel combination of features disclosed either explicitly or implicitly or any generalization or modification thereof which would be apparent to persons skilled in the relevant art, whether or not such relates to the same invention as presently claimed in any claim and whether or not it mitigates any or all of the same technical problems as confronted by the present invention. The applicants hereby reserve the right to formulate new claims to such features and/or combinations of such features during the prosecution of the present application or of any further application derived therefrom.
As used herein, the terms “comprises”, “comprising”, or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a recitation of certain elements does not necessarily include only those elements but may include other elements not expressly recited or inherent to such process, method, article or apparatus. None of the description in the present application should be read as implying that any particular element, step, or function is an essential element which must be included in the claim scope and THE SCOPE OF THE PATENTED SUBJECT MATTER IS DEFINED ONLY BY THE CLAIMS AS ALLOWED. Moreover, none of the appended claims are intended to invoke paragraph six of 35 U.S.C. Sect. 112 unless the exact phrase “means for” is employed and is followed by a participle.

Claims (7)

What is claimed is:
1. A method for operating a stack processor comprising:
a data POP wherein data is transferred from a bottom portion of a data stack to a top portion of the data stack in a first cycle of the stack processor, wherein the top portion of the data stack is in a volatile memory in a Forth core in the stack processor and the bottom portion of the data stack is in a ferroelectric random access memory in the stack processor, separate from the Forth core and coupled to the Forth core by a single code/data bus extending between the Forth core and ferroelectric random access memory only, and wherein data is transferred from the bottom portion of the data stack to the top portion of the data stack through the single code/data bus;
transferring data from the top portion of the data stack to a top portion of a return stack in a second cycle of the stack processor, wherein the top portion of the return stack is a single register in the volatile memory, and wherein transferring data between the top portion of the data stack and the top portion of the return stack comprises copying data in the top portion of the data stack to the top portion of the return stack; and
a return PUSH wherein data is transferred from the top portion of the return stack to a bottom portion of the return stack in a third cycle of the stack processor, wherein the bottom portion of the return stack is in the ferroelectric random access memory, and wherein data is transferred from the top portion of the return stack to the bottom portion of the return stack through the single code/data bus,
wherein the first, second and third clock cycles are sequential, and wherein the data POP, return PUSH and the transferring data are performed in response to a stack-based computer programming language.
2. The method of claim 1 wherein the stack-based computer programming language comprises 64 possible instructions based upon a 16 bit word, wherein each of the instructions in the 16 bit word comprises 3 five bit instructions and a 16th bit applicable to each of the 3 five bit instructions.
3. A method for operating a stack processor comprising:
in a first cycle of the stack processor transferring data from a bottom portion of a data stack to a top portion of the data stack, wherein the bottom portion of the data stack is in a ferroelectric random access memory (F-RAM) and the top portion of the data stack is in a volatile memory in a Forth core in the stack processor, and the data is transferred through a single code/data bus coupled between the F-RAM and the Forth core only;
in a second cycle of the stack processor copying the data from the top portion of the data stack to a top portion of a return stack, wherein the top portion of the return stack comprises a single register in the volatile memory; and
in a third cycle of the stack processor transferring the data from the top portion of the return stack to a bottom portion of the return stack, wherein the bottom portion of the return stack is in F-RAM and the data is transferred through the single code/data bus.
4. The method of claim 3 wherein the transferring and the copying of data are performed in response to a stack-based computer programming language supporting Forth core instructions.
5. The method of claim 4 wherein the Forth core instructions comprise 64 possible instructions based upon a 16 bit word, wherein each of the instructions in the 16 bit word comprises 3 five bit instructions and a 16th bit applicable to each of the 3 five bit instructions.
6. A method for operating a stack processor comprising:
transferring data from a bottom portion of a data stack to a top portion of the data stack, wherein the bottom portion of the data stack is in a ferroelectric random access memory (F-RAM) and the top portion of the data stack is in a volatile memory in a Forth core in the stack processor and the data is transferred through a single code/data bus coupled between the F-RAM and the Forth core only;
copying data from the top portion of the data stack to a top portion of a return stack, wherein the top portion of the return stack comprises a single register in the volatile memory; and
transferring the data from the top portion of the return stack to a bottom portion of the return stack, wherein the bottom portion of the return stack is in F-RAM and the data is transferred through the single code/data bus,
wherein the transferring and the copying of data are performed in three sequential clock cycles of the stack processor in response to Forth core instructions.
7. The method of claim 6 wherein the Forth core instructions comprise 64 possible instructions based upon a 16 bit word, wherein each of the instructions in the 16 bit word comprises 3 five bit instructions and a 16th bit applicable to each of the 3 five bit instructions.
US13/467,874 2011-05-16 2012-05-09 Stack processor using a ferroelectric random access memory (F-RAM) for code space and a portion of the stack memory space having an instruction set optimized to minimize processor stack accesses Active US9588881B2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US13/467,874 US9588881B2 (en) 2011-05-16 2012-05-09 Stack processor using a ferroelectric random access memory (F-RAM) for code space and a portion of the stack memory space having an instruction set optimized to minimize processor stack accesses
CN201210154918.2A CN102880446B (en) 2011-05-16 2012-05-16 Use ferroelectric RAM and there is the stack processor optimizing instruction set

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201161486652P 2011-05-16 2011-05-16
US13/467,874 US9588881B2 (en) 2011-05-16 2012-05-09 Stack processor using a ferroelectric random access memory (F-RAM) for code space and a portion of the stack memory space having an instruction set optimized to minimize processor stack accesses

Publications (2)

Publication Number Publication Date
US20120297120A1 US20120297120A1 (en) 2012-11-22
US9588881B2 true US9588881B2 (en) 2017-03-07

Family

ID=47175827

Family Applications (1)

Application Number Title Priority Date Filing Date
US13/467,874 Active US9588881B2 (en) 2011-05-16 2012-05-09 Stack processor using a ferroelectric random access memory (F-RAM) for code space and a portion of the stack memory space having an instruction set optimized to minimize processor stack accesses

Country Status (2)

Country Link
US (1) US9588881B2 (en)
CN (1) CN102880446B (en)

Also Published As

Publication number Publication date
CN102880446A (en) 2013-01-16
US20120297120A1 (en) 2012-11-22
CN102880446B (en) 2016-07-06

Legal Events

Date Code Title Description
AS Assignment
Owner name: RAMTRON INTERNATIONAL CORPORATION, COLORADO
Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:FILLERE, FRANCK;REEL/FRAME:028189/0944
Effective date: 20120508

AS Assignment
Owner name: CYPRESS SEMICONDUCTOR CORPORATION, CALIFORNIA
Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:RAMTRON INTERNATIONAL CORPORATION;REEL/FRAME:029408/0437
Effective date: 20121203

AS Assignment
Owner name: MORGAN STANLEY SENIOR FUNDING, INC., NEW YORK
Free format text: SECURITY INTEREST;ASSIGNORS:CYPRESS SEMICONDUCTOR CORPORATION;SPANSION LLC;REEL/FRAME:035240/0429
Effective date: 20150312

STCF Information on status: patent grant
Free format text: PATENTED CASE

AS Assignment
Owner name: MUFG UNION BANK, N.A., CALIFORNIA
Free format text: ASSIGNMENT AND ASSUMPTION OF SECURITY INTEREST IN INTELLECTUAL PROPERTY;ASSIGNOR:MORGAN STANLEY SENIOR FUNDING, INC.;REEL/FRAME:050896/0366
Effective date: 20190731

MAFP Maintenance fee payment
Free format text: PAYMENT OF MAINTENANCE FEE, 4TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1551); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY
Year of fee payment: 4

AS Assignment
Owner name: MORGAN STANLEY SENIOR FUNDING, INC., NEW YORK
Free format text: CORRECTIVE ASSIGNMENT TO CORRECT THE 8647899 PREVIOUSLY RECORDED ON REEL 035240 FRAME 0429. ASSIGNOR(S) HEREBY CONFIRMS THE SECURITY INTERST;ASSIGNORS:CYPRESS SEMICONDUCTOR CORPORATION;SPANSION LLC;REEL/FRAME:058002/0470
Effective date: 20150312

AS Assignment
Owner name: SPANSION LLC, CALIFORNIA
Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:MUFG UNION BANK, N.A.;REEL/FRAME:059410/0438
Effective date: 20200416
Owner name: CYPRESS SEMICONDUCTOR CORPORATION, CALIFORNIA
Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:MUFG UNION BANK, N.A.;REEL/FRAME:059410/0438
Effective date: 20200416