US20160110286A1 - Data writing method and memory system - Google Patents

Data writing method and memory system Download PDF

Info

Publication number
US20160110286A1
US20160110286A1 US14/982,353 US201514982353A US2016110286A1 US 20160110286 A1 US20160110286 A1 US 20160110286A1 US 201514982353 A US201514982353 A US 201514982353A US 2016110286 A1 US2016110286 A1 US 2016110286A1
Authority
US
United States
Prior art keywords
cache line
data
memory
changed
data block
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US14/982,353
Inventor
Fei Xia
Dejun Jiang
Jin Xiong
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Huawei Technologies Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd filed Critical Huawei Technologies Co Ltd
Publication of US20160110286A1 publication Critical patent/US20160110286A1/en
Assigned to HUAWEI TECHNOLOGIES CO., LTD. reassignment HUAWEI TECHNOLOGIES CO., LTD. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: JIANG, DEJUN, XIA, Fei, XIONG, JIN
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F12/00Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02Addressing or allocation; Relocation
    • G06F12/08Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
    • G06F12/0802Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
    • G06F12/0806Multiuser, multiprocessor or multiprocessing cache systems
    • G06F12/0811Multiuser, multiprocessor or multiprocessing cache systems with multilevel cache hierarchies
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F12/00Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02Addressing or allocation; Relocation
    • G06F12/08Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
    • G06F12/0802Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
    • G06F12/0804Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches with main memory updating
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F12/00Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02Addressing or allocation; Relocation
    • G06F12/08Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
    • G06F12/10Address translation
    • G06F12/109Address translation for multiple virtual address spaces, e.g. segmentation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F12/00Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02Addressing or allocation; Relocation
    • G06F12/08Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
    • G06F12/0802Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
    • G06F12/0866Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches for peripheral storage systems, e.g. disk cache
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F12/00Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02Addressing or allocation; Relocation
    • G06F12/08Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
    • G06F12/0802Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
    • G06F12/0877Cache access modes
    • G06F12/0879Burst mode
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F12/00Accessing, addressing or allocating within memory systems or architectures
    • G06F12/02Addressing or allocation; Relocation
    • G06F12/08Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
    • G06F12/0802Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
    • G06F12/0893Caches characterised by their organisation or structure
    • G06F12/0897Caches characterised by their organisation or structure with two or more cache hierarchy levels
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2212/00Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
    • G06F2212/28Using a specific disk cache architecture
    • G06F2212/283Plural cache memories
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2212/00Indexing scheme relating to accessing, addressing or allocation within memory systems or architectures
    • G06F2212/65Details of virtual memory and virtual address translation
    • G06F2212/657Virtual address space management
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Definitions

  • Embodiments of the present disclosure relates to computer technologies, and in particular, to a data writing method and a memory system.
  • An existing memory system basically includes a memory controller (MC), a memory device, and the like.
  • the memory controller and the memory device exchange data by using the double data rate (DDR) protocol.
  • the memory controller writes data into the memory device in a burst write manner, and a size of a data block on which one burst write is performed is a memory data bus width;
  • a cache and the memory system exchange data in unit of cache line, and a size of data read or written each time is a size of one cache line of a last level cache (LLC) in the cache. Therefore, the memory controller needs to perform multiple consecutive burst writes to write data of one cache line into the memory device, where a quantity of consecutive burst writes is called a burst length (BL).
  • BL burst length
  • a BL is generally equal to 8, and a size of a data block in one burst write is used as a granularity to divide one cache line into multiple data blocks. For example, if a size of one cache line of the LLC is 64 bytes and the memory data bus width is 64 bits, when burst write data appears on a data bus, the memory controller needs to perform eight burst writes in consecutive four clock cycles to write data of one cache line of the LLC into the memory device. However, actually, when data of one cache line of the LLC is written into the memory device, many data blocks are not changed. During a writing process, it is possible that invalid data (unchanged data) is written into the memory device in some burst writes. As a result, a speed of writing valid data (changed data) is low, and writing a large amount of invalid data leads to an increase in power consumption of the memory system, thereby reducing performance of the memory system.
  • BC4 burst chop 4
  • a total of four burst writes occur in two consecutive clock cycles, and there is no burst write in subsequent two clock cycles, to write a first half or a latter half of data of one cache line into the memory device.
  • invalid data is written into the memory device in some burst writes.
  • a speed of writing valid data is low, and writing a large amount of invalid data leads to an increase in power consumption of the memory system, thereby reducing performance of the memory system.
  • Embodiments of the present disclosure provide a data writing method and a memory system, where whether data in a data block of a cache line is changed is differentiated and a write is performed only on a changed data block, so that objectives to quickly write valid data, reduce power consumption of a memory system, and improve performance of the memory system are achieved.
  • an embodiment of the present disclosure provides a data writing method, which is applied to a memory system including at least a memory controller and a memory device and includes:
  • change information is information that is generated after the cache divides a first to-be-written cache line of a last level cache (LLC) into at least one data block and that is used to indicate whether data in each of the at least one data block is changed; and
  • LLC last level cache
  • skipping sending, by the memory controller according to the change information, a column address corresponding to each unchanged data block and data corresponding to each unchanged data block to the memory device; and for each changed data block in which data is changed as indicated by the change information, sending, by the memory controller according to the change information, a column address corresponding to each changed data block and data corresponding to each changed data block to the memory device;
  • sending, by the memory controller according to the change information, a column address corresponding to each changed data block and data corresponding to each changed data block to the memory device includes:
  • the writing, by the memory device according to the column address corresponding to each changed data block and the data corresponding to each changed data block, data of a burst length into each changed data block includes:
  • sending, by the memory controller according to the change information, a column address corresponding to each changed data block and data corresponding to each changed data block to the memory device includes:
  • the writing, by the memory device according to the column address corresponding to each changed data block and the data corresponding to each changed data block, data of a burst length into each changed data block includes:
  • the first to-be-written cache line and the at least one second to-be-written cache line are in a same row of a same storage group Bank, and there is no read command of the same row in the LLC.
  • the writing, by the memory device according to the column address corresponding to each changed data block and the data corresponding to each changed data block, data of a burst length into each changed data block includes:
  • an embodiment of the present disclosure provides a memory system, including at least a memory controller and a memory device, where:
  • the memory controller is configured to: receive change information sent by a cache, where the change information is information that is generated after the cache divides a first to-be-written cache line cache line of a last level cache (LLC) into at least one data block and that is used to indicate whether data in each of the at least one data block is changed; for each unchanged data block in which data is not changed as indicated by the change information, skip sending, according to the change information, a column address corresponding to each unchanged data block and data corresponding to each unchanged data block to the memory device; and for each changed data block in which data is changed as indicated by the change information, send, according to the change information, a column address corresponding to each changed data block and data corresponding to each changed data block to the memory device; and
  • LLC last level cache
  • the memory device is configured to write, according to the column address corresponding to each changed data block and the data corresponding to each changed data block, data of a burst length into each changed data block, where the burst length is equal to a quantity of the at least one data block.
  • the memory controller is configured to: if a quantity of the changed data blocks of the first to-be-written cache line is equal to the burst length, send the column address corresponding to each changed data block and the data corresponding to each changed data block to the memory device; and
  • the memory device is configured to perform, according to the column address corresponding to each changed data block and the data corresponding to each changed data block, the data write of the burst length on each changed data block of the first to-be-written cache line.
  • the memory controller is configured to: if a quantity of the changed data blocks of the first to-be-written cache line is less than the burst length, send the column address and the data corresponding to each changed data block of the first to-be-written cache line and a column address and data corresponding to each changed data block of at least one second to-be-written cache line to the memory device, where a sum of a quantity of the changed data blocks of the at least one second to-be-written cache line and the quantity of the changed data blocks of the first to-be-written cache line is less than or equal to the burst length; and
  • the memory device is configured to perform, according to each column address of the first to-be-written cache line and each column address of the at least one second to-be-written cache line, the data write of the burst length on each changed data block of the first to-be-written cache line and each changed data block of the at least one second to-be-written cache line, where the second to-be-written cache line is a to-be-written cache line except the first to-be-written cache line in the LLC.
  • the first to-be-written cache line and the at least one second to-be-written cache line are in a same row of a same storage group Bank, and there is no read command of the same row in the LLC.
  • a fourth possible implementation manner of the second aspect when column address buffers whose quantity is equal to the burst length and column decoders whose quantity is equal to the burst length are disposed on the memory device, the data write is performed on each changed data block by using an independent column address buffer and an independent column decoder.
  • a memory controller sends, according to change information sent by a cache, a column address and data to a memory device only for a data block in which data is changed, so that the memory device performs a data write on each changed data block and does not perform a write on a data block in which data is not changed. Therefore, objectives to quickly write valid data, reduce power consumption of a memory system, and improve performance of the memory system are achieved.
  • FIG. 1 is a flowchart of a data writing method according to embodiment 1 of the present disclosure
  • FIG. 2 is a schematic diagram showing working of an LLC in the data writing method according to embodiment(s) of the present disclosure
  • FIG. 3 is a schematic diagram showing working of a memory controller in the data writing method according to embodiment(s) of the present disclosure
  • FIG. 4 is a schematic diagram showing working of a memory device in the data writing method according to embodiment(s) of the present disclosure
  • FIG. 5 is a sequence diagram of write command combining in the data writing method according to embodiment(s) of the present disclosure.
  • FIG. 6 is a schematic structural diagram of a memory system according to embodiment(s) of the present disclosure.
  • FIG. 1 is a flowchart of a data writing method according to Embodiment 1 of the present disclosure. This embodiment is applied to a scenario in which data is written into a memory system including at least a memory controller and a memory device. Specifically, this embodiment includes the following steps:
  • the memory controller receives change information sent by a cache, where the change information is information that is generated after the cache divides a first to-be-written cache line of a last level cache (LLC) into at least one data block and that is used to indicate whether data in each of the at least one data block is changed.
  • LLC last level cache
  • a cache is located between a central processing unit (CPU) and a large-capacity memory system and has a relatively high access rate.
  • the cache divides the first to-be-written cache line of the last level cache (LLC) into at least one data block, and adds one flag bit to each of the at least one data block, where the flag bit indicates whether data in the data block is changed, one cache line needs multiple flag bits, and multiple flag bits of each cache line constitute change information indicating whether data in each of the at least one data block of the cache line is changed.
  • LLC last level cache
  • one cache line is divided into multiple data blocks by using a memory data bus width as a granularity, and one flag bit that is represented by 0 or 1 is added to each of the multiple data blocks, where 0 indicates that data in the data block is not changed, that is, a value of the data block is not changed; and 1 indicates that the data in the data block is changed, that is, the value of the data block is changed.
  • Flag bits of each cache line constitute a changed block vector (CBV), that is, change information, of the cache line.
  • one cache line may be divided into eight data blocks, and a burst length BL is equal to 8, that is, a size of one CBV is eight bits.
  • FIG. 2 is a schematic diagram showing working of an LLC in the data writing method according to embodiment(s) of the present disclosure.
  • the upper-level cache when an upper-level cache of the cache writes data into one cache line of the LLC, the upper-level cache first reads old data in the cache line and compares the old data with to-be-written data (new data); if data in one data block is not changed, the upper-level cache sets a flag bit of this data block to 0; if the data is changed, the upper-level cache sets the flag bit of this data block to 1.
  • the upper-level cache executes the following steps: (1) when writing data into a cache line of the LLC, read old data D 1 in the data block, first compares D 1 with to-be-written data D 2 by using a comparator, and record a comparison result in a CBV.
  • the upper-level cache reads data of other data blocks in the cache line, compares the data with new data that is to be written to these data blocks, and record a comparison result of each of the other data blocks in the CBV, to obtain CBV information of this cache line, that is, change information.
  • the LLC executes the following step: (2) write data into the memory system, that is, send a write request to the memory system to write data
  • the LLC simultaneously executes the following step: (3) send change information of the to-be-written cache line to the memory system.
  • the memory controller does not send a column address corresponding to each unchanged data block and data corresponding to each unchanged data block to the memory device; for each changed data block in which data is changed as indicated by the change information, the memory controller sends a column address corresponding to each changed data block and data corresponding to each changed data block to the memory device.
  • the memory controller in the memory system determines, according to the received change information, whether it is needed to perform a write on each data block of the first to-be-written cache line. Specifically, refer to FIG. 3 .
  • FIG. 3 is a schematic diagram showing working of a memory controller in the data writing method according to embodiment(s) of the present disclosure.
  • the memory controller in the memory system includes a request queue (transaction queue), a command queue, and the like.
  • the write request of the LLC is first placed in the request queue, the memory controller converts the write request to a specific command for operating the memory device and stores the command in the command queue.
  • the memory controller determines, according to the change information of the first to-be-written cache line, whether it is needed to perform a write on each of the at least one data block of the first to-be-written cache line.
  • a burst write is not performed; for a changed data block in which data is changed, a column address and data corresponding to the changed data block is sent to the memory controller each beat by using an address bus, a data bus, and the like. Because data is transmitted in both a rising edge and a falling edge of one clock cycle in the DDR technology and a data frequency of the data bus is twice a clock frequency of the data bus, each beat is half a clock cycle.
  • the memory device writes, according to the column address corresponding to each changed data block and the data corresponding to each changed data block, data of a burst length into each changed data block, where the burst length is equal to a quantity of the at least one data block.
  • a quantity of data blocks into which the first to-be-written cache line is divided is a quantity of consecutive burst writes.
  • the memory device performs the data write of the burst length on each changed data block according to each received column address and each piece of received data.
  • FIG. 4 is a schematic diagram showing working of a memory device in the data writing method according to embodiment(s) of the present disclosure. Referring to FIG.
  • the memory device includes a row address buffer (row address buffer), a row address decoder (row decoder), column address buffers (column address buffer) whose quantity is equal to the burst length, column address decoders (column decoder) whose quantity is equal to the burst length, a sense amplifier array (sense amplifier array, SAA), a memory array (memory array), a buffer with written data (data in buffer) and the like. For each changed data block, the memory device performs a data write by using an independent column address buffer and an independent column decoder.
  • a memory controller sends, according to change information sent by a cache, a column address and data to a memory device only for a data block in which data is changed, so that the memory device performs a data write on each changed data block and does not perform a write on a data block in which data is not changed. Therefore, objectives to quickly write valid data, reduce power consumption of a memory system, and improve performance of the memory system are achieved.
  • the memory controller determines, according to the change information, whether it is needed to perform a write on each of the at least one data block. For each changed data block in which data is changed as indicated by the change information, if a quantity of the changed data blocks of the first to-be-written cache line is equal to the burst length, the memory controller sends the column address corresponding to each changed data block and the data corresponding to each changed data block to the memory device. Correspondingly, the memory device performs, according to each column address, the data write of the burst length on each changed data block of the first to-be-written cache line.
  • the cache divides the first to-be-written cache line of the LLC into at least one data block and performs one burst write on each of the at least one data block, where a quantity of data blocks obtained after the division is a quantity of burst writes, that is, the burst length.
  • the change information received by the memory controller indicates that data in all the data blocks of the cache line is changed.
  • the memory controller sends a column address and data corresponding to the data block to the memory device; the memory device stores multiple received column addresses in different column address buffers, performs decoding concurrently by using different column decoders, selects different columns in the SAA, writes data to these selected columns, and finally writes the data in the SAA into the memory array.
  • a write is performed on each of the at least one data block of the first to-be-written cache line.
  • the first to-be-written cache line is divided into eight data blocks by using the memory data bus width as a granularity, and data in all the eight data blocks is changed.
  • the memory controller sends eight column addresses and corresponding data to the memory device.
  • Eight column address buffers and eight column decoders are disposed on the memory device, each column address buffer stores one column address, and the decoders corresponding to the column addresses perform decoding concurrently.
  • the memory controller determines, according to the change information, whether it is needed to perform a write on each data block. For each changed data block in which data is changed as indicated by the change information, if a quantity of the changed data blocks of the first to-be-written cache line is less than the burst length, the memory controller sends the column address and the data corresponding to each changed data block of the first to-be-written cache line and a column address and data corresponding to each changed data block of at least one second to-be-written cache line to the memory device.
  • a sum of a quantity of the changed data blocks of the at least one second to-be-written cache line and the quantity of the changed data blocks of the first to-be-written cache line is less than or equal to the burst length.
  • the memory device performs, according to each column address of the first to-be-written cache line and each column address of the at least one second to-be-written cache line, the data write of the burst length on each changed data block of the first to-be-written cache line and each changed data block of the at least one second to-be-written cache line, where the second to-be-written cache line is a to-be-written cache line except the first to-be-written cache line in the LLC.
  • the cache divides the first to-be-written cache line of the LLC into at least one data block and performs one burst write on each of the at least one data block, where a quantity of data blocks obtained after the division is a quantity of burst writes, that is, the burst length.
  • a quantity of data blocks obtained after the division is a quantity of burst writes, that is, the burst length.
  • the memory controller when performing command scheduling, the memory controller combines write commands, and completes multiple write in a clock cycle of one fixed burst length by combining the write commands, thereby preventing waste of the clock cycle, reducing power consumption of the memory system, and improving performance of the memory system.
  • write requests that are sent by the LLC to the memory controller and used to request that data of a size of the cache line is written are first stored in the request queue, and the memory controller converts these write requests to write commands for operating the memory device and stores the write commands in the command queue.
  • the memory controller converts these write requests to write commands for operating the memory device and stores the write commands in the command queue.
  • the memory controller sends a write command of the first to-be-written cache line
  • the memory controller discovers, according to the change information of the first to-be-written cache line, that the quantity of the changed data blocks of the cache line is less than the burst length, a write command corresponding to the at least one second to-be-written cache line is selected from the command queue.
  • a sum of a quantity of changed data blocks of the at least one second to-be-written cache line and the quantity of the changed data blocks of the first to-be-written cache line is less than or equal to the burst length.
  • the memory controller sends a column address and data corresponding to one changed data block of the first to-be-written cache line to the memory device each beat; after column addresses and data corresponding to the changed data blocks of the first to-be-written cache line are sent, the memory controller subsequently continues to send a column address and data corresponding to one changed data block of the second to-be-written cache line to the memory device each beat, and repeats this process until data is written into data blocks whose quantity is equal to the BL, or until write commands that can be combined cannot be found in the command queue, that is, a quantity of data blocks into which data is written is less than the BL.
  • the quantity of the changed data blocks of the first to-be-written cache line is less than the burst length, and the write command corresponding to the first to-be-written cache line and the write command corresponding to the at least one second to-be-written cache line need to be combined during the data writing process, the following needs to be met: the sum of the quantity of the changed data blocks of the at least one second to-be-written cache line and the quantity of the changed data blocks of the first to-be-written cache line is less than or equal to the burst length, where the first to-be-written cache line and the at least one second to-be-written cache line correspond to the write commands that can be combined.
  • the write commands further needs to meet the following condition: the first to-be-written cache line and the at least one second to-be-written cache line are in a same row of a same storage group Bank, and there is no read command of the same row in the LLC. That is, the write command corresponding to the first to-be-written cache line and the write commands corresponding to the at least one second to-be-written cache line are used for a write in the same row of the same storage group Bank, and there is no read request of the same row in the write commands corresponding to the at least one second to-be-written cache line.
  • the memory device further includes a row test module (row test), which is configured to test whether write commands are used to perform a data write in a same row of a same storage group Bank.
  • Table 1 shows information about commands in the command queue of the memory controller: three write commands are used to operate a same Bank, write commands Write 1 and Write 3 are used for a write in a row Row 1 , and a write command Write 2 is used for a write in row Row 2 .
  • Write 1 and Write 3 are used for the write in the same row;
  • CBV namely change information, indicates that a sum of a quantity of changed data blocks of a cache line corresponding to Write 1 and a quantity of changed data blocks of a cache line corresponding to Write 3 (as shown in the cross-hatching in Table 1) is equal to 8. Therefore, write command combining is performed for Write 1 and Write 3 ; the memory controller schedules Write 2 after completing scheduling Write 1 and Write 3 . Specifically, refer to FIG. 5 .
  • FIG. 5 is a sequence diagram of write command combining in the data writing method according to this embodiment of the present disclosure.
  • the memory controller in the first four beats, that is, rising edges and falling edges of clock cycles T 0 and T 1 , the memory controller sends Write 1 and column address col 1 , col 2 , col 3 and col 4 ; in subsequent four beats, that is, rising edges and falling edges of clock cycles T 2 and T 3 , the memory controller sends Write 3 and column addresses col 3 , col 4 , col 5 and col 6 . Then, the memory controller sends column addresses corresponding to Write 2 .
  • burst write data Dn appears on a data bus, and eight burst writes are performed, so that the changed data blocks of the cache line corresponding to Write 1 and the cache line corresponding to Write 3 are written into the memory device.
  • Col 1 indicates a column address corresponding to the first data block of eight data blocks of the cache line corresponding to Write 1
  • D 1 indicates data corresponding to the first data block, and the rest can be deduced by analogy.
  • the embodiment of the present disclosure is described in detail by using an example in which two write commands Write 1 and Write 3 are combined and the sum of the quantity of the changed data blocks of the cache line corresponding to Write 1 and the quantity of the changed data blocks of the cache line corresponding to Write 3 is equal to BL.
  • the embodiment of the present disclosure is not limited thereto.
  • multiple write commands may be combined. For example, the sum of the quantity of the changed data blocks of the cache line corresponding to Write 1 and the quantity of the changed data blocks of the cache line corresponding to Write 3 is less than the BL, other write commands that can be combined may be selected from the command queue.
  • FIG. 5 shows only three memory clocks (internal CK) of a dynamic random access memory (dynamic random access memory, DRAM). In fact, there are a total of eight memory clocks.
  • FIG. 6 is a schematic structural diagram of a memory system according to the embodiment of the present disclosure and is an apparatus embodiment corresponding to the embodiment of the present disclosure in FIG. 1 ; therefore, a specific implementation process is not described herein again.
  • a memory system 100 in this embodiment includes at least a memory controller 10 and a memory device 11 .
  • the memory controller 10 is configured to: receive change information sent by a cache, where the change information is information that is generated after the cache divides a first to-be-written cache line cache line of a last level cache (LLC) into at least one data block and that is used to indicate whether data in each of the at least one data block is changed; for each unchanged data block in which data is not changed as indicated by the change information, skip sending, according to the change information, a column address corresponding to each unchanged data block and data corresponding to each unchanged data block to the memory device; and for each changed data block in which data is changed as indicated by the change information, send, according to the change information, a column address corresponding to each changed data block and data corresponding to each changed data block to the memory device; and
  • LLC last level cache
  • the memory device 11 is configured to write, according to the column address corresponding to each changed data block and the data corresponding to each changed data block, data of a burst length into each changed data block, where the burst length is equal to a quantity of the at least one data block.
  • the memory controller 10 is configured to: if a quantity of the changed data blocks of the first to-be-written cache line is equal to the burst length, send the column address corresponding to each changed data block and the data corresponding to each changed data block to the memory device 11 .
  • the memory device 11 is configured to perform, according to each column address, the data write of the burst length on each changed data block of the first to-be-written cache line.
  • the memory controller 10 is configured to: if a quantity of the changed data blocks of the first to-be-written cache line is less than the burst length, send the column address and the data corresponding to each changed data block of the first to-be-written cache line and a column address and data corresponding to each changed data block of at least one second to-be-written cache line to the memory device 11 , where a sum of a quantity of the changed data blocks of the at least one second to-be-written cache line and the quantity of the changed data blocks of the first to-be-written cache line is less than or equal to the burst length; and
  • the memory device 11 is configured to perform, according to each column address of the first to-be-written cache line and each column address of the at least one second to-be-written cache line, the data write of the burst length on each changed data block of the first to-be-written cache line and each changed data block of the at least one second to-be-written cache line, where the second to-be-written cache line is a to-be-written cache line except the first to-be-written cache line in the LLC.
  • first to-be-written cache line and the at least one second to-be-written cache line are in a same row of a same storage group Bank, and there is no read command of the same row in the LLC.
  • the program may be stored in a computer-readable storage medium.
  • the foregoing storage medium includes: any medium that can store program code, such as a ROM, a RAM, a magnetic disk, or an optical disc.

Abstract

A data writing method and a memory system are disclosed. The method is applied to a memory system including at least a memory controller and a memory device, and the method includes: receiving, by the memory controller, change information sent by a cache, wherein the change information is information that is generated after the cache divides a first to-be-written cache line cache line of a last level cache LLC into at least one data block and that is used to indicate whether data in each of the at least one data block is changed; for each changed data block in which data is changed, sending, by the memory controller according to the change information, a corresponding column address and corresponding data to the memory device; and for a data block in which data is not changed, skipping performing, by the memory controller according to the change information, a write.

Description

    CROSS-REFERENCE TO RELATED APPLICATIONS
  • This application is a continuation of International Application No. PCT/CN2014/080073, filed on Jun. 17, 2014, which claims priority to Chinese Patent Application No. 201310270239.6, filed on Jun. 29, 2013, both of which are hereby incorporated by reference in their entireties.
  • TECHNICAL FIELD
  • Embodiments of the present disclosure relates to computer technologies, and in particular, to a data writing method and a memory system.
  • BACKGROUND
  • An existing memory system basically includes a memory controller (MC), a memory device, and the like. The memory controller and the memory device exchange data by using the double data rate (DDR) protocol. The memory controller writes data into the memory device in a burst write manner, and a size of a data block on which one burst write is performed is a memory data bus width; a cache and the memory system exchange data in unit of cache line, and a size of data read or written each time is a size of one cache line of a last level cache (LLC) in the cache. Therefore, the memory controller needs to perform multiple consecutive burst writes to write data of one cache line into the memory device, where a quantity of consecutive burst writes is called a burst length (BL).
  • In the DDR3 protocol, a BL is generally equal to 8, and a size of a data block in one burst write is used as a granularity to divide one cache line into multiple data blocks. For example, if a size of one cache line of the LLC is 64 bytes and the memory data bus width is 64 bits, when burst write data appears on a data bus, the memory controller needs to perform eight burst writes in consecutive four clock cycles to write data of one cache line of the LLC into the memory device. However, actually, when data of one cache line of the LLC is written into the memory device, many data blocks are not changed. During a writing process, it is possible that invalid data (unchanged data) is written into the memory device in some burst writes. As a result, a speed of writing valid data (changed data) is low, and writing a large amount of invalid data leads to an increase in power consumption of the memory system, thereby reducing performance of the memory system.
  • In a BC4 (burst chop 4) technology supported by the DDR3 protocol, when the memory controller writes data into the memory device, a total of four burst writes occur in two consecutive clock cycles, and there is no burst write in subsequent two clock cycles, to write a first half or a latter half of data of one cache line into the memory device. During this process, within the first two clock cycles, it is also possible that in a write manner in which whether data in a data block is changed or not is not considered, invalid data is written into the memory device in some burst writes. As a result, a speed of writing valid data is low, and writing a large amount of invalid data leads to an increase in power consumption of the memory system, thereby reducing performance of the memory system.
  • SUMMARY
  • Embodiments of the present disclosure provide a data writing method and a memory system, where whether data in a data block of a cache line is changed is differentiated and a write is performed only on a changed data block, so that objectives to quickly write valid data, reduce power consumption of a memory system, and improve performance of the memory system are achieved.
  • According to a first aspect, an embodiment of the present disclosure provides a data writing method, which is applied to a memory system including at least a memory controller and a memory device and includes:
  • receiving, by the memory controller, change information sent by a cache, where the change information is information that is generated after the cache divides a first to-be-written cache line of a last level cache (LLC) into at least one data block and that is used to indicate whether data in each of the at least one data block is changed; and
  • for each unchanged data block in which data is not changed as indicated by the change information, skipping sending, by the memory controller according to the change information, a column address corresponding to each unchanged data block and data corresponding to each unchanged data block to the memory device; and for each changed data block in which data is changed as indicated by the change information, sending, by the memory controller according to the change information, a column address corresponding to each changed data block and data corresponding to each changed data block to the memory device; and
  • writing, by the memory device according to the column address corresponding to each changed data block and the data corresponding to each changed data block, data of a burst length into each changed data block, where the burst length is equal to a quantity of the at least one data block.
  • In a first possible implementation manner of the first aspect, where the for each changed data block in which data is changed as indicated by the change information, sending, by the memory controller according to the change information, a column address corresponding to each changed data block and data corresponding to each changed data block to the memory device includes:
  • if a quantity of the changed data blocks of the first to-be-written cache line is equal to the burst length, sending, by the memory controller, the column address corresponding to each changed data block and the data corresponding to each changed data block to the memory device; and
  • the writing, by the memory device according to the column address corresponding to each changed data block and the data corresponding to each changed data block, data of a burst length into each changed data block includes:
  • performing, by the memory device according to the column address corresponding to each changed data block and the data corresponding to each changed data block, the data write of the burst length on each changed data block of the first to-be-written cache line.
  • In a second possible implementation manner of the first aspect, where the for each changed data block in which data is changed as indicated by the change information, sending, by the memory controller according to the change information, a column address corresponding to each changed data block and data corresponding to each changed data block to the memory device includes:
  • if a quantity of the changed data blocks of the first to-be-written cache line is less than the burst length, sending the column address and the data corresponding to each changed data block of the first to-be-written cache line and a column address and data corresponding to each changed data block of at least one second to-be-written cache line to the memory device, where a sum of a quantity of the changed data blocks of the at least one second to-be-written cache line and the quantity of the changed data blocks of the first to-be-written cache line is less than or equal to the burst length; and
  • the writing, by the memory device according to the column address corresponding to each changed data block and the data corresponding to each changed data block, data of a burst length into each changed data block includes:
  • performing, by the memory device according to each column address of the first to-be-written cache line and each column address of the at least one second to-be-written cache line, the data write of the burst length on each changed data block of the first to-be-written cache line and each changed data block of the at least one second to-be-written cache line, where the second to-be-written cache line is a to-be-written cache line except the first to-be-written cache line in the LLC.
  • With reference to the second possible implementation manner of the first aspect, in a third possible implementation manner of the first aspect, the first to-be-written cache line and the at least one second to-be-written cache line are in a same row of a same storage group Bank, and there is no read command of the same row in the LLC.
  • With reference to the first aspect or any one of the first to the third possible implementation manners of the first aspect, in a fourth possible implementation manner of the first aspect, the writing, by the memory device according to the column address corresponding to each changed data block and the data corresponding to each changed data block, data of a burst length into each changed data block includes:
  • when column address buffers whose quantity is equal to the burst length and column decoders whose quantity is equal to the burst length are disposed on the memory device, performing the data write on each changed data block by using an independent column address buffer and an independent column decoder.
  • According to a second aspect, an embodiment of the present disclosure provides a memory system, including at least a memory controller and a memory device, where:
  • the memory controller is configured to: receive change information sent by a cache, where the change information is information that is generated after the cache divides a first to-be-written cache line cache line of a last level cache (LLC) into at least one data block and that is used to indicate whether data in each of the at least one data block is changed; for each unchanged data block in which data is not changed as indicated by the change information, skip sending, according to the change information, a column address corresponding to each unchanged data block and data corresponding to each unchanged data block to the memory device; and for each changed data block in which data is changed as indicated by the change information, send, according to the change information, a column address corresponding to each changed data block and data corresponding to each changed data block to the memory device; and
  • the memory device is configured to write, according to the column address corresponding to each changed data block and the data corresponding to each changed data block, data of a burst length into each changed data block, where the burst length is equal to a quantity of the at least one data block.
  • In a first possible implementation manner of the second aspect, the memory controller is configured to: if a quantity of the changed data blocks of the first to-be-written cache line is equal to the burst length, send the column address corresponding to each changed data block and the data corresponding to each changed data block to the memory device; and
  • the memory device is configured to perform, according to the column address corresponding to each changed data block and the data corresponding to each changed data block, the data write of the burst length on each changed data block of the first to-be-written cache line.
  • In a second possible implementation manner of the second aspect, the memory controller is configured to: if a quantity of the changed data blocks of the first to-be-written cache line is less than the burst length, send the column address and the data corresponding to each changed data block of the first to-be-written cache line and a column address and data corresponding to each changed data block of at least one second to-be-written cache line to the memory device, where a sum of a quantity of the changed data blocks of the at least one second to-be-written cache line and the quantity of the changed data blocks of the first to-be-written cache line is less than or equal to the burst length; and
  • the memory device is configured to perform, according to each column address of the first to-be-written cache line and each column address of the at least one second to-be-written cache line, the data write of the burst length on each changed data block of the first to-be-written cache line and each changed data block of the at least one second to-be-written cache line, where the second to-be-written cache line is a to-be-written cache line except the first to-be-written cache line in the LLC.
  • With reference to the second possible implementation manner of the second aspect, in a third possible implementation manner of the second aspect, the first to-be-written cache line and the at least one second to-be-written cache line are in a same row of a same storage group Bank, and there is no read command of the same row in the LLC.
  • With reference to the second aspect or the first, the second, or the third possible implementation manner of the second aspect, in a fourth possible implementation manner of the second aspect, when column address buffers whose quantity is equal to the burst length and column decoders whose quantity is equal to the burst length are disposed on the memory device, the data write is performed on each changed data block by using an independent column address buffer and an independent column decoder.
  • In the data writing method and the memory system provided in the embodiments of the present disclosure, a memory controller sends, according to change information sent by a cache, a column address and data to a memory device only for a data block in which data is changed, so that the memory device performs a data write on each changed data block and does not perform a write on a data block in which data is not changed. Therefore, objectives to quickly write valid data, reduce power consumption of a memory system, and improve performance of the memory system are achieved.
  • BRIEF DESCRIPTION OF DRAWINGS
  • To describe the technical solutions in the embodiments of the present disclosure more clearly, the following briefly introduces the accompanying drawings needed for describing the embodiments.
  • FIG. 1 is a flowchart of a data writing method according to embodiment 1 of the present disclosure;
  • FIG. 2 is a schematic diagram showing working of an LLC in the data writing method according to embodiment(s) of the present disclosure;
  • FIG. 3 is a schematic diagram showing working of a memory controller in the data writing method according to embodiment(s) of the present disclosure;
  • FIG. 4 is a schematic diagram showing working of a memory device in the data writing method according to embodiment(s) of the present disclosure;
  • FIG. 5 is a sequence diagram of write command combining in the data writing method according to embodiment(s) of the present disclosure; and
  • FIG. 6 is a schematic structural diagram of a memory system according to embodiment(s) of the present disclosure.
  • DESCRIPTION OF EMBODIMENTS
  • To make the objectives, technical solutions, and advantages of the embodiments of the present disclosure clearer, the following clearly describes the technical solutions in the embodiments of the present disclosure with reference to the accompanying drawings in the embodiments of the present disclosure.
  • FIG. 1 is a flowchart of a data writing method according to Embodiment 1 of the present disclosure. This embodiment is applied to a scenario in which data is written into a memory system including at least a memory controller and a memory device. Specifically, this embodiment includes the following steps:
  • 101. The memory controller receives change information sent by a cache, where the change information is information that is generated after the cache divides a first to-be-written cache line of a last level cache (LLC) into at least one data block and that is used to indicate whether data in each of the at least one data block is changed.
  • A cache is located between a central processing unit (CPU) and a large-capacity memory system and has a relatively high access rate. In this step, the cache divides the first to-be-written cache line of the last level cache (LLC) into at least one data block, and adds one flag bit to each of the at least one data block, where the flag bit indicates whether data in the data block is changed, one cache line needs multiple flag bits, and multiple flag bits of each cache line constitute change information indicating whether data in each of the at least one data block of the cache line is changed. For example, one cache line is divided into multiple data blocks by using a memory data bus width as a granularity, and one flag bit that is represented by 0 or 1 is added to each of the multiple data blocks, where 0 indicates that data in the data block is not changed, that is, a value of the data block is not changed; and 1 indicates that the data in the data block is changed, that is, the value of the data block is changed. Flag bits of each cache line constitute a changed block vector (CBV), that is, change information, of the cache line. Specifically, assuming that a size of one cache line is 64 bytes and the memory data bus width is 64 bits, one cache line may be divided into eight data blocks, and a burst length BL is equal to 8, that is, a size of one CBV is eight bits.
  • FIG. 2 is a schematic diagram showing working of an LLC in the data writing method according to embodiment(s) of the present disclosure. As shown in FIG. 2, when an upper-level cache of the cache writes data into one cache line of the LLC, the upper-level cache first reads old data in the cache line and compares the old data with to-be-written data (new data); if data in one data block is not changed, the upper-level cache sets a flag bit of this data block to 0; if the data is changed, the upper-level cache sets the flag bit of this data block to 1. When the LLC writes the data of the cache line to the memory system, CBV information corresponding to the cache line is transferred to the memory controller in the memory system, and the memory controller receives the CBV information, that is, receives corresponding change information. Taking a data block whose tag is Tag1 as an example, the upper-level cache executes the following steps: (1) when writing data into a cache line of the LLC, read old data D1 in the data block, first compares D1 with to-be-written data D2 by using a comparator, and record a comparison result in a CBV. Likewise, the upper-level cache reads data of other data blocks in the cache line, compares the data with new data that is to be written to these data blocks, and record a comparison result of each of the other data blocks in the CBV, to obtain CBV information of this cache line, that is, change information. When the LLC executes the following step: (2) write data into the memory system, that is, send a write request to the memory system to write data, the LLC simultaneously executes the following step: (3) send change information of the to-be-written cache line to the memory system.
  • 102. According to the change information, for each unchanged data block in which data is not changed as indicated by the change information, the memory controller does not send a column address corresponding to each unchanged data block and data corresponding to each unchanged data block to the memory device; for each changed data block in which data is changed as indicated by the change information, the memory controller sends a column address corresponding to each changed data block and data corresponding to each changed data block to the memory device.
  • In this step, the memory controller in the memory system determines, according to the received change information, whether it is needed to perform a write on each data block of the first to-be-written cache line. Specifically, refer to FIG. 3.
  • FIG. 3 is a schematic diagram showing working of a memory controller in the data writing method according to embodiment(s) of the present disclosure. Referring to FIG. 3, the memory controller in the memory system includes a request queue (transaction queue), a command queue, and the like. The write request of the LLC is first placed in the request queue, the memory controller converts the write request to a specific command for operating the memory device and stores the command in the command queue. The memory controller determines, according to the change information of the first to-be-written cache line, whether it is needed to perform a write on each of the at least one data block of the first to-be-written cache line. Specifically, for an unchanged data block in which data is not changed, a burst write is not performed; for a changed data block in which data is changed, a column address and data corresponding to the changed data block is sent to the memory controller each beat by using an address bus, a data bus, and the like. Because data is transmitted in both a rising edge and a falling edge of one clock cycle in the DDR technology and a data frequency of the data bus is twice a clock frequency of the data bus, each beat is half a clock cycle.
  • 103. The memory device writes, according to the column address corresponding to each changed data block and the data corresponding to each changed data block, data of a burst length into each changed data block, where the burst length is equal to a quantity of the at least one data block.
  • Generally, a quantity of data blocks into which the first to-be-written cache line is divided is a quantity of consecutive burst writes. In this step, the memory device performs the data write of the burst length on each changed data block according to each received column address and each piece of received data.
  • Optionally, compared with that one memory device has only one column address buffer and one column decoder in the prior art, the memory device in this embodiment includes multiple column address buffers and multiple column decoders. FIG. 4 is a schematic diagram showing working of a memory device in the data writing method according to embodiment(s) of the present disclosure. Referring to FIG. 4, in this embodiment, the memory device includes a row address buffer (row address buffer), a row address decoder (row decoder), column address buffers (column address buffer) whose quantity is equal to the burst length, column address decoders (column decoder) whose quantity is equal to the burst length, a sense amplifier array (sense amplifier array, SAA), a memory array (memory array), a buffer with written data (data in buffer) and the like. For each changed data block, the memory device performs a data write by using an independent column address buffer and an independent column decoder. Each time data is written, multiple column addresses sent by the memory controller are stored in different column address buffers and decoded concurrently by using different column decoders, different columns in the SAA are selected, data is written into these selected columns, and finally the data in the SAA is written into the memory array.
  • In the data writing method provided in this embodiment of the present disclosure, a memory controller sends, according to change information sent by a cache, a column address and data to a memory device only for a data block in which data is changed, so that the memory device performs a data write on each changed data block and does not perform a write on a data block in which data is not changed. Therefore, objectives to quickly write valid data, reduce power consumption of a memory system, and improve performance of the memory system are achieved.
  • Optionally, in the foregoing Embodiment 1, the memory controller determines, according to the change information, whether it is needed to perform a write on each of the at least one data block. For each changed data block in which data is changed as indicated by the change information, if a quantity of the changed data blocks of the first to-be-written cache line is equal to the burst length, the memory controller sends the column address corresponding to each changed data block and the data corresponding to each changed data block to the memory device. Correspondingly, the memory device performs, according to each column address, the data write of the burst length on each changed data block of the first to-be-written cache line.
  • Specifically, the cache divides the first to-be-written cache line of the LLC into at least one data block and performs one burst write on each of the at least one data block, where a quantity of data blocks obtained after the division is a quantity of burst writes, that is, the burst length. In this embodiment, if the quantity of the changed data blocks is equal to the burst length, that is, data in all the data blocks obtained after the division is changed, the change information received by the memory controller indicates that data in all the data blocks of the cache line is changed. In this case, for each of the at least one data block of the first to-be-written cache line, the memory controller sends a column address and data corresponding to the data block to the memory device; the memory device stores multiple received column addresses in different column address buffers, performs decoding concurrently by using different column decoders, selects different columns in the SAA, writes data to these selected columns, and finally writes the data in the SAA into the memory array. In this way, a write is performed on each of the at least one data block of the first to-be-written cache line. For example, the first to-be-written cache line is divided into eight data blocks by using the memory data bus width as a granularity, and data in all the eight data blocks is changed. Therefore, the memory controller sends eight column addresses and corresponding data to the memory device. Eight column address buffers and eight column decoders are disposed on the memory device, each column address buffer stores one column address, and the decoders corresponding to the column addresses perform decoding concurrently.
  • Optionally, in the foregoing Embodiment 1, the memory controller determines, according to the change information, whether it is needed to perform a write on each data block. For each changed data block in which data is changed as indicated by the change information, if a quantity of the changed data blocks of the first to-be-written cache line is less than the burst length, the memory controller sends the column address and the data corresponding to each changed data block of the first to-be-written cache line and a column address and data corresponding to each changed data block of at least one second to-be-written cache line to the memory device. A sum of a quantity of the changed data blocks of the at least one second to-be-written cache line and the quantity of the changed data blocks of the first to-be-written cache line is less than or equal to the burst length. Correspondingly, the memory device performs, according to each column address of the first to-be-written cache line and each column address of the at least one second to-be-written cache line, the data write of the burst length on each changed data block of the first to-be-written cache line and each changed data block of the at least one second to-be-written cache line, where the second to-be-written cache line is a to-be-written cache line except the first to-be-written cache line in the LLC.
  • Generally, the cache divides the first to-be-written cache line of the LLC into at least one data block and performs one burst write on each of the at least one data block, where a quantity of data blocks obtained after the division is a quantity of burst writes, that is, the burst length. In this embodiment, if the quantity of the changed data blocks is less than the burst length, data in only some data blocks of the data blocks obtained after the division is changed. In this case, when performing command scheduling, the memory controller combines write commands, and completes multiple write in a clock cycle of one fixed burst length by combining the write commands, thereby preventing waste of the clock cycle, reducing power consumption of the memory system, and improving performance of the memory system.
  • Specifically, write requests that are sent by the LLC to the memory controller and used to request that data of a size of the cache line is written are first stored in the request queue, and the memory controller converts these write requests to write commands for operating the memory device and stores the write commands in the command queue. When the memory device sends a write command of the first to-be-written cache line, if the memory controller discovers, according to the change information of the first to-be-written cache line, that the quantity of the changed data blocks of the cache line is less than the burst length, a write command corresponding to the at least one second to-be-written cache line is selected from the command queue. A sum of a quantity of changed data blocks of the at least one second to-be-written cache line and the quantity of the changed data blocks of the first to-be-written cache line is less than or equal to the burst length. In burst writes whose quantity is equal to the BL, the memory controller sends a column address and data corresponding to one changed data block of the first to-be-written cache line to the memory device each beat; after column addresses and data corresponding to the changed data blocks of the first to-be-written cache line are sent, the memory controller subsequently continues to send a column address and data corresponding to one changed data block of the second to-be-written cache line to the memory device each beat, and repeats this process until data is written into data blocks whose quantity is equal to the BL, or until write commands that can be combined cannot be found in the command queue, that is, a quantity of data blocks into which data is written is less than the BL.
  • It should be noted that, if the quantity of the changed data blocks of the first to-be-written cache line is less than the burst length, and the write command corresponding to the first to-be-written cache line and the write command corresponding to the at least one second to-be-written cache line need to be combined during the data writing process, the following needs to be met: the sum of the quantity of the changed data blocks of the at least one second to-be-written cache line and the quantity of the changed data blocks of the first to-be-written cache line is less than or equal to the burst length, where the first to-be-written cache line and the at least one second to-be-written cache line correspond to the write commands that can be combined. In addition, the write commands further needs to meet the following condition: the first to-be-written cache line and the at least one second to-be-written cache line are in a same row of a same storage group Bank, and there is no read command of the same row in the LLC. That is, the write command corresponding to the first to-be-written cache line and the write commands corresponding to the at least one second to-be-written cache line are used for a write in the same row of the same storage group Bank, and there is no read request of the same row in the write commands corresponding to the at least one second to-be-written cache line. In this case, referring to FIG. 4, the memory device further includes a row test module (row test), which is configured to test whether write commands are used to perform a data write in a same row of a same storage group Bank.
  • Specifically, it is assumed that a size of one cache line of the LLC is 64 bytes, the memory data bus width is 64 bits, and the burst length BL is equal to 8. Table 1 shows information about commands in the command queue of the memory controller: three write commands are used to operate a same Bank, write commands Write1 and Write3 are used for a write in a row Row1, and a write command Write2 is used for a write in row Row2.
  • TABLE 1
    Write Row CBV
    Write1 Row1
    1 1 1 1 0 0 0 0
    Write2 Row2 1 1 1 0 0 0 0 0
    Write3 Row1 0 0 1 1 1 1 0 0
  • As can be seen from Table 1, Write1 and Write3 are used for the write in the same row; CBV, namely change information, indicates that a sum of a quantity of changed data blocks of a cache line corresponding to Write1 and a quantity of changed data blocks of a cache line corresponding to Write3 (as shown in the cross-hatching in Table 1) is equal to 8. Therefore, write command combining is performed for Write1 and Write3; the memory controller schedules Write2 after completing scheduling Write1 and Write3. Specifically, refer to FIG. 5.
  • FIG. 5 is a sequence diagram of write command combining in the data writing method according to this embodiment of the present disclosure. As shown in FIG. 5, in the first four beats, that is, rising edges and falling edges of clock cycles T0 and T1, the memory controller sends Write1 and column address col1, col2, col3 and col4; in subsequent four beats, that is, rising edges and falling edges of clock cycles T2 and T3, the memory controller sends Write3 and column addresses col3, col4, col5 and col6. Then, the memory controller sends column addresses corresponding to Write2. At a moment T5, burst write data Dn appears on a data bus, and eight burst writes are performed, so that the changed data blocks of the cache line corresponding to Write1 and the cache line corresponding to Write3 are written into the memory device. Col1 indicates a column address corresponding to the first data block of eight data blocks of the cache line corresponding to Write1, and D1 indicates data corresponding to the first data block, and the rest can be deduced by analogy.
  • It should be noted that, in the foregoing embodiment, the embodiment of the present disclosure is described in detail by using an example in which two write commands Write1 and Write3 are combined and the sum of the quantity of the changed data blocks of the cache line corresponding to Write1 and the quantity of the changed data blocks of the cache line corresponding to Write3 is equal to BL. However, the embodiment of the present disclosure is not limited thereto. In another possible implementation manner, multiple write commands may be combined. For example, the sum of the quantity of the changed data blocks of the cache line corresponding to Write1 and the quantity of the changed data blocks of the cache line corresponding to Write3 is less than the BL, other write commands that can be combined may be selected from the command queue. In addition, if a sum of quantities of change data blocks of cache lines corresponding to all write commands that can be combined in the command queue is less than the BL, burst writes whose quantity is equal to the BL are performed, and some clock cycles in the burst writes whose quantity is equal to the BL or some beats of a clock cycle are idle. In addition, FIG. 5 shows only three memory clocks (internal CK) of a dynamic random access memory (dynamic random access memory, DRAM). In fact, there are a total of eight memory clocks.
  • FIG. 6 is a schematic structural diagram of a memory system according to the embodiment of the present disclosure and is an apparatus embodiment corresponding to the embodiment of the present disclosure in FIG. 1; therefore, a specific implementation process is not described herein again. Specifically, a memory system 100 in this embodiment includes at least a memory controller 10 and a memory device 11.
  • Specifically, the memory controller 10 is configured to: receive change information sent by a cache, where the change information is information that is generated after the cache divides a first to-be-written cache line cache line of a last level cache (LLC) into at least one data block and that is used to indicate whether data in each of the at least one data block is changed; for each unchanged data block in which data is not changed as indicated by the change information, skip sending, according to the change information, a column address corresponding to each unchanged data block and data corresponding to each unchanged data block to the memory device; and for each changed data block in which data is changed as indicated by the change information, send, according to the change information, a column address corresponding to each changed data block and data corresponding to each changed data block to the memory device; and
  • the memory device 11 is configured to write, according to the column address corresponding to each changed data block and the data corresponding to each changed data block, data of a burst length into each changed data block, where the burst length is equal to a quantity of the at least one data block.
  • Further, the memory controller 10 is configured to: if a quantity of the changed data blocks of the first to-be-written cache line is equal to the burst length, send the column address corresponding to each changed data block and the data corresponding to each changed data block to the memory device 11.
  • The memory device 11 is configured to perform, according to each column address, the data write of the burst length on each changed data block of the first to-be-written cache line.
  • Further, the memory controller 10 is configured to: if a quantity of the changed data blocks of the first to-be-written cache line is less than the burst length, send the column address and the data corresponding to each changed data block of the first to-be-written cache line and a column address and data corresponding to each changed data block of at least one second to-be-written cache line to the memory device 11, where a sum of a quantity of the changed data blocks of the at least one second to-be-written cache line and the quantity of the changed data blocks of the first to-be-written cache line is less than or equal to the burst length; and
  • the memory device 11 is configured to perform, according to each column address of the first to-be-written cache line and each column address of the at least one second to-be-written cache line, the data write of the burst length on each changed data block of the first to-be-written cache line and each changed data block of the at least one second to-be-written cache line, where the second to-be-written cache line is a to-be-written cache line except the first to-be-written cache line in the LLC.
  • Further, the first to-be-written cache line and the at least one second to-be-written cache line are in a same row of a same storage group Bank, and there is no read command of the same row in the LLC.
  • Further, when column address buffers whose quantity is equal to the burst length and column decoders whose quantity is equal to the burst length are disposed on the memory device 11, where the number is equal to the burst length, the data write is performed on each changed data block by using an independent column address buffer and an independent column decoder.
  • Persons of ordinary skill in the art may understand that all or some of the steps of the method embodiments may be implemented by a program instructing relevant hardware. The program may be stored in a computer-readable storage medium. When the program runs, the steps of the method embodiments are performed. The foregoing storage medium includes: any medium that can store program code, such as a ROM, a RAM, a magnetic disk, or an optical disc.
  • Finally, it should be noted that the foregoing embodiments are merely intended for describing the technical solutions of the present disclosure, but not for limiting the present disclosure. Although the present disclosure is described in detail with reference to the foregoing embodiments, persons of ordinary skill in the art should understand that they may still make modifications to the technical solutions described in the foregoing embodiments or make equivalent replacements to some or all technical features thereof, without departing from the scope of the technical solutions of the embodiments of the present disclosure.

Claims (12)

What is claimed is:
1. A memory data updating method applied to a computer system comprising a CPU, a memory controller and a memory device, wherein the CPU comprises a last level cache (LLC), the method comprising:
dividing, by the CPU, a first cache line of the last level cache into k data blocks according to a burst write size, each of the k data blocks having a size equal to a multiple of the burst write size, wherein the burst write size is determined by a data bus width of the memory device, k is an integer and 2≦k≦burst length (BL), BL being a number of burst writes required for the memory controller to write data of one cache line of the last level cache into the memory device;
determining, by the CPU, update information of the first cache line, wherein the update information of the first cache line indicates an updating status for each of the k data blocks in the first cache line, and
sending, by the CPU, the update information of the first cache line to the memory controller.
2. The memory data updating method according to claim 1, wherein the step of determining update information of the first cache line comprises:
receiving, by the CPU, data in a buffer that is to be written into the first cache line;
creating a k-bits vector in a register, wherein each bit of the k-bits vector is to store the comparing result between the data that is to be written into a data block with data stored in the corresponding data block;
for each data block in the first cache line, comparing, by the CPU, the data in the buffer that is to be written into the each data block with data stored in the each data block; and
storing a comparison result for each data block into the bit corresponding to the data block in the k-bits vector in the register.
3. The memory data updating method according to claim 2, wherein the step of storing the comparison result comprises:
if the data that is to be written into the data block is the same as data stored in the data block, setting the bit corresponding to the data block in the k bits vector to 1;
if the data that is to be written into the data block and data stored in the data block are the same, setting the bit corresponding to the data block in the k bits vector to 0.
4. A memory data writing method applied to a computer system comprising a CPU, a memory controller and a memory device, wherein the CPU comprises a last level cache (LLC), the method comprising:
receiving, by the memory controller, update information of a first cache line in the last level cache and data of the first cache line sent by the CPU, wherein the update information of the first cache line indicates an updating status for each of k data blocks in the first cache line, wherein each of the k data blocks has a size equal to a multiple of a burst write size which is a size of a data block that equal to a memory data bus width, k is an integer and 2≦k≦burst length (BL), BL being a number of burst writes required by the memory controller to write data of one cache line into the memory device;
selecting, by the memory controller, changed data blocks in the first cache line according to the update information of the first cache line in the last level cache;
generating, by the memory controller, memory addresses of the changed data blocks of the first cache line in the last level cache, wherein each memory address comprises a row address and a column address of one changed data block; and
sending, by the memory controller, the memory addresses corresponding to the changed data blocks and data corresponding to the changed data blocks of the first cache line to the memory device for storing.
5. The method according to claim 4, wherein if a sum of the size of the changed data blocks of the first cache line is less than a length of the first cache line,
generating, by the memory controller, memory addresses of changed data blocks in a second cache line, wherein each memory address comprises a row address and a column address of each changed data block in the second cache line;
sending the memory addresses corresponding to the changed data blocks of the first cache line, data corresponding to the changed data blocks of the first cache line, the memory addresses corresponding to the changed data blocks of the second cache line and data corresponding to the changed data blocks of the second cache line to the memory device, wherein a sum of a size of the changed data blocks of the second cache line and the sum of the size of the changed data blocks of the first cache line is less than or equal to the length of the first cache line in the last level cache.
6. The method according to claim 5, wherein the row address of each changed data block in the second cache line and the row address of each changed data block in the first cache line are the same, and there is no read command for the same row address in the memory device.
7. A computer system, which comprises a CPU and a memory system, wherein the CPU comprises a last level cache (LLC), and the memory system comprises a memory controller and a memory device, wherein:
the cache is configured to: divide a first cache line of the last level cache into k data blocks according to a burst write size, each of the k data blocks having a size equal to a multiple of the burst write size, wherein the burst write size is determined by a data bus width of the memory device, k is an integer, and 2≦k≦burst length (BL), BL being a number of burst writes required for the memory controller to write data of one cache line of the last level cache into the memory device; determine update information of the first cache line, wherein the update information of the first cache line indicates an updating status for each of the k data blocks in the first cache line, and send the update information of the first cache line to the memory controller;
the memory controller is configured to: receive update information of the first cache line in the last level cache and data of the first cache line sent by the CPU, wherein the update information of the first cache line indicates an updating status for each of k data blocks in the first cache line, wherein each of the k data blocks has a size that equal to a multiple of a burst write size which is a size of a data block that equal to a memory data bus width, k is an integer, and 2≦k≦burst length (BL), BL being a number of burst writes required by the memory controller to write data of one cache line into the memory device; select changed data blocks in the first cache line according to the update information of the first cache line; generate memory addresses of the changed data blocks of the first cache line in the last level cache, wherein each memory address comprises a row address and a column address of one changed data block; and send the memory addresses corresponding to the changed data blocks and data corresponding to the changed data blocks of the first cache line to the memory device for storing; and
the memory is configured to: receive the memory addresses corresponding to the changed data blocks and data corresponding to the changed data blocks of the first cache line, and perform writing, according to the memory addresses corresponding to the changed data blocks and the data corresponding to the changed data blocks, the data of the changed data blocks in the memory device.
8. The computer system according to claim 7, wherein the memory controller is configured to: if a sum of the size of the changed data blocks of the first cache line is less than a length of the first cache line,
generate memory addresses of changed data blocks in a second cache line, wherein each memory address comprises a row address and a column address of each changed data block in the second cache line;
send the memory addresses corresponding to the changed data blocks of the first cache line, data corresponding to the changed data blocks of the first cache line, the memory addresses corresponding to the changed data blocks of the second cache line and data corresponding to the changed data blocks of the second cache line to the memory device, wherein a sum of a size of the changed data blocks of the second cache line and the sum of the size of the changed data blocks of the first cache line is less than or equal to the length of the first cache line in the last level cache.
9. The computer system according to claim 8, wherein the row address of the changed data blocks in the second cache line and the row address of each changed data block in the first cache line are the same, and there is no read command for the same row address in the memory device.
10. The computer system according to claim 7, wherein column address buffers whose quantity is equal to the burst length and column address decoders whose quantity is equal to the burst length are configured on the memory device, and each column address of each changed data block is processed by using an independent column address buffer and an independent column decoder.
11. The computer system according to claim 8, wherein column address buffers whose quantity is equal to the burst length and column address decoders whose quantity is equal to the burst length are configured on the memory device, and each column address of each changed data block is processed by using an independent column address buffer and an independent column decoder.
12. The computer system according to claim 9, wherein when column address buffers whose quantity is equal to the burst length and column address decoders whose quantity is equal to the burst length are configured on the memory device, and each column address of each changed data block is processed by using an independent column address buffer and an independent column decoder.
US14/982,353 2013-06-29 2015-12-29 Data writing method and memory system Abandoned US20160110286A1 (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CN201310270239.6 2013-06-29
CN201310270239.6A CN104252420B (en) 2013-06-29 2013-06-29 Method for writing data and memory system
PCT/CN2014/080073 WO2014206220A1 (en) 2013-06-29 2014-06-17 Data writing method and memory system

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2014/080073 Continuation WO2014206220A1 (en) 2013-06-29 2014-06-17 Data writing method and memory system

Publications (1)

Publication Number Publication Date
US20160110286A1 true US20160110286A1 (en) 2016-04-21

Family

ID=52141031

Family Applications (1)

Application Number Title Priority Date Filing Date
US14/982,353 Abandoned US20160110286A1 (en) 2013-06-29 2015-12-29 Data writing method and memory system

Country Status (9)

Country Link
US (1) US20160110286A1 (en)
EP (1) EP2998867B1 (en)
JP (1) JP6159478B2 (en)
KR (1) KR101785189B1 (en)
CN (2) CN107577614B (en)
AU (1) AU2014301874B2 (en)
BR (1) BR112015031803A2 (en)
RU (1) RU2621611C1 (en)
WO (1) WO2014206220A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210200695A1 (en) * 2019-12-27 2021-07-01 Advanced Micro Devices, Inc. Staging memory access requests

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107688466B (en) * 2016-08-05 2020-11-03 中科寒武纪科技股份有限公司 Arithmetic device and operation method thereof
CN109324982B (en) * 2017-07-31 2023-06-27 上海华为技术有限公司 Data processing method and data processing device
CN113918508A (en) * 2021-12-15 2022-01-11 苏州浪潮智能科技有限公司 Cache acceleration method, device, equipment and readable storage medium
CN117396857A (en) * 2021-12-21 2024-01-12 华为技术有限公司 Data storage method, storage device and equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5459842A (en) * 1992-06-26 1995-10-17 International Business Machines Corporation System for combining data from multiple CPU write requests via buffers and using read-modify-write operation to write the combined data to the memory
US6658533B1 (en) * 2000-09-21 2003-12-02 Intel Corporation Method and apparatus for write cache flush and fill mechanisms
US7075851B2 (en) * 2003-02-17 2006-07-11 Renesas Technology Corp. Semiconductor memory device inputting/outputting data and parity data in burst operation
US7304908B2 (en) * 2005-01-15 2007-12-04 Samsung Electronics, Co., Ltd. SRAM device capable of performing burst operation

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH02113353A (en) * 1988-10-24 1990-04-25 Hitachi Ltd Semiconductor memory
JPH03164849A (en) * 1989-11-22 1991-07-16 Matsushita Electric Ind Co Ltd Microprocessor and microprocessor system
RU2010318C1 (en) * 1991-10-28 1994-03-30 Институт точной механики и вычислительной техники им.С.А.Лебедева РАН Memory control unit
JPH05282208A (en) * 1992-04-03 1993-10-29 Hitachi Ltd Cache memory control system
EP0683457A1 (en) * 1994-05-20 1995-11-22 Advanced Micro Devices, Inc. A computer system including a snoop control circuit
JP3204295B2 (en) * 1997-03-31 2001-09-04 日本電気株式会社 Cache memory system
WO2005050454A1 (en) * 2003-11-18 2005-06-02 Matsushita Electric Industrial Co., Ltd. Cache memory and control method thereof
ATE410733T1 (en) * 2004-06-16 2008-10-15 Freescale Semiconductor Inc DATE BUFFER STORAGE SYSTEM
KR101443231B1 (en) * 2007-11-27 2014-09-19 삼성전자주식회사 Cache memory capable of adjusting burst length of write-back data in write-back operation
US8001331B2 (en) * 2008-04-17 2011-08-16 Arm Limited Efficiency of cache memory operations
US8700840B2 (en) * 2009-01-05 2014-04-15 SanDisk Technologies, Inc. Nonvolatile memory with write cache having flush/eviction methods
US8464002B2 (en) * 2009-10-14 2013-06-11 Board Of Regents Of The University Of Texas System Burst-based cache dead block prediction
US8924652B2 (en) * 2009-11-23 2014-12-30 Marvell Israel (M.I.S.L.) Ltd. Simultaneous eviction and cleaning operations in a cache
CN102135941B (en) * 2010-08-26 2013-09-11 华为技术有限公司 Method and device for writing data from cache to memory
CN102012850B (en) * 2010-12-09 2012-09-12 首都师范大学 Hardware monitoring and micro-packet protocol-based key data restoration method
RU2487398C1 (en) * 2011-12-13 2013-07-10 Общество с ограниченной ответственностью "ЛАН-ПРОЕКТ" Method of creating virtual memory and device for realising said method
CN102725741B (en) * 2011-12-31 2014-11-05 华为技术有限公司 Method, device, and system for controlling cache
CN102662992B (en) * 2012-03-14 2014-10-08 北京搜狐新媒体信息技术有限公司 Method and device for storing and accessing massive small files
CN102999441B (en) * 2012-11-15 2015-06-17 清华大学 Fine granularity memory access method
WO2015127327A1 (en) * 2014-02-23 2015-08-27 Rambus Inc. Distributed procedure execution and file systems on a memory interface

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5459842A (en) * 1992-06-26 1995-10-17 International Business Machines Corporation System for combining data from multiple CPU write requests via buffers and using read-modify-write operation to write the combined data to the memory
US6658533B1 (en) * 2000-09-21 2003-12-02 Intel Corporation Method and apparatus for write cache flush and fill mechanisms
US7075851B2 (en) * 2003-02-17 2006-07-11 Renesas Technology Corp. Semiconductor memory device inputting/outputting data and parity data in burst operation
US7304908B2 (en) * 2005-01-15 2007-12-04 Samsung Electronics, Co., Ltd. SRAM device capable of performing burst operation

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
Ebeling, C., "DDR3 Synchronous DRAM Memory", Spring 2011, retrieved from CSE 467, Winter 2011 lecture notes, hosted by University of Washington, 16 pages *
Kleanthous, M. and Sazeides, Y., "CATCH: A mechanism for dynamically detecting cache-content-duplication in instruction caches", October 2011, ACM Transactions on Architecture and Code Optimization, v.8 n.3, Article 11, 27 pages, [doi>10.1145/2019608.2019610] *
Koller, R. and Rangaswami, R., "I/O Deduplication: Utilizing content similarity to improve I/O performance", September 2010, ACM Transactions on Storage (TOS), v.6 n.3, p.1-26, [doi>10.1145/1837915.1837921] *
Rixner, S., Dally, W.J., Kapasi, U.J., Mattson, P., and Owens, J.D, "Memory access scheduling", 2000, In Proceedings of the 27th annual international symposium on Computer architecture (ISCA '00), ACM, New York, NY, USA, 128-138, doi: https://doi.org/10.1145/339647.339668 *
Stuecheli, J., Kaseridis, D., Daly, D., Hunter, H.C., and John, L.K., "The virtual write queue: coordinating DRAM and last-level cache policies", June 19-23, 2010, Proceedings of the 37th annual international symposium on Computer architecture, Saint-Malo, France, [doi>10.1145/1815961.1815972] *
Y. J. Nam, D. Park and D. H. C. Du, "Assuring Demanded Read Performance of Data Deduplication Storage with Backup Datasets," 2012 IEEE 20th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems, Washington, DC, 2012, pp. 201-208.doi: 10.1109/MASCOTS.2012.32 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20210200695A1 (en) * 2019-12-27 2021-07-01 Advanced Micro Devices, Inc. Staging memory access requests

Also Published As

Publication number Publication date
KR20160014053A (en) 2016-02-05
AU2014301874B2 (en) 2017-05-04
EP2998867B1 (en) 2019-09-18
JP6159478B2 (en) 2017-07-05
CN104252420A (en) 2014-12-31
CN107577614B (en) 2020-10-16
WO2014206220A1 (en) 2014-12-31
KR101785189B1 (en) 2017-10-12
AU2014301874A1 (en) 2016-01-21
CN104252420B (en) 2017-08-29
EP2998867A4 (en) 2016-07-06
RU2621611C1 (en) 2017-06-06
JP2016524251A (en) 2016-08-12
BR112015031803A2 (en) 2017-07-25
EP2998867A1 (en) 2016-03-23
CN107577614A (en) 2018-01-12

Similar Documents

Publication Publication Date Title
US20160110286A1 (en) Data writing method and memory system
US8250322B2 (en) Command reordering based on command priority
US8661180B2 (en) Memory controlling device and memory controlling method
US7707328B2 (en) Memory access control circuit
US7844785B2 (en) Method and apparatus for memory access optimization
US20120239873A1 (en) Memory access system and method for optimizing SDRAM bandwidth
US8918589B2 (en) Memory controller, memory system, semiconductor integrated circuit, and memory control method
JP2008532140A (en) Memory device and method having multiple internal data buses and memory bank interleaving
CN101340569A (en) High-speed cache and data processing method thereof
JP5613242B2 (en) Memory management unit, image processing apparatus, and integrated circuit
JP2014154119A (en) Memory controller and semiconductor storage device
JP5034551B2 (en) Memory controller, semiconductor memory access control method and system
US10423548B2 (en) Memory controller, control method for the memory controller, and control method for memory
US20120066471A1 (en) Allocation of memory buffers based on preferred memory performance
KR20140088725A (en) Dynamic random access memory device having selective refresh function according to read and write access and the selective refresh method the same
JP5204777B2 (en) Memory device and control method thereof
US20070121398A1 (en) Memory controller capable of handling precharge-to-precharge restrictions
US20080263290A1 (en) Memory control apparatus and memory control method
WO2023189358A1 (en) Memory control device
TWI721660B (en) Device and method for controlling data reading and writing
US10719440B2 (en) Semiconductor device and memory access method
US11276452B2 (en) Memory device including a plurality of area having different refresh periods, memory controller controlling the same and memory system including the same
KR101335367B1 (en) Apparatus and method for controlling memory
KR20010050234A (en) Addressing of a memory
JP2014093030A (en) SDRAM controller

Legal Events

Date Code Title Description
AS Assignment

Owner name: HUAWEI TECHNOLOGIES CO., LTD., CHINA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:XIA, FEI;JIANG, DEJUN;XIONG, JIN;REEL/FRAME:038562/0962

Effective date: 20160512

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION