Arizona State University, Tempe, US
Ph: 480-278-9230
Strengths
The authors propose, test, and discuss limitations of their VAB in a
reasonable concise manner.
The authors use standard benchmarks (SPECINT95) which adds to the
credibility of their results.
They define the model processor that they used in these tests very
well.
They used useful metrics in their evaluation.
Their approach seems feasible to implement into existing
architectures.
Limitations
The paper is written to address the increasing power usage of register
files in wider instruction issue processors. The move to multi-core
architectures largely makes this work obsolete until superscalar multi-
core architectures become used.
The introduction section of this paper could use a lot of filling in
with supportive data to better motivate their idea.
They do a poor job defining finite numbers for the power usage of a
typical register file and the effect that the increasing register file
size has on overall system power.
I have few doubts in this paper.
1. How is VAB different from pipeline registers? Does a group of
pipeline registers with additional features like valid bit etc make a
VAB??
2.How is the power saved when VAB is accessed for data by consecutive
few instructions? How is the energy per access to RF is greater than
energy per access to VAB? Essentially both get accessed in the similar
manner. Only thing that I understand is the access time for accessing
the register files is more than accessing VAB.
Can someone please help me in clearing my doubts?
Regards
Sai Raghunath T
Regards
Sai Raghunath T
http://www.geocities.com/paddyinpilani
Regards
Sai Raghunath T
The paper suggests using a new element (VAB) for accessing the
Register File which resembles a Cache in many ways. They claim doing
so will help reduce the overall power consumption with a slightly
negative effect on performance. The values that need to be kept in
this holding tank called VAB, as well as the mechanism used to store
or remove values from it are well defined; as is the lifetime
characteristics of values in the register file. The impact this new
element will have on the mechanism of read, write and freeing
operations are also described sufficiently.
One thing that is missing in the results is the ratio of power
consumption of VAB. They only claim that the overall power consumption
had reduced, however if a high percentage of this power was consumed
by the VAB which should be very small in size this could result in a
major hot spot problem.
In the last section they suggest adding a location bit to the VAB to
prevent performance loss. If we are prepared to go as far as adding a
bit to decide to access the VAB or the register file why stop at that?
Why not assign multiple bits to determine this register belongs to
which one of the many and possibly smaller register files?
As a final note one thing that is missing from this paper is any
indication of the cost of implementing this VAB architecture.
* Paper overview:
Register files are a large culprit of power consumption due to their
large size and many read/write ports. However, the short-lived nature
of many register values can be exploited by instead retrieving some
values from a VAB; this distributes heat away from the register file
and reduces the need for more read/write ports, which in turn
decreases overall power usage.
* Praise during online discussion:
Several critics complimented the paper on its usage of concrete
benchmarks such as SPECINT95. The authors go to great detail to
describe their testing methodology, and there is a lot of confidence
that the results could be duplicated independently.
In addition, the paper provides power-saving statistics, which allow
an implementor to predict benefits and trade-offs. The paper
acknowledge performance decreases-extra cycles and extra bits are used-
and additional difficulties (such as misprediction) are openly
admitted..
* Criticism during online discussion:
The relevance of the article is questioned, citing the paper's
dependence on super scalar architectures. Simple architectures may not
benefit as much from a VAB.
Moreover, there were multiple questions (both in-class and out) about
the usefulness of the paper's technique. For example, how is the
presence of a VAB different from having registers elsewhere on the
chip (e.g., pipeline registers)? If the register file was broken up
into smaller chunks and distributed throughout the processor, that
would reduce the size of any single chunk of the register file-
potentially providing a similar tangible heat benefit as the VAB. (The
paper's authors were contacted for further explanation, but no
response has been received.)
Another critic questioned whether the technique could be self-
defeating. If the VAB itself became a hot-spot, then the power-benefit
could be mitigated. This would be especially unfortunate when
considering the performance loss and implementation challenges
associated with a VAB.
Last, the paper lacks some important information. For example, the
implementation of the micro-architecture is not described. Application
debugging in a VAB chip is not discussed at all.
chip ( e.g., pipeline registers)? If the register file was broken up