You don't need anything too fancy.
In your case, you're "Driving a Cadillac". You have multiplier
control, so accidentally overclocking the RAM is a non-issue.
You'll need to add some VCore to reach the higher steps.
Basically, the process as I see it is to "draw a speed versus VCore voltage graph".
Raise the CPU speed in tiny steps, and boot to your OS each time (the tiny
steps are important, so you don't outright crash the OS immediately).
I run Prime95 for testing. I gradually inch up the speed, until I
see an error in Prime95 within five to ten minutes. Then, I have a "data point".
I bump the voltage a bit, and retest. If it's stable, I move up in frequency
again, until it becomes slightly unstable and throws another Prime95 error.
On the processor I tried this on years ago, eventually I "hit a wall".
At a certain frequency, I got my Prime95 error. I increased VCore
and it wouldn't go away. I increased VCore again, and still no progress.
I went to a step below the "unsafe" voltage for a final check, the error
was still there, and the graph now had a "wall" in it.
There is no point in being right up against that wall.
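If it helps to see the bookkeeping, here is a rough Python sketch of that
graph-drawing loop. It is not a tool that touches hardware. The
`is_stable` callback is a stand-in for "set it in the BIOS, boot, run Prime95
for ten minutes", and `toy_chip` is an invented model I made up so the
sketch runs. The millivolt values and MHz numbers are assumptions, not
recommendations for any real processor.

```python
def draw_graph(is_stable, start_mhz, step_mhz, max_mhz, vcore_steps_mv):
    """Walk frequency upward in small steps, adding VCore only when needed.

    Returns (points, wall_mhz). points maps each passing frequency to the
    lowest VCore (in mV) at which it passed. wall_mhz is the first frequency
    that still failed at the highest allowed VCore, or None if max_mhz was
    reached without hitting a wall.
    """
    points = {}
    v_index = 0
    freq = start_mhz
    while freq <= max_mhz:
        # Bump the voltage one step at a time until this frequency passes.
        while v_index < len(vcore_steps_mv) and not is_stable(freq, vcore_steps_mv[v_index]):
            v_index += 1
        if v_index == len(vcore_steps_mv):
            return points, freq  # more voltage didn't help: that's the wall
        points[freq] = vcore_steps_mv[v_index]
        freq += step_mhz
    return points, None


def toy_chip(freq_mhz, vcore_mv):
    """Invented model: a hard wall at 3600 MHz, and below that the chip
    needs 1200 mV plus 50 mV for every 100 MHz above 3000 MHz."""
    if freq_mhz >= 3600:
        return False
    needed_mv = 1200 + max(0, freq_mhz - 3000) // 2
    return vcore_mv >= needed_mv


# The last entry plays the role of "one step below the unsafe voltage".
steps_mv = [1200, 1250, 1300, 1350, 1400, 1450]
points, wall = draw_graph(toy_chip, 3000, 100, 4000, steps_mv)
for f in sorted(points):
    print(f, "MHz needs", points[f], "mV")
print("wall at", wall, "MHz")
```

In real life each `is_stable` call is a reboot and a ten minute wait, so
the loop is really just a description of what you'd write on paper.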
Some processors don't have a wall, and instead get uncomfortably
hot before hitting a limit. The Intel D805 would go to 4GHz and
draw 200 watts. Some motherboards got so hot doing that, the VCore
components would melt the foam rubber used to support the
test motherboard. You don't want to go that far.
Since I'd hit the wall with my processor, I could back off the
frequency, and set VCore to a step more than the value in the graph.
That's my "margin". Next, run Prime95 for four to eight hours. If there
are no errors, you're done. Maybe you need another voltage step, if it's
still a bit wobbly. You can throw in a simultaneous 3D looping benchmark as
well as Prime95, for further stress of the computer.
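As a small sketch of that "back off and add margin" choice, assuming you've
recorded your data points as frequency and lowest-passing VCore pairs: the
function name, the `backoff` parameter and the numbers below are all made
up for illustration.

```python
def pick_final(points, vcore_steps_mv, backoff=1):
    """Choose final settings from the graph.

    points maps frequency (MHz) to the lowest VCore (mV) that passed.
    Back off `backoff` frequency steps from the highest passing point,
    then set VCore one step above the graph value as margin.
    """
    freqs = sorted(points)
    if backoff >= len(freqs):
        raise ValueError("not enough data points to back off that far")
    freq = freqs[-1 - backoff]
    i = vcore_steps_mv.index(points[freq])
    # One step of margin, but never beyond the highest allowed step.
    margin_mv = vcore_steps_mv[min(i + 1, len(vcore_steps_mv) - 1)]
    return freq, margin_mv


# Hypothetical graph: each 100 MHz cost another 50 mV.
steps_mv = [1200, 1250, 1300, 1350, 1400, 1450]
graph = {3000: 1200, 3100: 1250, 3200: 1300, 3300: 1350, 3400: 1400, 3500: 1450}
print(pick_final(graph, steps_mv))  # (3400, 1450)
```

Whatever comes out of this is only the starting point for the long
Prime95 run, not a guarantee of stability.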
Now, with the multiplier approach alone, you lose granularity.
You could mix both clock adjustments and multiplier, but if you
do so, then you have to sit down and do the math for the RAM settings.
A clock change affects both CPU and RAM. The RAM readout in the BIOS
will likely not show the "true speed". You have to compute the true
value yourself, while sitting in front of the computer. If you're offered,
say, DDR3-1066 and DDR3-800, these represent different ratios between
the CPU input clock and the final DRAM clock. If bumping up the CPU clock
pushes the RAM above its true DDR3-1066 rating, you use the DDR3-800 BIOS
setting to select a lower ratio. The new true value is then the ratio of
clocks (new clock over stock clock) times the reduced setting you selected.
In that example, you can get a lot of testing out of a 1066 to 800 change,
without violating the limits of the RAM. (You may also need to work out
appropriate values for Tras, Trcd and so on.)
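Here's the RAM arithmetic as a short Python sketch. I'm assuming a stock
input clock of 200MHz and RAM rated for DDR3-1066, purely as example
numbers, and the function names are my own. Your board's stock clock and
available settings will differ.

```python
def true_ram_speed(bios_setting, clock_mhz, stock_clock_mhz):
    """True DDR speed = BIOS setting scaled by (new clock / stock clock)."""
    return bios_setting * clock_mhz / stock_clock_mhz


def best_setting(settings, clock_mhz, stock_clock_mhz, ram_rating):
    """Highest BIOS setting whose true speed stays within the RAM rating,
    or None if even the lowest setting would overclock the RAM."""
    ok = [s for s in settings
          if true_ram_speed(s, clock_mhz, stock_clock_mhz) <= ram_rating]
    return max(ok) if ok else None


# Assumed example: stock clock 200 MHz, RAM rated DDR3-1066.
print(true_ram_speed(1066, 205, 200))           # 1092.65, over the rating
print(true_ram_speed(800, 205, 200))            # 820.0, comfortably under
print(best_setting([800, 1066], 205, 200, 1066))  # 800
print(true_ram_speed(800, 265, 200))            # 1060.0, still within rating
```

With those example numbers, the DDR3-800 setting keeps the RAM inside its
rating all the way from a 200MHz to a 265MHz clock, which is the "lot of
testing" I mean.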
If the input clock to the CPU was 200MHz, you might proceed in 5MHz steps,
while watching for Prime95 errors. If you see no errors in Prime95 in ten
minutes, you bump the clock again. Since you have multiplier, you could scan
with clock, until you get to the value you could have had with one multiplier
step, then go back to 200MHz clock, while using +1 on multiplier. The
clock adjustment gives you finer granularity between multiplier steps.
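To make the interleaving concrete, here's a little Python sketch that lists
the test points in order. The 200MHz base clock and 5MHz step come from the
example above. The multipliers 15 and 16 are just numbers I picked, and I'm
assuming whole-number multiplier steps.

```python
def schedule(base_clock, clock_step, start_mult, end_mult):
    """List (clock MHz, multiplier, CPU MHz) test points in rising order.

    For each multiplier, scan the clock upward from base_clock until the
    CPU speed would reach what the next multiplier gives at base_clock,
    then drop back to base_clock and use the next multiplier instead.
    """
    steps = []
    for mult in range(start_mult, end_mult):
        clock = base_clock
        next_mult_speed = base_clock * (mult + 1)
        while clock * mult < next_mult_speed:
            steps.append((clock, mult, clock * mult))
            clock += clock_step
    steps.append((base_clock, end_mult, base_clock * end_mult))
    return steps


for clock, mult, cpu in schedule(200, 5, 15, 16):
    print(f"{clock} MHz x {mult} = {cpu} MHz")
# 200 MHz x 15 = 3000 MHz
# 205 MHz x 15 = 3075 MHz
# 210 MHz x 15 = 3150 MHz
# 200 MHz x 16 = 3200 MHz
```

Remember the RAM follows the clock during the scanning part, so the RAM
math still applies to every point where the clock isn't at stock.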
The reason for using tiny steps between experiments is so you
don't crash the computer, and remain in control the whole time.
I've had the computer so unstable that Prime95 errors out virtually
immediately. If that happens, you're working in too large a step size.
If you have a Windows overclocker program, that saves the reboot step. The
online overclocker application I used would make slow clock changes (slewing).
For the program to take a giant step, it would crank the clock gradually
over a 30 second period. That isn't absolutely necessary, but it's
fun to watch.
Anyway, in my view, that's the thing to do. Watch TV with one eyeball, and
every ten minutes, check to see if it's time to tweak the computer. It'll
take a while to draw the graph of frequency versus VCore, but once you have
the graph in hand, you can select close to the final values, and run a
longer term stability test overnight.
The reason for disabling Turbo, CNQ and C1E is so you're in absolute
control of things. Using CPU-Z, you can check the core clock speed while
you're working, and verify the value. Various power saving options
could tempt the system to enter a different P-state many times a
second, and you might not be sure of your result. On the computer
I'm typing on, I have a whole bunch of C state options that have to be
disabled, before the CPU clock stands still at the canonical value.
I have a different brand of motherboard that doesn't take nearly
as much to "tame" and run at a fixed rate.
Just before you "turn on Prime95", you wouldn't want the motherboard
making any sudden leaps in frequency while idle. You want as
steady a frequency as possible, to help you prepare your
SHMOO plot.
http://en.wikipedia.org/wiki/Shmoo_plot
(Example SHMOO...)
http://www.realworldtech.com/page.cfm?NewsID=354
The C states are mentioned here, but some motherboards go all the
way to C6, so this article isn't complete. You may find more info
on the C states, on another site.
http://en.wikipedia.org/wiki/C1E#Processor_states
Paul