SCN wrapping

Mladen Gogala

Jan 19, 2012, 12:38:28 PM

It appears that Oracle 11G has a rather serious bug: it wraps SCN numbers
around, when doing begin/end backup type of backup:

http://tinyurl.com/6wbker6

One of the quotes in the article reminded me of the CIO who has saved 9.5
million dollars on monitoring by switching to OEM. He was talking about
thousands of databases. The interesting passage from the article (page 5)
is here:
"Again, only very large customers with many interconnected Oracle
databases would be likely to run a significant risk of being affected by
this problem. But the larger the Oracle environment, the longer this
restoration would take. Typically, large organizations have the least
tolerance for downtime."

That's precisely the description of the company run by the guy who has
saved millions. This could be funny. Of course, my confidence in Oracle
is also a bit shaken; bugs at a level this fundamental are not supposed
to happen. I should be able to trust my DB vendor with the same degree of
trust as my stock broker. I know that my stock broker is not going to
securitize worthless "liar loans", get a deceptive AAA rating for the
resulting security from the auditing agency owned by the same bank as the
brokerage, sell that security to me, and then bet against it with an
insurance company. I must have the same level of confidence in my DB
vendor, too.

--
http://mgogala.freehostia.com

joel garry

Jan 19, 2012, 1:26:11 PM

On Jan 19, 9:38 am, Mladen Gogala <gogala.REMOVETHISmla...@google.com>
wrote:

> --
> http://mgogala.freehostia.com

Well, the attack surface can be large even for a small company:
http://www.gokhanatil.com/2012/01/fundamental-oracle-flaw-revealed-lets.html

So, can script kiddies poison dns to point at their own VM with a
compromised scn in it and a user already linked to? How about if they
can steal a backup VM, or a plain old backup of an XE used in
production? Does OCM world-publish enough info to know what to
attack? Are employees ever disaffected?

Questions, questions, questions.

jg
--
@home.com is bogus.
"What does it say about the state of computer science education that
one must make a case for teaching how to think clearly?"
http://research.microsoft.com/en-us/um/people/lamport/pubs/pubs.html#teaching-concurrency

Matthias Hoys

Jan 19, 2012, 3:11:39 PM

"Mladen Gogala" <gogala.REMO...@google.com> wrote in message
news:jf9kek$gk8$1...@solani.org...

> It appears that Oracle 11G has a rather serious bug: it wraps SCN numbers
> around, when doing begin/end backup type of backup:
>
> http://tinyurl.com/6wbker6
>

> --
> http://mgogala.freehostia.com

I wonder if it's only 11g that's affected by the bug or also any older
versions? I don't remember reading anything about this in the last PSU patch
notes for 10g... And yes, there are still people using prehistoric
technology like 10g! ;-)

Matthias Hoys

Matthias Hoys

Jan 19, 2012, 3:20:23 PM

"Mladen Gogala" <gogala.REMO...@google.com> wrote in message
news:jf9kek$gk8$1...@solani.org...

> It appears that Oracle 11G has a rather serious bug: it wraps SCN numbers
> around, when doing begin/end backup type of backup:
>
> http://tinyurl.com/6wbker6
>

Strange, these are the patch notes for the PSU 11.2.0.3.1 (JAN2012),
released a couple of days ago:

PSU 11.2.0.3.1 contains the following new fixes:

Automatic Storage Management
9703627 - 11.2.0.2: ROOT USE OF ASMCMD PLACES ALERT.LOG IN USER DIRECTORY
12620823 - SOL-SP64-11203:ASM INSTANCE HANG DURING CRS STACK STARTING ON THE SECOND NODE
12797765 - SOL_SP64: AFTER ALL DISKS FAILURE, DG CAN'T BE DISMOUNTED ON T2000-3
12905058 - REBOOT 2 CELL NODES, CHECKFILE FOUND CORRUPTION BLOCK IN 3 UNDO DATAFILES
12938841 - 11203_ASM_SOL_SP64:RACE BETWEEN ADD DISK AND DISMOUNT MAY CAUSE KFGUSENUM01
12950644 - RBAL HIT ORA-07445:[KFDGLOBALOPEN()+738], ASM INST ABORT

Generic
9873405 - ORA-600 DURING FAST REFRESH AFTER 11.2.0.1.0 TO 11.2.0.2.0 UPDATE.

High Availability
12718090 - LNX64-11203-RAC:DB FG RROC HIT ORA-00600[KCLCHKBLK_3]
12834027 - ORA-00600 [KJBMPRLST:SHADOW] & [KJBRASR:PKEY] IN A READ MOSTLY & SKIP LOCK ENV
12847466 - AROLTP-C: HANG SIGNATURE: 'GC CURRENT REQUEST'<='GC BUFFER BUSY ACQUIRE'
12861463 - RAC PERF: DEFAULT VALUE FOR _LM_SINGLE_INST_AFFINITY_LOCK SHOULD BE FALSE
12917230 - QUERY WITH TEMP TABLE TRANSFORMATION RUNS 5X SLOWER WAITING FOR REMASTERING
12998795 - AROLTP-C: HANG SIGNATURE: 'GC CURRENT REQUEST'<='GC BUFFER BUSY ACQUIRE'
13035804 - LACK OF DLM PSEUDO RECONFIGURATION TEXTUAL REASON

Oracle Space Management
13041324 - HCC ON ZFS AND PILLAR STORAGE
13492735 - DISALLOW ADDING NON-HCC DATAFILE TO HCC TABLESPACE

Oracle Virtual Operating System Services
13362079 - HCC SHOULD NOT BE ENABLED FOR NON ZFS/ PILLAR STORAGE ARRAY

So, where's the SCN bug fix??

Matthias Hoys

Noons

Jan 19, 2012, 4:35:49 PM

On Jan 20, 7:20 am, "Matthias Hoys" <a...@spam.com> wrote:

>
> So, where's the SCN bug fix??
>

wanna bet there will be a patch to the PSU, very soon?

joel garry

Jan 19, 2012, 7:09:28 PM

On Jan 19, 12:11 pm, "Matthias Hoys" <a...@spam.com> wrote:
> "Mladen Gogala" <gogala.REMOVETHISmla...@google.com> wrote in message
> news:jf9kek$gk8$1...@solani.org...
>
> > It appears that Oracle 11G has a rather serious bug: it wraps SCN numbers
> > around, when doing begin/end backup type of backup:
> >
> > http://tinyurl.com/6wbker6
> >
> > --
> > http://mgogala.freehostia.com
>
> I wonder if it's only 11g that's affected by the bug or also any older
> versions? I don't remember reading anything about this in the last PSU patch
> notes for 10g... And yes, there are still people using prehistoric
> technology like 10g! ;-)
>
> Matthias Hoys

Note the link I posted used 9i. And we have another Wartiki-wannabe:
http://blogs.oracle.com/UPGRADE/entry/fundamental_oracle_flaw_revealed_really

Mladen, see Bug 12371955 - Hot Backup can cause increased SCN growth
rate leading to ORA-600 [2252] errors [ID 12371955.8]
I think there is confusion because that was in 11.2.0.3, but is also
available as a patch 12371955 for earlier versions. They don't seem
to put the old patches in the new listing you posted.

jg
--
@home.com is bogus.

http://www.informationweek.com/news/security/app-security/232500111

joel garry

Jan 19, 2012, 7:13:44 PM

On Jan 19, 4:09 pm, joel garry <joel-ga...@home.com> wrote:
> On Jan 19, 12:11 pm, "Matthias Hoys" <a...@spam.com> wrote:

>
> Mladen, see Bug 12371955 - Hot Backup can cause increased SCN growth

Sorry, that was meant to be Matthias, getting a bit blurry-eyed with
all this.

mhoys

Jan 20, 2012, 3:39:11 AM

On Jan 20, 1:09 am, joel garry <joel-ga...@home.com> wrote:
> On Jan 19, 12:11 pm, "Matthias Hoys" <a...@spam.com> wrote:
>
> > "Mladen Gogala" <gogala.REMOVETHISmla...@google.com> wrote in message
> > news:jf9kek$gk8$1...@solani.org...
> >
> > > It appears that Oracle 11G has a rather serious bug: it wraps SCN numbers
> > > around, when doing begin/end backup type of backup:
> > >
> > > http://tinyurl.com/6wbker6
> > >
> > > --
> > > http://mgogala.freehostia.com
> >
> > I wonder if it's only 11g that's affected by the bug or also any older
> > versions? I don't remember reading anything about this in the last PSU patch
> > notes for 10g... And yes, there are still people using prehistoric
> > technology like 10g! ;-)
> >
> > Matthias Hoys
>
> Note the link I posted used 9i. And we have another Wartiki-wannabe:
> http://blogs.oracle.com/UPGRADE/entry/fundamental_oracle_flaw_reveale...
>
> Mladen, see Bug 12371955 - Hot Backup can cause increased SCN growth
> rate leading to ORA-600 [2252] errors [ID 12371955.8]
> I think there is confusion because that was in 11.2.0.3, but is also
> available as a patch 12371955 for earlier versions. They don't seem
> to put the old patches in the new listing you posted.
>
> jg
> --
> @home.com is bogus.
> http://www.informationweek.com/news/security/app-security/232500111

Thanks, that MOS article helped to clear up the confusion a bit :-)
Looks like the bug was already fixed in the 11.2.0.3 server patch set.

And this is what they say about pre-11g versions:

"This fix is *NOT* required in any release prior to 11g.
For 11g onwards this fix is already included in various Patch Set
Updates and bundles as listed above."

*getting even more confused*

Matthias Hoys

joel garry

Jan 20, 2012, 12:03:07 PM

As I understand it, there are several issues, working together. The
SCN being propagated among distributed databases appears to have been
around a long time, but never really had a problem because of the
large scale of the variable. The bug that congealed the problem seems
to be the begin database backup which would elevate the SCN too fast.
That would only really be a problem for a large system with many links
and much usage of bcp style backups, where people would backup whole
dbs with a snapshot, rather than tablespaces, and the SCN jumps
propagating would multiply the problem. Since it could happen, but
usually doesn't, they distribute a script to say red, amber or green
light, so most people get warm and fuzzy green lights.
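
A toy sketch of the propagation described above, in Python. It assumes, as this post describes, that when two databases talk over a link both sides carry on from the higher of the two SCNs. The class and function names are hypothetical, the numbers are made up, and none of this is Oracle code.

```python
# Toy model only: names and numbers are hypothetical, not an Oracle API.
class Database:
    def __init__(self, name, scn):
        self.name = name
        self.scn = scn


def sync_over_link(a, b):
    """Assumed behaviour: after a distributed call, both sides
    continue from the higher of the two SCNs."""
    highest = max(a.scn, b.scn)
    a.scn = b.scn = highest


def propagate(links):
    """Exercise every link once, in the order given."""
    for a, b in links:
        sync_over_link(a, b)


hub = Database("hub", 1_000)
branch = Database("branch", 1_200)
# One database whose SCN has been pushed far ahead of the others.
obscure = Database("obscure", 9_000_000)

propagate([(obscure, branch), (branch, hub)])
print(hub.scn, branch.scn, obscure.scn)
```

In this model a single inflated SCN reaches every database that is connected to it, directly or through intermediaries, which is why many links multiply the problem.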

But now that we know that, it is a simple matter to poison a system by
hacking the controlfiles of an obscure database, then propagate with a
mere access over a link. You don't need the unpatched backup to have
the problem happen, someone can make it happen. It may just be a
matter of time until it gets to the script-kiddie point (I haven't
looked yet this morning).

jg
--
@home.com is bogus.

http://www.wired.com/wiredenterprise/2012/01/intel-oracle-hp/

Mark D Powell

Jan 24, 2012, 12:46:18 PM

> @home.com is bogus.
> http://www.wired.com/wiredenterprise/2012/01/intel-oracle-hp/

This bug is not nearly as risky as the InfoWorld article made out. On
its own it is not likely to occur. As far as a DoS attack goes, if you
have proper control of your network and do not allow remote, uncontrolled
databases to link into yours, then you can wait the time it takes to
upgrade/patch to a protected version in the normal course of business.

You can implement monitoring of your SCN and spit out an alert
or other form of warning message to identify an attack taking place.
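
A minimal sketch of such a check, in Python. It assumes the commonly cited soft limit of 16,384 SCNs per second counted from 1988-01-01 and uses plain calendar seconds, so it only approximates whatever Oracle computes internally. The function names and the 62-day threshold are illustrative. The current SCN would come from a query such as `SELECT current_scn FROM v$database`; here it is simply passed in.

```python
from datetime import datetime

SCN_PER_SECOND = 16_384          # assumed soft-limit growth rate
EPOCH = datetime(1988, 1, 1)     # assumed starting point of the soft limit


def max_reasonable_scn(now):
    """Approximate soft limit for the SCN at time `now`."""
    return int((now - EPOCH).total_seconds()) * SCN_PER_SECOND


def headroom_days(current_scn, now):
    """Gap between the soft limit and the current SCN,
    expressed in days of soft-limit growth."""
    gap = max_reasonable_scn(now) - current_scn
    return gap / (SCN_PER_SECOND * 86_400)


def check(current_scn, now, warn_below_days=62.0):
    """Return 'ALERT' when headroom drops below the illustrative threshold."""
    if headroom_days(current_scn, now) < warn_below_days:
        return "ALERT"
    return "OK"
```

Run from a scheduler as `check(current_scn, datetime.now())`, a sudden drop in headroom between two runs is the kind of jump worth alerting on.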

IMHO -- Mark D Powell --

joel garry

Jan 24, 2012, 5:58:55 PM

> > @home.com is bogus.
> > http://www.wired.com/wiredenterprise/2012/01/intel-oracle-hp/
>
> This bug is not nearly as risky as the InfoWorld article made out. On
> its own it is not likely to occur. As far as a DoS attack goes, if you
> have proper control of your network and do not allow remote, uncontrolled
> databases to link into yours, then you can wait the time it takes to
> upgrade/patch to a protected version in the normal course of business.
>
> You can implement monitoring of your SCN and spit out an alert
> or other form of warning message to identify an attack taking place.
>
> IMHO -- Mark D Powell --

Ah, I missed the bit about ORA-600 on the victim db if you went over a
reasonable SCN. Going through Bug 11767824: HIGH SCN VALUES / ORA-600
[2252] ERRORS while trying to understand what Jonathan said in some
places helped me understand much more. That shows the issue was there
in 10g, even if that was kind of solved (or at least known and
trackable) and then made worse with the backup bug in 11g.

Still glad I hadn't gone to snapshots and 11g though, even if only
through the luck of the budget-deprived. I'm definitely on the lag
side of http://mwidlake.wordpress.com/2012/01/20/friday-philosophy-lead-or-lag-when-to-upgrade/

You, Billy and Jonathan have brought me over to the tempest in a
teapot view, thanks.

jg
--
@home.com is bogus.

http://online.wsj.com/article/BT-CO-20120124-711190.html

Mark D Powell

Jan 25, 2012, 10:56:29 AM

According to InfoWorld, a partial patch is in the January 2012 CPU. The magazine has issued an update which mentions another potential way the issue can be triggered. The actual bug related to manual hot backups is apparently limited to two releases per the article, which also contains a link to an article on what the SCN is and how Oracle uses it.

http://www.infoworld.com/d/security/the-oracle-flaw-clarifications-and-more-information-184775

HTH -- Mark D Powell --

Mladen Gogala

Jan 25, 2012, 11:46:31 AM

Thanks, Mark. Fascinating reading. Good thing is that Infoworld is fair
and balanced.

--
http://mgogala.byethost5.com