Account Options

  1. Sign in
The old Google Groups will be going away soon, but your browser is incompatible with the new version.
Google Groups Home
« Groups Home
Ingres 9.2 - Why does Ingres think these are the same?
There are currently too many topics in this group that display first. To make this topic appear first, remove this option from another topic.
There was an error processing your request. Please try again.
flag
  21 messages - Collapse all  -  Translate all to Translated (View all originals)
The group you are posting to is a Usenet group. Messages posted to this group will make your email address visible to anyone on the Internet.
Your reply message has not been sent.
Your post was successful
 
From:
To:
Cc:
Followup To:
Add Cc | Add Followup-to | Edit Subject
Subject:
Validation:
For verification purposes please type the characters you see in the picture below or the numbers you hear by clicking the accessibility icon. Listen and type the numbers you hear
 
Dennis  
View profile  
 More options Jan 15, 6:51 pm
Newsgroups: comp.databases.ingres
From: Dennis <droes...@comcast.net>
Date: Sun, 15 Jan 2012 16:51:30 -0700
Local: Sun, Jan 15 2012 6:51 pm
Subject: Ingres 9.2 - Why does Ingres think these are the same?
The value for varchar colx in two different rows are
'AAA_BBB_12345' and 'AAA BBB 12345'.

Ingres 9.2.1 (a64.lnx/103)NPTL thinks these are the same, but Ingres 10
doesn't.

Ingres 9.2 wont allow this table to be created as btree unique on colx.

Is this a bug or am I missing something?

Thanks

Dennis


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Ingres Forums  
View profile  
 More options Jan 16, 2:36 am
Newsgroups: comp.databases.ingres
From: Ingres Forums <info-ing...@kettleriverconsulting.com>
Date: Mon, 16 Jan 2012 01:36:05 -0600
Local: Mon, Jan 16 2012 2:36 am
Subject: Re: Ingres 9.2 - Why does Ingres think these are the same?

Could you please perform this query and let us know the result?
SELECT DBMSINFO('unicode_level')\p\g

--
bilgihan
------------------------------------------------------------------------
bilgihan's Profile: http://community.actian.com/forum/member.php?userid=8658
View this thread: http://community.actian.com/forum/showthread.php?t=14157


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Ingres Forums  
View profile  
 More options Jan 16, 4:04 am
Newsgroups: comp.databases.ingres
From: Ingres Forums <info-ing...@kettleriverconsulting.com>
Date: Mon, 16 Jan 2012 03:04:25 -0600
Local: Mon, Jan 16 2012 4:04 am
Subject: Re: Ingres 9.2 - Why does Ingres think these are the same?

Is your where clause *WHERE colx = 'AAA_BBB_12345'* or is it *WHERE colx
LIKE 'AAA_BBB_12345'*?

LIKE treats the underscore as a wildcard for any single character unless
you escape it (*WHERE colx LIKE 'AAA\_BBB\_12345' ESCAPE '\'*) .

--
rhann
------------------------------------------------------------------------
rhann's Profile: http://community.actian.com/forum/member.php?userid=131
View this thread: http://community.actian.com/forum/showthread.php?t=14157


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Martin Bowes  
View profile  
 More options Jan 16, 4:23 am
Newsgroups: comp.databases.ingres
From: Martin Bowes <martin.bo...@ctsu.ox.ac.uk>
Date: Mon, 16 Jan 2012 09:23:48 +0000
Local: Mon, Jan 16 2012 4:23 am
Subject: Re: [Info-Ingres] Ingres 9.2 - Why does Ingres think these are the same?
HI Dennis

On ordinary databases: createdb bowtest

I get the same answer on both 9.2.0 and 10.0.0 on a64/lnx.

In both cases the rows sort in order:
AAA BBB 12345
AAA_BBB_12345

You possibly used a different collation sequence on the different installations?

Martin Bowes


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Martin Bowes  
View profile  
 More options Jan 16, 5:03 am
Newsgroups: comp.databases.ingres
From: Martin Bowes <martin.bo...@ctsu.ox.ac.uk>
Date: Mon, 16 Jan 2012 10:03:02 +0000
Local: Mon, Jan 16 2012 5:03 am
Subject: Re: [Info-Ingres] Ingres 9.2 - Why does Ingres think these are the same?
Hi All,

I've mucked around with this further....
I can replicate the error(?) if the database is unicode enabled: createdb -i bowtest

And only then if the sort column is nvarchar not varchar.

create table x(a nvarchar(100) not null not default);

    insert into x values('AAA BBB 12345');
    insert into x values('AAA_BBB_12345');

    modify x to btree unique on a;

    select * from x;

The modify will generate the error:

E_US1591 MODIFY: table could not be modified because rows contain duplicate keys.

I get this on both 9.2 and on 10.0.

Marty


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
eric.misc...@w-link.net  
View profile  
 More options Jan 16, 1:07 am
Newsgroups: comp.databases.ingres
From: eric.misc...@w-link.net
Date: Sun, 15 Jan 2012 22:07:52 -0800 (PST)
Local: Mon, Jan 16 2012 1:07 am
Subject: Re: [Info-Ingres] Ingres 9.2 - Why does Ingres think these are the same?
Ingres 9.2 is evaluating the underscore "_" as the wildcard character.


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Ian Kirkham  
View profile  
 More options Jan 16, 7:26 am
Newsgroups: comp.databases.ingres
From: "Ian Kirkham" <Ian.Kirk...@actian.com>
Date: Mon, 16 Jan 2012 07:26:42 -0500
Local: Mon, Jan 16 2012 7:26 am
Subject: Re: [Info-Ingres] Ingres 9.2 - Why does Ingres think these are the same?
No it isn't - pattern mating is only performed with the pattern
operators LIKE and SIMILAR TO.

This behaviour will likely be due to a UTF-8 installation and hence
Unicode collation enabled on the column in question.

Regards,
Ian


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Kristoff  
View profile  
 More options Jan 16, 8:16 am
Newsgroups: comp.databases.ingres
From: Kristoff <kristoff.pic...@ingres.com>
Date: Mon, 16 Jan 2012 05:16:50 -0800 (PST)
Local: Mon, Jan 16 2012 8:16 am
Subject: Re: Ingres 9.2 - Why does Ingres think these are the same?
It depends on the unicode collation sequence in use. The default
unicode collation sequence used by Ingres treats underscore and space
as the same.
You can use the udefault5 collation sequence instead (createdb -
iudefault5 dbname), which distinguishes between these characters (and
some more).
Not sure at the moment whether this was available in the first 9.2 GA
release, but at least available by patch and in 10.0.

By the way, when using a UTF8 installation you will see the problem
with varchar too.

Kristoff

On Jan 16, 11:03 am, Martin Bowes <martin.bo...@ctsu.ox.ac.uk> wrote:


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Discussion subject changed to "Ingres 9.2 - Why does Ingres think these are thesame?" by Emma K. McGrattan
Emma K. McGrattan  
View profile  
 More options Jan 16, 8:44 am
Newsgroups: comp.databases.ingres
From: "Emma K. McGrattan" <Emma.McGrat...@actian.com>
Date: Mon, 16 Jan 2012 13:44:23 +0000
Local: Mon, Jan 16 2012 8:44 am
Subject: Re: [Info-Ingres] Ingres 9.2 - Why does Ingres think these are thesame?
Thanks, Kristoff!  I wasn't aware of this

-- Emma

PS Getting used to my iPhone. Excuse typos and nonsensical auto corrects!

On Jan 16, 2012, at 13:38, "Kristoff" <kristoff.pic...@ingres.com> wrote:


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Discussion subject changed to "Ingres 9.2 - Why does Ingres think these are the same?" by Dennis
Dennis  
View profile  
 More options Jan 16, 10:56 am
Newsgroups: comp.databases.ingres
From: Dennis <droes...@comcast.net>
Date: Mon, 16 Jan 2012 08:56:57 -0700
Local: Mon, Jan 16 2012 10:56 am
Subject: Re: Ingres 9.2 - Why does Ingres think these are the same?
On 1/16/2012 12:36 AM, Ingres Forums wrote:

> Could you please perform this query and let us know the result?
> SELECT DBMSINFO('unicode_level')\p\g

This returns 1

Dennis


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Dennis  
View profile  
 More options Jan 16, 11:04 am
Newsgroups: comp.databases.ingres
From: Dennis <droes...@comcast.net>
Date: Mon, 16 Jan 2012 09:04:52 -0700
Local: Mon, Jan 16 2012 11:04 am
Subject: Re: Ingres 9.2 - Why does Ingres think these are the same?
On 1/16/2012 6:16 AM, Kristoff wrote:

> It depends on the unicode collation sequence in use. The default
> unicode collation sequence used by Ingres treats underscore and space
> as the same.
> You can use the udefault5 collation sequence instead (createdb -
> iudefault5 dbname), which distinguishes between these characters (and
> some more).

It looks like alterdb will let me change the collation sequence, but am
I better off to unload, drop, createdb and reload?

This database was created with createdb -i and II_CHARSET is UTF8.  The
database was created the same way on Ingres 10 as well.

> Not sure at the moment whether this was available in the first 9.2 GA
> release, but at least available by patch and in 10.0.

> By the way, when using a UTF8 installation you will see the problem
> with varchar too.

These are declared as varchar.

Thanks

Dennis


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Ingres Forums  
View profile  
 More options Jan 17, 3:21 am
Newsgroups: comp.databases.ingres
From: Ingres Forums <info-ing...@kettleriverconsulting.com>
Date: Tue, 17 Jan 2012 02:21:27 -0600
Local: Tues, Jan 17 2012 3:21 am
Subject: Re: Ingres 9.2 - Why does Ingres think these are the same?

This confirms unicode is being used here.
'Knowlege Base | How to Determine if Unicode Enabled? - 411686 -
Actian' (http://www.actian.com/kb/article/411686)

This was also the most of discussion topic yesterday.

--
bilgihan
------------------------------------------------------------------------
bilgihan's Profile: http://community.actian.com/forum/member.php?userid=8658
View this thread: http://community.actian.com/forum/showthread.php?t=14157


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Dennis  
View profile  
 More options Jan 17, 5:38 am
Newsgroups: comp.databases.ingres
From: Dennis <droes...@comcast.net>
Date: Tue, 17 Jan 2012 03:38:57 -0700
Local: Tues, Jan 17 2012 5:38 am
Subject: Re: Ingres 9.2 - Why does Ingres think these are the same?
On 1/16/2012 6:16 AM, Kristoff wrote:

> It depends on the unicode collation sequence in use. The default
> unicode collation sequence used by Ingres treats underscore and space
> as the same.
> You can use the udefault5 collation sequence instead (createdb -
> iudefault5 dbname), which distinguishes between these characters (and
> some more).
> Not sure at the moment whether this was available in the first 9.2 GA
> release, but at least available by patch and in 10.0.

Is it safe to copy the udefault5 file from an Ingres 10 installation to
an Ingres 9.2 installation, or does this require some other patch to use
this in Ingres 9.2?

Dennis


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Kristoff  
View profile  
 More options Jan 17, 10:46 am
Newsgroups: comp.databases.ingres
From: Kristoff <kristoff.pic...@ingres.com>
Date: Tue, 17 Jan 2012 07:46:21 -0800 (PST)
Local: Tues, Jan 17 2012 10:46 am
Subject: Re: Ingres 9.2 - Why does Ingres think these are the same?
On Jan 17, 11:38 am, Dennis <droes...@comcast.net> wrote:

To be safe, copy the file udefault5.uce (in $II_SYSTEM/ingres/files/
collation/) to the 9.2 installation into the same directory and then
compile the collation sequence using:
aducompile udefault5.uce udefault5 -u

Kristoff


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
James K. Lowden  
View profile  
 More options Jan 18, 6:31 pm
Newsgroups: comp.databases.ingres
From: "James K. Lowden" <jklow...@schemamania.org>
Date: Wed, 18 Jan 2012 18:31:01 -0500
Local: Wed, Jan 18 2012 6:31 pm
Subject: Re: [Info-Ingres] Ingres 9.2 - Why does Ingres think these are the same?
On Mon, 16 Jan 2012 05:16:50 -0800 (PST)

Kristoff <kristoff.pic...@ingres.com> wrote:
> The default unicode collation sequence used by Ingres treats
> underscore and space as the same.

That strikes me as somewhat ... counterintuitive.  

How is such a collation advantageous?  Is there a Unicode
recommendation that led Ingres^WActian to choose it?  

--jkl


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Kristoff  
View profile  
 More options Jan 19, 3:30 am
Newsgroups: comp.databases.ingres
From: Kristoff <kristoff.pic...@ingres.com>
Date: Thu, 19 Jan 2012 00:30:14 -0800 (PST)
Local: Thurs, Jan 19 2012 3:30 am
Subject: Re: Ingres 9.2 - Why does Ingres think these are the same?
On Jan 19, 12:31 am, "James K. Lowden" <jklow...@schemamania.org>
wrote:

> On Mon, 16 Jan 2012 05:16:50 -0800 (PST)

> Kristoff <kristoff.pic...@ingres.com> wrote:
> > The default unicode collation sequence used by Ingres treats
> > underscore and space as the same.

> That strikes me as somewhat ... counterintuitive.

> How is such a collation advantageous?  Is there a Unicode
> recommendation that led Ingres^WActian to choose it?

> --jkl

Don't know the advantage of that, but it was the official standard
collation sequence for Unicode 2.1 (http://www.unicode.org/versions/
components-2.1.9.html), fairly outdated now.

Kristoff


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
James K. Lowden  
View profile  
 More options Jan 21, 4:36 pm
Newsgroups: comp.databases.ingres
From: "James K. Lowden" <jklow...@schemamania.org>
Date: Sat, 21 Jan 2012 16:36:50 -0500
Local: Sat, Jan 21 2012 4:36 pm
Subject: Re: [Info-Ingres] Ingres 9.2 - Why does Ingres think these are the same?
On Thu, 19 Jan 2012 00:30:14 -0800 (PST)

Kristoff <kristoff.pic...@ingres.com> wrote:
> On Jan 19, 12:31 am, "James K. Lowden" <jklow...@schemamania.org>
> wrote:
> > On Mon, 16 Jan 2012 05:16:50 -0800 (PST)
> > Kristoff <kristoff.pic...@ingres.com> wrote:

> > > The default unicode collation sequence used by Ingres treats
> > > underscore and space as the same.

> > How is such a collation advantageous?  

> Don't know the advantage of that, but it was the official standard
> collation sequence for Unicode 2.1 (http://www.unicode.org/versions/
> components-2.1.9.html), fairly outdated now.

Hi Kristoff,

AIUI the default collation for blank and underscore are not the same.  
Maybe I'm misinterpreting what you said or what it says.  

        http://unicode.org/collation/
points to
        http://www.unicode.org/reports/tr10/,
the Unicode Collation Algorithm, which references
        http://www.unicode.org/reports/tr10/#Allkeys
the allkeys file, the Default Unicode Collation Element Table,
which says:

$ grep -E '^00.+(LOW LINE|SPACE)$' allkeys.txt  | head
0020  ; [*020A.0020.0002.0020] # SPACE
005F  ; [*021B.0020.0002.005F] # LOW LINE

They sure don't *look* the same.  They differ on the first level and
the fourth, which the UCA says is "computable".  

The '*' indicates a "variable collation elements", which
the standard says "can be either treated as ignorables or not".  These
constitute 22% of the Basic Latin page, 58 characters.  

Is it the case that Ingres treats all such elements as ignorable?

I read
http://docs.actian.com/ingres/10.0/system-administrator-guide/3937-su...
but it doesn't describe a general Unicode collation.  

For those following along at home, the UCA also says sequences of
igorables are all ignored e.g.,

        "This is some text"

would sort equal to any of

        "  This is some text"
        "This     is       some        text"
        "Thisissometext"
        "This__is__some__text"

The whole set of ignorables includes most punctuation, so an
implementation could also fairly treat these as also equivalent:

        "This is some text?"
        "(This is some text)"
        "This is *some* text!"

I'm not saying this is what Ingres is doing.  I'm saying this is a
valid interpretation of what I understand to be the controlling
standard.  And that nothing in the UCA DUCET standard suggests blanks
and underscores must have the same sorting value.  

--jkl


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Kristoff  
View profile  
 More options Jan 23, 4:21 am
Newsgroups: comp.databases.ingres
From: Kristoff <kristoff.pic...@ingres.com>
Date: Mon, 23 Jan 2012 01:21:09 -0800 (PST)
Local: Mon, Jan 23 2012 4:21 am
Subject: Re: Ingres 9.2 - Why does Ingres think these are the same?
On Jan 21, 10:36 pm, "James K. Lowden" <jklow...@schemamania.org>
wrote:

Hi James,
I'm not speaking about the actual default collation at unicode.org,
but the old one which came with the 2.0 standard. Take a look at:
http://unicode.org/Public/UCA/2.1.9/basekeys-2.1.9.txt

$egrep '^005F|^0020' basekeys-2.1.9.txt

0020 ; [*0209.0020.0002.0020] # SPACE
005F ; [*0209.0021.0002.005F] # LOW LINE; COMPATSEQ

So in this old collation sequence the first weight of both chars is
the same.

Kristoff


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Ian Kirkham  
View profile  
 More options Jan 23, 4:38 am
Newsgroups: comp.databases.ingres
From: "Ian Kirkham" <Ian.Kirk...@actian.com>
Date: Mon, 23 Jan 2012 04:38:57 -0500
Local: Mon, Jan 23 2012 4:38 am
Subject: Re: [Info-Ingres] Ingres 9.2 - Why does Ingres think these are the same?
Hi James,
You are correct in your reading of *current* DUCET tables but as Kristoff said 9.2 Ingres is based on an *older* DUCET which did collate space and underscore. If you don't like it, you can change it.
Regards,
Ian


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
James K. Lowden  
View profile  
 More options Jan 24, 12:03 am
Newsgroups: comp.databases.ingres
From: "James K. Lowden" <jklow...@schemamania.org>
Date: Tue, 24 Jan 2012 00:03:39 -0500
Local: Tues, Jan 24 2012 12:03 am
Subject: Re: [Info-Ingres] Ingres 9.2 - Why does Ingres think these are the same?
On Mon, 23 Jan 2012 01:21:09 -0800 (PST)

Thanks for the clarification, Kristoff.  If I may, did the standard
also change how/when the second field comes into play?  AIUI
the current standard, each level is a tie-breaker for the the one
before. I won't go into detail now in case it's irrelevant.  

To the OP's question, is the Unicode collation user modfiable (without a
compiler) or contemplated for a future release?  

Regards,

--jkl


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
Kristoff  
View profile  
 More options Jan 24, 4:24 am
Newsgroups: comp.databases.ingres
From: Kristoff <kristoff.pic...@ingres.com>
Date: Tue, 24 Jan 2012 01:24:20 -0800 (PST)
Local: Tues, Jan 24 2012 4:24 am
Subject: Re: Ingres 9.2 - Why does Ingres think these are the same?
On Jan 24, 6:03 am, "James K. Lowden" <jklow...@schemamania.org>
wrote:

Yes, the collation can be changed - there is no "true" collation which
would fit into any language, so actually it is expected that user will
customize it.
For Ingres you need to compile the collation sequence, but the
compiler (aducompile) is delivered with the Ingres distribution.

Be careful when creating your own sequence or when using the very
latest available at unicode.org. Currently Ingres does not understand
all the actual unicode features. If you look at the udefault5
collation which is now delivered in addition to the old one, it is not
the original one coming with Unicode 5.0, it only use the weights
defined in that version, but only contains those characters defined in
standard one coming wit Ingres.

Kristoff


 
You must Sign in before you can post messages.
To post a message you must first join this group.
Please update your nickname on the subscription settings page before posting.
You do not have the permission required to post.
End of messages
« Back to Discussions « Newer topic     Older topic »