SQL: Working with huge tables of chronological data

John

unread,

Apr 5, 2007, 10:27:23 AM4/5/07

to

Hi,

I'm trying to figure out an efficient way to search for the non
existence of events in chronological data with SQL. The goal (detailed
below) seems pretty simple but so far it looks like it's tricky to do
with Oracle. Here's my problem:

I'm working with 2 simple but huge tables each recording a different
kind of event associated with a timestamp. For instance:

Table A
(03:50pm, A1)
(03:55pm, A2)
(03:58pm, A3)

Table B
(03:51pm, B1)
(04:00pm, B2)

I'm looking for all the chronological sequences (Ax, Ay) where no B
event is present between Ax and Ay. In this example, the result would
be (A2, A3).

I've been searching actively for an efficient solution for this
problem and I couldn't find any fast enough. Do you have any idea?

Thanks a lot,

John

Ana C. Dent

unread,

Apr 5, 2007, 10:51:28 AM4/5/07

to

"John" <acide.as...@gmail.com> wrote in news:1175783243.167337.195580
@n59g2000hsh.googlegroups.com:

> Hi,
>
> I'm trying to figure out an efficient way to search for the non
> existence of events in chronological data with SQL. The goal (detailed
> below) seems pretty simple but so far it looks like it's tricky to do
> with Oracle. Here's my problem:
>
> I'm working with 2 simple but huge tables each recording a different
> kind of event associated with a timestamp. For instance:
>
> Table A
> (03:50pm, A1)
> (03:55pm, A2)
> (03:58pm, A3)
>
> Table B
> (03:51pm, B1)
> (04:00pm, B2)
>
> I'm looking for all the chronological sequences (Ax, Ay) where no B
> event is present between Ax and Ay. In this example, the result would

I do not understand the logic that would lead the answer above;
since none of the timestamps in Table A match any timestamp in Table B.

> be (A2, A3).
>
> I've been searching actively for an efficient solution for this
> problem and I couldn't find any fast enough. Do you have any idea?

How fast is fast enough?
How do we know what you tried & deemed unacceptable?

DA Morgan

unread,

Apr 5, 2007, 10:59:25 AM4/5/07

to

To me your example and explantion make no sense.

What Ax and Ay? Please try for more clarity with your explanation.
--
Daniel A. Morgan
University of Washington
damo...@x.washington.edu
(replace x with u to respond)
Puget Sound Oracle Users Group
www.psoug.org

John

unread,

Apr 5, 2007, 11:29:18 AM4/5/07

to

Thanks for your answers, here is a clarification:

- Ax and Ay are any different A events (A1, A2...). But since I'm
looking for chronological sequences, Ay has to happen after Ax.

- If I merge the two tables A and B while respecting the chronology,
this would lead to the following. I've also add an additionnal A4
event here to clarify.

Table A
(03:50pm, A1)
(03:55pm, A2)
(03:58pm, A3)

(03:59pm, A4)

Table B
(03:51pm, B1)
(04:00pm, B2)

Chronology
(03:50pm, A1)
(03:51pm, B1)

(03:55pm, A2)
(03:58pm, A3)

(03:59pm, A4)
(04:00pm, B2)

I'm looking for all the sequences of events A in the chronology with
no B event in the middle. Here the results would be:
(A2, A3) ; (A2, A4) and (A3, A4)

Thanks!

John

> damor...@x.washington.edu

> (replace x with u to respond)

> Puget Sound Oracle Users Groupwww.psoug.org- Hide quoted text -
>
> - Show quoted text -

John

unread,

Apr 5, 2007, 12:02:10 PM4/5/07

to

Thanks for your answer,

Here are the technical details and the query I've been using so far.

TableA is ~100 millions row and contains (timestamp, evtA)
TableB is ~30 millions row and contains (timestamp, evtB)

The following query took ~60h (on a private but quite slow server) to
compute. ~1h is what I'm aiming to.

select TA1_B.evtA, TA2.evtA
from
(
select TA1.evtA, TA1.timestamp timeA1, TB.evtB, min(TB.timestamp)
min_timeB
from tableA TA1 left outer join tableB TB on (TA1.timestamp <
TB.timestamp)
group by TA1.evtA, TA1.timestamp, TB.evtB
) TA1_B,
tableA TA2
where
TA1_B.timeA1 < TA2.timestamp
and (TA2.timestamp < TA1_B.min_timeB or TA1_B.min_timeB is null)
and TA1_B.evtA <> TA2.evtA;

Thanks!

John

On Apr 5, 10:51 am, "Ana C. Dent" <anaced...@hotmail.com> wrote:
> "John" <acide.ascorbi...@gmail.com> wrote in news:1175783243.167337.195580

> How do we know what you tried & deemed unacceptable?- Hide quoted text -

DA Morgan

unread,

Apr 5, 2007, 2:48:08 PM4/5/07

to

Sorry but your answer begets more questions:

Is this the real problem or a simplification?
Is this something that will be run once or repeatedly?
Is it possible for the same time to be in A and B?
Is it possible to have a B before an A beginning the sequence?
Is it possible for there to be multiple Bs between As?

And I am not at all surprised it is taking a lot of time.

--
Daniel A. Morgan
University of Washington

damo...@x.washington.edu

John

unread,

Apr 5, 2007, 3:50:37 PM4/5/07

to

> Is this the real problem or a simplification?

It's a simplification but not that much. The real problem involves
user_ids but this part can be skipped here.

> Is this something that will be run once or repeatedly?

Only once.

> Is it possible for the same time to be in A and B?

No, A and B are completely different data.

> Is it possible to have a B before an A beginning the sequence?
> Is it possible for there to be multiple Bs between As?

Yes everything is possible, A events and B events happen
independently.

Thanks for being interested in my problem!

John

Jonathan Lewis

unread,

Apr 5, 2007, 3:50:44 PM4/5/07

to

"John" <acide.as...@gmail.com> wrote in message
news:1175783243.1...@n59g2000hsh.googlegroups.com...

Interesting problem.

The data size and available resources are likely
to make a big difference when testing solutions
for feasibility.

Here's a possibility, start with:

select 'A' flag, event, timestamp from tableA
union all
select 'B', event, timestamp from tableB

Option a)
Order by timestamp. Open a pl/sql cursor
on the result set and walk the data one row
at a time, reporting rows when the current
and previous rows are 'A' rows.

Option b)
Use the analytic lag(,1) function

select
flag, event, prior_event, timestamp, prior_timestamp
from (
select
flag, lag(flag,1) over (order by timestamp) prior_flag,
event, lag(event,1) over (order by timestamp) prior_event
timestamp, lag(timestamp,1) over (order by timestamp) prior_timestamp
from
(the union all query)
where
flag = prior_flag
and flag = 'A' -- if you just want A's without a B in between.
;

I may have some errors in the analtyic code, but I hope
there's enough there to give you the right idea.

Either option will lead to a massive sort operation on
all your data.

--
Regards

Jonathan Lewis
http://jonathanlewis.wordpress.com

Author: Cost Based Oracle: Fundamentals
http://www.jlcomp.demon.co.uk/cbo_book/ind_book.html

The Co-operative Oracle Users' FAQ
http://www.jlcomp.demon.co.uk/faq/ind_faq.html

Charles Hooper

unread,

Apr 5, 2007, 5:07:36 PM4/5/07

to

I would be inclined to not handle this by not just using SQL. You
could potentially have a nearly full Cartesian join on the first table
to itself, for example:
SELECT
A1.V1,
A2.V1
FROM
T1 A1,
T1 A2
WHERE
A1.V1<A2.V1;

A quick setup:
CREATE TABLE T1 (V1 DATE NOT NULL, V2 VARCHAR2(10));
CREATE TABLE T2 (V1 DATE NOT NULL, V2 VARCHAR2(10));

CREATE INDEX T1_IND1 ON T1(V1);
CREATE INDEX T2_IND1 ON T2(V1);

INSERT INTO
T1
SELECT
TRUNC(TRUNC(SYSDATE) + (ROWNUM*2.5/24/60),'MI'),
TO_CHAR(ROWNUM)
FROM
DUAL
CONNECT BY
LEVEL<=1000;

COMMIT;

INSERT INTO
T2
SELECT
TRUNC(TRUNC(SYSDATE) + (ROWNUM*9.415/24/60),'MI'),
TO_CHAR(ROWNUM)
FROM
DUAL
CONNECT BY
LEVEL<=300;

COMMIT;

We now have two tables, T1 and T2, that correspond to your table A and
B, respectively. If we perform a full outer join between these two
table, we obtain all time values in the two tables with no duplicates
(9i+ syntax) (note that TO_CHAR is used to limit the width of the
columns for display purposes):
SELECT
TO_CHAR(T1.V1,'HH24:MI') T1_V1,
T1.V2 T1_V2,
TO_CHAR(T2.V1,'HH24:MI') T2_V1,
T2.V2 T2_V2
FROM
T1 FULL OUTER JOIN T2 ON T1.V1=T2.V1
ORDER BY
NVL(T1.V1,T2.V1);

T1_V1 T1_V2 T2_V1 T2_V2
===== ========== ===== ==========
00:02 1
00:05 2
00:07 3
00:09 1
00:10 4
00:12 5
00:15 6
00:17 7
00:18 2
00:20 8
00:22 9
00:25 10
00:27 11
00:28 3
00:30 12
00:32 13
00:35 14
00:37 15 00:37 4
00:40 16
00:42 17
00:45 18
00:47 19 00:47 5
00:50 20
00:52 21
00:55 22
00:56 6
00:57 23
01:00 24
01:02 25
01:05 26 01:05 7
01:07 27
01:10 28
01:12 29
01:15 30 01:15 8
01:17 31
01:20 32
01:22 33
01:24 9
01:25 34
01:27 35
01:30 36
01:32 37
01:34 10

Extending the above to give more detail:
SELECT
TO_CHAR(NVL(T1.V1,T2.V1),'HH24:MI') TIME_DATE,
DECODE(T1.V1,NULL,'B',NVL2(T2.V1,'AB','A')) TIME_SLOT,
TO_CHAR(T1.V1,'HH24:MI') T1_V1,
T1.V2 T1_V2,
TO_CHAR(T2.V1,'HH24:MI') T2_V1,
T2.V2 T2_V2
FROM
T1 FULL OUTER JOIN T2 ON T1.V1=T2.V1
ORDER BY
NVL(T1.V1,T2.V1);
TIME_DATE TIME_SLOT T1_V1 T1_V2 T2_V1 T2_V2
========= ========= ===== ========== ===== ==========
00:02 A 00:02 1
00:05 A 00:05 2
00:07 A 00:07 3
00:09 B 00:09 1
00:10 A 00:10 4
00:12 A 00:12 5
00:15 A 00:15 6
00:17 A 00:17 7
00:18 B 00:18 2
00:20 A 00:20 8
00:22 A 00:22 9
00:25 A 00:25 10
00:27 A 00:27 11
00:28 B 00:28 3
00:30 A 00:30 12
00:32 A 00:32 13
00:35 A 00:35 14
00:37 AB 00:37 15 00:37 4
00:40 A 00:40 16
00:42 A 00:42 17
00:45 A 00:45 18
00:47 AB 00:47 19 00:47 5
00:50 A 00:50 20
00:52 A 00:52 21
00:55 A 00:55 22
00:56 B 00:56 6

What can we do with the above to avoid the Cartesian join as much as
possible? We can use LEAD to peek at the next set of values:
SELECT
TO_CHAR(TIME_DATE,'HH24:MI') TIME_DATE,
TO_CHAR(LEAD(TIME_DATE,1) OVER (ORDER BY TIME_DATE),'HH24:MI')
NEXT_TIME_DATE,
TIME_SLOT,
LEAD(TIME_SLOT,1) OVER (ORDER BY TIME_DATE) NEXT_TIME_SLOT,
T1_V1,
T1_V2
FROM
(SELECT
NVL(T1.V1,T2.V1) TIME_DATE,
DECODE(T1.V1,NULL,'B',NVL2(T2.V1,'AB','A')) TIME_SLOT,
TO_CHAR(T1.V1,'HH24:MI') T1_V1,
T1.V2 T1_V2,
TO_CHAR(T2.V1,'HH24:MI') T2_V1,
T2.V2 T2_V2
FROM
T1 FULL OUTER JOIN T2 ON T1.V1=T2.V1
ORDER BY
NVL(T1.V1,T2.V1));

TIME_DATE NEXT_TIME_DATE TIME_SLOT NEXT_TIME_SLOT T1_V1 T1_V2
========= ============== ========= ============== ===== ==========
00:02 00:05 A A 00:02 1
00:05 00:07 A A 00:05 2
00:07 00:09 A B 00:07 3
00:09 00:10 B A
00:10 00:12 A A 00:10 4
00:12 00:15 A A 00:12 5
00:15 00:17 A A 00:15 6
00:17 00:18 A B 00:17 7
00:18 00:20 B A
00:20 00:22 A A 00:20 8
00:22 00:25 A A 00:22 9
00:25 00:27 A A 00:25 10
00:27 00:28 A B 00:27 11
00:28 00:30 B A
00:30 00:32 A A 00:30 12
00:32 00:35 A A 00:32 13
00:35 00:37 A AB 00:35 14
00:37 00:40 AB A 00:37 15
00:40 00:42 A A 00:40 16
00:42 00:45 A A 00:42 17
00:45 00:47 A AB 00:45 18
00:47 00:50 AB A 00:47 19
00:50 00:52 A A 00:50 20
00:52 00:55 A A 00:52 21
00:55 00:56 A B 00:55 22
00:56 00:57 B A

Now, if you can scan through the rows returned programmatically,
creating a processing break when TIME_SLOT or NEXT_TIME_SLOT is not A,
you should be able to handle the processing. In this case remember
00:02, since TIME_SLOT is A and NEXT_TIME_SLOT is A, and report 00:02
- 00:05. Process the next line, and remember 00:05 also, since
TIME_SLOT is A and NEXT_TIME_SLOT is A, and output 00:02 - 00:07 and
00:05 - 00:07. Process the next line, either TIME_SLOT is not A or
NEXT_TIME_SLOT is not A, so clear the remembered list and process the
next line. It is quite simple to handle programmatically.

Charles Hooper
PC Support Specialist
K&M Machine-Fabricating, Inc.

DA Morgan

unread,

Apr 5, 2007, 5:37:00 PM4/5/07

to

Charles seems to have provided a good starting point for you.
The reason for all my questions is that without knowing the business
rules, now clear, it would be easy to point you in the wrong direction.

But I take issue, at least in theory, with your response that the same
times can not occur in A and B. They are are independent ... why not?

HTH

joel garry

unread,

Apr 5, 2007, 5:39:08 PM4/5/07

to

I'm too stupid to do this in SQL. I'd select out the timestamp, data
and a tag for which table, from the two tables into two files, send
both files through unix sort, awk for the pattern, and then back into
the db with sqlloader or maybe as an external table. This often is
faster because you don't upset Oracle with too-large sorting within
the db, and you just blast sequentially through the data rather than
trying to lag values.

Please don't top-post.

jg
--
@home.com is bogus.
"Oh boy, we need to work on the I/O on this test system, I swear there
are gerbils in that server running back and forth with some floppies
in their mouths transferring the data between disks. " - Herod T

Mladen Gogala

unread,

Apr 5, 2007, 6:22:25 PM4/5/07

to

John, allow me to take a shot: Ralph Kimball describes the timeline
table in his DW toolkit book. He uses fixed length intervals (connected
with the warehouse granularity). Also, you may want to introduce a
synthetic event C, defined as the absence of B in the given period.
SQL, as opposed to Perl is not a solution for all problems.

--
http://www.mladen-gogala.com

DA Morgan

unread,

Apr 5, 2007, 6:46:11 PM4/5/07

to

Similarly a pipelined table function could generate all possible times
and be joined.

Maxim Demenko

unread,

Apr 5, 2007, 7:44:50 PM4/5/07

to Charles Hooper

Charles Hooper schrieb:

> On Apr 5, 3:50 pm, "John" <acide.ascorbi...@gmail.com> wrote:
>>> Is this the real problem or a simplification?
>> It's a simplification but not that much. The real problem involves
>> user_ids but this part can be skipped here.
>>
>>> Is this something that will be run once or repeatedly?
>> Only once.
>>
>>> Is it possible for the same time to be in A and B?
>> No, A and B are completely different data.
>>
>>> Is it possible to have a B before an A beginning the sequence?
>>> Is it possible for there to be multiple Bs between As?
>> Yes everything is possible, A events and B events happen
>> independently.
>>
>> Thanks for being interested in my problem!
>>
>> John
>
> I would be inclined to not handle this by not just using SQL. You
> could potentially have a nearly full Cartesian join on the first table
> to itself, for example:
>

> Charles Hooper
> PC Support Specialist
> K&M Machine-Fabricating, Inc.
>

I suggest, pure SQL solution could be like this:

create table a as
with t as (
select 'A' event,dbms_random.value(1,31) + trunc(sysdate,'MM') edate
from dual
connect by level <= 60 )
select * from t;

create table b as
with t as (select 'B' event,dbms_random.value(1,31) +
trunc(sysdate,'MM') edate
from dual
connect by level <= 15)
select * from t;

WITH u as ( SELECT * FROM a UNION ALL SELECT * FROM b ORDER BY 2)
,
t as
(SELECT u.*,
row_number() over(order by edate) id, -- to enumerate
all events

decode(lag(event) over (order by edate), event, 0, 1)
start_of_group
FROM u
ORDER BY edate
)
,
t1 as ( SELECT id,event,edate,sum(start_of_group) over(order by
edate) group_id FROM t)
,
t2 as
(SELECT id,
event,
edate,
group_id,
count(*) over(partition by group_id) cnt_in_group,
row_number() over(partition by group_id order by edate)
row_num
FROM t1
)
,
t3 as
(SELECT connect_by_root(id) first_event,
connect_by_root(edate) first_date,
id last_event,
edate last_date
FROM t2
WHERE cnt_in_group >1
AND event ='A' connect by prior row_num=row_num-1
AND prior group_id=group_id
)
SELECT *
FROM t3
WHERE first_event != last_event
ORDER BY first_event,
last_event;

However, considering the data volumes ( 1e8 for A Events and 3e7 for B
Events), this could result in worst case ( all B events are
chronologically after or before A events) in 1e8!/2!(1e8-2)!
permutations, what nearly equals 1e16 rows, in best case, if all B
events are evenly distributed among A events, this will result in
approximately 1e7 rows, assuming the distribution is somewhere in the
middle - still would expect a lot of data in result set - i would second
the Charles suggestion to process programmatically, possibly dividing
the source data in chunks.

Best regards

Maxim

Charles Hooper

unread,

Apr 5, 2007, 10:51:28 PM4/5/07

to

On Apr 5, 7:44 pm, Maxim Demenko <mdeme...@gmail.com> wrote:
> Charles Hooper schrieb:
> > On Apr 5, 3:50 pm, "John" <acide.ascorbi...@gmail.com> wrote:
> >>> Is this the real problem or a simplification?
> >> It's a simplification but not that much. The real problem involves
> >> user_ids but this part can be skipped here.
>
> >>> Is this something that will be run once or repeatedly?
> >> Only once.
>
> >>> Is it possible for the same time to be in A and B?
> >> No, A and B are completely different data.
>
> >>> Is it possible to have a B before an A beginning the sequence?
> >>> Is it possible for there to be multiple Bs between As?
> >> Yes everything is possible, A events and B events happen
> >> independently.
>
> >> Thanks for being interested in my problem!
>
> >> John
>
> > I would be inclined to not handle this by not just using SQL. You
> > could potentially have a nearly full Cartesian join on the first table
> > to itself, for example:
>
> > Charles Hooper
> > PC Support Specialist
> > K&M Machine-Fabricating, Inc.
>
> I suggest, pure SQL solution could be like this:

(SNIP)

Nice. I stopped short of providing a solution in my previous post.
Here is an example of an all SQL solution, continuing from my previous
post:

Let's try an experiment to see if we can generate a sequential counter
that skips when either TIME_SLOT or NEXT_TIME_SLOT is not A:
SELECT
TIME_DATE,
NEXT_TIME_DATE,
(ROW_NUMBER() OVER (ORDER BY TIME_DATE))*DECODE(TIME_SLOT,'A',
1,NULL)*DECODE(NEXT_TIME_SLOT,'A',1,NULL) TT,
TIME_SLOT,

NEXT_TIME_SLOT,
T1_V1,
T1_V2
FROM
(SELECT

TIME_DATE TIME_DATE,
LEAD(TIME_DATE) OVER (ORDER BY TIME_DATE) NEXT_TIME_DATE,

TIME_SLOT,
LEAD(TIME_SLOT,1) OVER (ORDER BY TIME_DATE) NEXT_TIME_SLOT,
T1_V1,
T1_V2
FROM
(SELECT
NVL(T1.V1,T2.V1) TIME_DATE,
DECODE(T1.V1,NULL,'B',NVL2(T2.V1,'AB','A')) TIME_SLOT,
TO_CHAR(T1.V1,'HH24:MI') T1_V1,
T1.V2 T1_V2,
TO_CHAR(T2.V1,'HH24:MI') T2_V1,
T2.V2 T2_V2
FROM
T1 FULL OUTER JOIN T2 ON T1.V1=T2.V1
ORDER BY

NVL(T1.V1,T2.V1)));

TIME_DATE NEXT_TIME_DATE TT TIME_SLOT NEXT_TIME_SLOT T1_V1
T1_V2
==================== ==================== =========== =========
============== ===== ==========
05-APR-2007 00:02:00 05-APR-2007 00:05:00 1 A A 00:02 1
05-APR-2007 00:05:00 05-APR-2007 00:07:00 2 A A 00:05 2
05-APR-2007 00:07:00 05-APR-2007 00:09:00 A B 00:07 3
05-APR-2007 00:09:00 05-APR-2007 00:10:00 B A
05-APR-2007 00:10:00 05-APR-2007 00:12:00 5 A A 00:10 4
05-APR-2007 00:12:00 05-APR-2007 00:15:00 6 A A 00:12 5
05-APR-2007 00:15:00 05-APR-2007 00:17:00 7 A A 00:15 6
05-APR-2007 00:17:00 05-APR-2007 00:18:00 A B 00:17 7
05-APR-2007 00:18:00 05-APR-2007 00:20:00 B A
05-APR-2007 00:20:00 05-APR-2007 00:22:00 10 A A 00:20 8
05-APR-2007 00:22:00 05-APR-2007 00:25:00 11 A A 00:22 9
05-APR-2007 00:25:00 05-APR-2007 00:27:00 12 A A 00:25 10
05-APR-2007 00:27:00 05-APR-2007 00:28:00 A B 00:27 11
05-APR-2007 00:28:00 05-APR-2007 00:30:00 B A
05-APR-2007 00:30:00 05-APR-2007 00:32:00 15 A A 00:30 12
05-APR-2007 00:32:00 05-APR-2007 00:35:00 16 A A 00:32 13
05-APR-2007 00:35:00 05-APR-2007 00:37:00 A AB 00:35 14
05-APR-2007 00:37:00 05-APR-2007 00:40:00 AB A 00:37 15
05-APR-2007 00:40:00 05-APR-2007 00:42:00 19 A A 00:40 16
05-APR-2007 00:42:00 05-APR-2007 00:45:00 20 A A 00:42 17
05-APR-2007 00:45:00 05-APR-2007 00:47:00 A AB 00:45 18
05-APR-2007 00:47:00 05-APR-2007 00:50:00 AB A 00:47 19
05-APR-2007 00:50:00 05-APR-2007 00:52:00 23 A A 00:50 20
05-APR-2007 00:52:00 05-APR-2007 00:55:00 24 A A 00:52 21
05-APR-2007 00:55:00 05-APR-2007 00:56:00 A B 00:55 22
05-APR-2007 00:56:00 05-APR-2007 00:57:00 B A
05-APR-2007 00:57:00 05-APR-2007 01:00:00 27 A A 00:57 23
05-APR-2007 01:00:00 05-APR-2007 01:02:00 28 A A 01:00 24
05-APR-2007 01:02:00 05-APR-2007 01:05:00 A AB 01:02 25
05-APR-2007 01:05:00 05-APR-2007 01:07:00 AB A 01:05 26
05-APR-2007 01:07:00 05-APR-2007 01:10:00 31 A A 01:07 27
...

Now, let's use SYS_CONNECT_BY_PATH to connect the start and end times
together:
SELECT
TIME_DATE,
NEXT_TIME_DATE,
TO_CHAR(TIME_DATE,'HH24:MI') ||'-' ||
SUBSTR(SUBSTR(SYS_CONNECT_BY_PATH(TO_CHAR(NEXT_TIME_DATE,'HH24:MI'),','),
2,50)||',',
1,INSTR(SUBSTR(SYS_CONNECT_BY_PATH(TO_CHAR(NEXT_TIME_DATE,'HH24:MI'),','),
2,50)||',',',')-1) TIME_RANGE
FROM
(SELECT
TIME_DATE,
NEXT_TIME_DATE,
(ROW_NUMBER() OVER (ORDER BY TIME_DATE))*DECODE(TIME_SLOT,'A',
1,NULL)*DECODE(NEXT_TIME_SLOT,'A',1,NULL) TS,
TIME_SLOT,
NEXT_TIME_SLOT
T1_V1,
T1_V2
FROM
(SELECT
TIME_DATE TIME_DATE,
LEAD(TIME_DATE,1) OVER (ORDER BY TIME_DATE) NEXT_TIME_DATE,

TIME_SLOT,
LEAD(TIME_SLOT,1) OVER (ORDER BY TIME_DATE) NEXT_TIME_SLOT,
T1_V1,
T1_V2
FROM
(SELECT
NVL(T1.V1,T2.V1) TIME_DATE,
DECODE(T1.V1,NULL,'B',NVL2(T2.V1,'AB','A')) TIME_SLOT,
TO_CHAR(T1.V1,'HH24:MI') T1_V1,
T1.V2 T1_V2,
TO_CHAR(T2.V1,'HH24:MI') T2_V1,
T2.V2 T2_V2
FROM
T1 FULL OUTER JOIN T2 ON T1.V1=T2.V1
ORDER BY

NVL(T1.V1,T2.V1))))
CONNECT BY PRIOR
TS=TS+1;

TIME_DATE NEXT_TIME_DATE TIME_RANGE
==================== ====================
=========================================================
05-APR-2007 00:02:00 05-APR-2007 00:05:00 00:02-00:05
05-APR-2007 00:05:00 05-APR-2007 00:07:00 00:05-00:07
05-APR-2007 00:02:00 05-APR-2007 00:05:00 00:02-00:07
05-APR-2007 00:10:00 05-APR-2007 00:12:00 00:10-00:12
05-APR-2007 00:12:00 05-APR-2007 00:15:00 00:12-00:15
05-APR-2007 00:10:00 05-APR-2007 00:12:00 00:10-00:15
05-APR-2007 00:15:00 05-APR-2007 00:17:00 00:15-00:17
05-APR-2007 00:12:00 05-APR-2007 00:15:00 00:12-00:17
05-APR-2007 00:10:00 05-APR-2007 00:12:00 00:10-00:17
05-APR-2007 00:20:00 05-APR-2007 00:22:00 00:20-00:22
05-APR-2007 00:22:00 05-APR-2007 00:25:00 00:22-00:25
05-APR-2007 00:20:00 05-APR-2007 00:22:00 00:20-00:25
05-APR-2007 00:25:00 05-APR-2007 00:27:00 00:25-00:27
05-APR-2007 00:22:00 05-APR-2007 00:25:00 00:22-00:27
05-APR-2007 00:20:00 05-APR-2007 00:22:00 00:20-00:27
05-APR-2007 00:30:00 05-APR-2007 00:32:00 00:30-00:32
05-APR-2007 00:32:00 05-APR-2007 00:35:00 00:32-00:35
05-APR-2007 00:30:00 05-APR-2007 00:32:00 00:30-00:35
05-APR-2007 00:40:00 05-APR-2007 00:42:00 00:40-00:42
05-APR-2007 00:42:00 05-APR-2007 00:45:00 00:42-00:45
05-APR-2007 00:40:00 05-APR-2007 00:42:00 00:40-00:45
05-APR-2007 00:50:00 05-APR-2007 00:52:00 00:50-00:52
05-APR-2007 00:52:00 05-APR-2007 00:55:00 00:52-00:55
05-APR-2007 00:50:00 05-APR-2007 00:52:00 00:50-00:55
05-APR-2007 00:57:00 05-APR-2007 01:00:00 00:57-01:00
05-APR-2007 01:00:00 05-APR-2007 01:02:00 01:00-01:02
05-APR-2007 00:57:00 05-APR-2007 01:00:00 00:57-01:02
05-APR-2007 01:07:00 05-APR-2007 01:10:00 01:07-01:10
05-APR-2007 01:10:00 05-APR-2007 01:12:00 01:10-01:12
05-APR-2007 01:07:00 05-APR-2007 01:10:00 01:07-01:12
05-APR-2007 01:17:00 05-APR-2007 01:20:00 01:17-01:20
05-APR-2007 01:20:00 05-APR-2007 01:22:00 01:20-01:22
05-APR-2007 01:17:00 05-APR-2007 01:20:00 01:17-01:22
05-APR-2007 01:25:00 05-APR-2007 01:27:00 01:25-01:27
05-APR-2007 01:27:00 05-APR-2007 01:30:00 01:27-01:30
05-APR-2007 01:25:00 05-APR-2007 01:27:00 01:25-01:30
...

(Rows 1652) CONNECT BY WITHOUT FILTERING (cr=23 pr=0 pw=0
time=31559 us)
(Rows 1194) VIEW (cr=23 pr=0 pw=0 time=29665 us)
(Rows 1194) WINDOW NOSORT (cr=23 pr=0 pw=0 time=24882 us)
(Rows 1194) VIEW (cr=23 pr=0 pw=0 time=20154 us)
(Rows 1194) WINDOW SORT (cr=23 pr=0 pw=0 time=14131 us)
(Rows 1194) VIEW (cr=23 pr=0 pw=0 time=15595 us)
(Rows 1194) SORT ORDER BY (cr=23 pr=0 pw=0 time=12007 us)
(Rows 1194) VIEW (cr=23 pr=0 pw=0 time=28726 us)
(Rows 1194) UNION-ALL (cr=23 pr=0 pw=0 time=23946 us)
(Rows 1000) HASH JOIN RIGHT OUTER (cr=14 pr=0 pw=0
time=11251 us)
(Rows 300) TABLE ACCESS FULL T2 (cr=7 pr=0 pw=0
time=662 us)
(Rows 1000) TABLE ACCESS FULL T1 (cr=7 pr=0 pw=0
time=2026 us)
(Rows 194) MERGE JOIN ANTI (cr=9 pr=0 pw=0 time=4647
us)
(Rows 300) TABLE ACCESS BY INDEX ROWID T2 (cr=2 pr=0
pw=0 time=2128 us)
(Rows 300) INDEX FULL SCAN T2_IND1 (cr=1 pr=0 pw=0
time=918 us)
(Rows 106) SORT UNIQUE (cr=7 pr=0 pw=0 time=2528 us)
(Rows 1000) INDEX FAST FULL SCAN T1_IND1 (cr=7 pr=0
pw=0 time=2026 us)