Gerard Schildberger <
gera...@rrt.net> writes:
> Yes, especially if the channels were outboard instead of
> inboard. Inboard channels stole "CPU" cycles for execution
> and memory access(es). _________________ Gerard Schildberger
re:
http://www.garlic.com/~lynn/2012o.html#19 Assembler vs. COBOL--processing time, space needed
http://www.garlic.com/~lynn/2012o.html#20 Assembler vs. COBOL--processing time, space needed
http://www.garlic.com/~lynn/2012o.html#21 Assembler vs. COBOL--processing time, space needed
low-end &mid-range 360 processors had integrated channels ... i.e. same
engine that executed the 360 instruction set microcode also executed the
channel microcode. 360/65(67) and above had external channels.
note that even external channels doing lots of i/o, could cause lots of
memory bus interference for the processor.
undergraduate I had done the cp67 changes to support tty/ascii terminals
... and tried to make the ibm 2702 terminal controller do something that
it couldn't quite do. this somewhat prompted the univ to start a clone
controller effort ... reverse engineer channel interface and build
channel interface controller board for interdata/3 programmed to emulate
2702 (and also do what I tried with the 2702). late four of us were
written up as responsible for (some part of) the clone controller business
... some past posts
http://www.garlic.com/~lynn/subtopic.html#360pcm
an early bug was the controller "held" the channel ... which "held" the
memory bus ... which resulted in the processor "red-light". The issue
was that the 360/67 high-frequency timer "tic'ed" approx. every 13mics
and needed to update the location 80 timer field. If the timer tic'ed
while a previous timer memory update was still pending, the processor
would stop with hardware error. the clone controller had to frequently
"free" the channel (which in turned "freed" the memory bus) ... to
allow for timer-tic update.
later at the science center ... the 360/67 was "upgraded" from a
"simplex" to dual-processor 360/67. the multiprocessor 360/67 was unique
in 360s that added a "channel controller" and multi-ported memory that
provided channel interface with independent interface. the multi-ported
memory interface slowed the processor memory bus cycle by 10-15% ... a
single-processor "half-duplex" 360/67 (single processor with channel
controller) would run that much slower for compute intensive workload.
However a simplex 360/67 running 100% cpu concurrent with heavy i/o
workload would have lower thruput than same workload on a half-duplex
(single processor with channel controller) 360/67. misc. past posts
mentioning multiprocessor (&/or compare&swap instruction)
http://www.garlic.com/~lynn/subtopic.html#smp
note that clone controller business was major motivation for the
"failed" future system effort ... some past posts
http://www.garlic.com/~lynn/submain.html#futuresys
during "future system" period lots of 370 activity was killed off (and
the lack of 370 products during this period is credited with giving
clone processors a market foothold) ... but when FS was killed ... there
was mad rush to get stuff back into the 370 product pipelines. this
included the q&d 303x in parallel with 3081 using some warmed over FS
technology.
370/158 had integrated channels, for 303x ... there was an external
"channel director" by taking the 370/158 engine and eliminating the 370
instruction microcode. A 3031 was a 370/158 engine with just the 370
instruction microcode and a 2nd 370/158 engine with just the integrated
microcode engine. A 3032 was a 370/168 reconfigured to use the 303x
channel director. A 3033 was 370/168 logic remapped to 20% faster chips
... but with something like 10 times the circuits/chip. During 3033
development, some logic rework was done to take advantage of the extra
circuits eventually resulting in 3033 being 50% faster than 370/158.
The 360/370 channel architecture was half-duplex, end-to-end handshaking
(between channel and end-device) and designed to be "shared" with lots
of concurrent device activity. The end-to-end handshaking could result
in enormous "channel busy" and latency ... especially with slow
controllers.
I've repeated before that common configuration for "large systems" (16
channels) tended to be both disk controllers and 3270 terminal
controllers spread across the same channels. In special project for STL
moving 300 people from the IMS group to off-site location, I did support
for HYPERchannel operating as channel extender. HYPERChannel remote
device adapters were at the remote location with local
"channel-attached" 3270 controllers moved to the remote location ... and
HYPERChannel adapter used on the real channels. The HYPERChannel
adapters had much faster processor than the 3270 controllers ... so
identical operations resulted in much lower channel busy (especially
hand-shaking controller operations) ... and the move resulted in 10-15%
increase in workload throughput (lower channel busy interferance with
disk operations).
A similar but different was the development of the 3880 disk controller
(replacing 3830) for 3mbyte 3380 disks. While the transfer rate when up
by factor of ten times, the 3880 processor engine was much slower than
the 3830 it replaced (significantly channel busy for control
hand-shaking operations). The 3090 processor had originally configured
max. number of configuration channels based on assumption that 3880
would have same channel busy control operations a 3830. When they found
out how bad the 3880 controller actually was, they had to significantly
increase the number of channels (in order to achieve target system
throughput). This increased the number of TCMs needed to manufactor a
3090 ... a big hit to the 3090 manufacturing cost ... there was joke
that 3090 group was going to bill the 3880 group for the increase in
3090 manufacturing cost. This in large part accounted for subsequent
myths about the massive number of mainframe channels increasing i/o
throughput (when it was actually to compensate for the implementation
short-comings of half-duplex, end-to-end operation).
Fibre-channel effort originated circa 1988 at LLNL looking at
standardizing was serial technology that they were using in house (this
was about the same time that LANL started work on HiPPI for 100mbyte/sec
standards version of half-duplex cray channel). A big push in
fibre-channel was pairs of serial transfer, one dedicated for concurrent
transfer in each direction. Part of the effort was fully concurrent
asynchronous operation to compensate for the increasing role that
end-to-end latency played as transfer rates increased. Part of this was
complete transfer of request to remote end as asynchronous operation
with results coming back ... minimizing the number of end-to-end
operations for doing I/O (and the associated latency turn-around).
reference to early jan1992 meeting doing cluster-scaleup for commercial
using fibre-channel standard
http://www.garlic.com/~lynn/95.html#13
misc. past ha/cmp postings
http://www.garlic.com/~lynn/subtopic.html#hacmp
above references also wanting to reform (serial) 9333/harrier to
interoperate with fibre-channel ... instead it was done as its own
unique coming out as SSA. It was part of working with LLNL on
fibre-channel ... as well as working with LLNL on cluster-scaleup for
numeric intensive & scientific workload ... some old email on
cluster-scaleup
http://www.garlic.com/~lynn/lhwemail.html#medusa
possibly only hrs after the last email above (end of jan1992), the
cluster-scaleup work was transferred and we were told we couldn't work
on anything with more than four processors. A couple weeks later
(17Feb1992), it was announced as IBM supercomputer for numeric intensive
and scientific *ONLY*. old press reference:
http://www.garlic.com/~lynn/2001n.html#6000clusters1
later that spring (11May1992), press about clusters had caught the
company "by surprise" ... even though I had been working on it for
some time with various customers.
http://www.garlic.com/~lynn/2001n.html#6000clusters2
The later IBM mainframe FICON effort effectively layered the standard
half-duplex characteristic on-top of fibre-channel standard ... carrying
with it all the throughput limitations of the high number of end-to-end
operations and associated latencies (drastically cutting the throughput
of the FICON layer compared to that of the underlying fibre-channel).