Assuming we have a fixed higher order scheme ... 3rd order... 16 channels.
What makes the most sense... for a highly simplified, limited ambisonics scheme?
1) having 3 possible channel counts: 4 (1st order), 16 (3rd order) and
20 (3rd then 1st)
2) having only 1 possible channel count: 20 (1st order followed by 3rd
order). Empty channels are allowed in the first 4 or in the last 16, but
not both (does this work for decoders?)
3) having only 1 possible channel count: 20 ... 16 then 4 (i.e. should
the first order channels come before or after the 16 chn 3rd order
ones?)
4) having 2 possible channel counts: 4 and 20 (i.e. 1st order and
combined 1st and 3rd).
As usual, the motivation is the simplicity of software implementation
and interface, routing signals etc.
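For reference, the channel counts in the options above follow from the usual full-sphere count of (N + 1)^2 channels for order N. A minimal Python sketch, with function and constant names that are purely illustrative:

```python
def channels_for_order(order: int) -> int:
    """Full-sphere (periphonic) channel count for one ambisonic order: (N + 1)^2."""
    if order < 0:
        raise ValueError("order must be non-negative")
    return (order + 1) ** 2


def total_channels(*orders: int) -> int:
    """Channel count when complete sets of several orders are laid end to end."""
    return sum(channels_for_order(n) for n in orders)


# The three counts that appear in the options above.
FIRST_ORDER = channels_for_order(1)   # 1st order alone
THIRD_ORDER = channels_for_order(3)   # 3rd order alone
COMBINED = total_channels(1, 3)       # 1st order set plus 3rd order set
```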
Etienne
I'm wondering what VST instruments would look like if you were mixing
a third order and 1st order piece together.
Would you have one stream of 4 channels (the recorded) ... then
another stream of 16 channels (the synthesised) ... then some super
funky VST that adds them up into 20 channels and spits them out?
Or would you have every VST be 20 channels, but the recorded stuff just
uses the first 4 and the synthesised stuff just uses the last 16?
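The second alternative (every plugin sees 20 channels) might look roughly like the sketch below. It assumes the "1st order then 3rd order" layout, with the 4 first-order channels at bus positions 0-3 and the 16 third-order channels at positions 4-19; the reverse layout would only swap the two blocks. The name `to_bus` and the list-of-lists signal representation are assumptions made for illustration, not part of any existing plugin API.

```python
FIRST_ORDER_CH = 4
THIRD_ORDER_CH = 16


def to_bus(first=None, third=None):
    """Lay a 4-channel and/or a 16-channel stream onto one 20-channel bus.

    Each stream is a list of channels and each channel a list of samples.
    A missing stream is replaced by silent channels; the two blocks may
    not both be empty.
    """
    if first is None and third is None:
        raise ValueError("at least one of the two blocks must carry signal")
    if first is not None and len(first) != FIRST_ORDER_CH:
        raise ValueError("first-order stream must have 4 channels")
    if third is not None and len(third) != THIRD_ORDER_CH:
        raise ValueError("third-order stream must have 16 channels")

    # Use whichever streams are present to determine the block length.
    present = list(first or ()) + list(third or ())
    frames = len(present[0])
    if any(len(ch) != frames for ch in present):
        raise ValueError("all channels must have the same number of samples")

    def block(stream, count):
        # Silent channels stand in for a stream that was not supplied.
        if stream is None:
            return [[0.0] * frames for _ in range(count)]
        return [list(ch) for ch in stream]

    return block(first, FIRST_ORDER_CH) + block(third, THIRD_ORDER_CH)
```

A recorded first-order source would then call `to_bus(first=recorded)` and a synthesised third-order source `to_bus(third=synth)`; both results have 20 channels and can be summed channel by channel on the bus.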
Etienne
Good, simple, easy-to-understand formats help this. 1st order Ambi does
this. FMH, for all its sins re extensibility, sorta does this.
[rant]
Having separate channels for 1st & 3rd order obfuscates matters for everyone
and seriously turns (Ambi naive) Yamaha away.
Possible Ambi converts will realise this isn't much (if at all) better than
zillion.1, just as wasteful of resources and MUCH more complicated.
I WANT to do a good General Media player with the best Ambi technology based on
VideoLAN. Ambi tech has advantages even for zillion.1 and 5.1 playback IN THE
HOME.
The choice of SN3D in all its forms fills me with dread cos the magic nos. As
Aaron has shown in his experimental codec, this is NOT easy to program. You
need to use extra special functions in the GSL library to avoid overflow from
the zillion factorials involved. It's bad enough having to do this for N3D
(which you MUST do internally anyway) without having to generate a 2nd, even
more complicated set for conversion.
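To make the factorial problem concrete, here is a minimal Python sketch of the two sets of weights. It is not Aaron's codec and does not use GSL; it assumes the commonly cited definitions, in which the SN3D weight for degree l and index m is sqrt((2 - delta_m0) (l - |m|)! / (l + |m|)!) and the N3D weight is the SN3D weight times sqrt(2l + 1). The factorial ratio is evaluated through `math.lgamma` (the log-gamma function), which is one standard way of keeping the intermediate values in range. The function names are illustrative.

```python
import math


def sn3d_weight(l: int, m: int) -> float:
    """SN3D weight sqrt((2 - delta_m0) * (l-|m|)! / (l+|m|)!), via log-gamma."""
    if l < 0 or abs(m) > l:
        raise ValueError("need l >= 0 and |m| <= l")
    am = abs(m)
    # log((l-|m|)! / (l+|m|)!) without ever forming the factorials themselves.
    log_ratio = math.lgamma(l - am + 1) - math.lgamma(l + am + 1)
    delta_term = 1.0 if m == 0 else 2.0
    return math.exp(0.5 * (math.log(delta_term) + log_ratio))


def n3d_weight(l: int, m: int) -> float:
    """N3D weight under the assumed definition: SN3D times sqrt(2l + 1)."""
    return math.sqrt(2 * l + 1) * sn3d_weight(l, m)


def n3d_to_sn3d_gain(l: int) -> float:
    """Per-degree gain that converts an N3D component to SN3D."""
    return 1.0 / math.sqrt(2 * l + 1)
```

Under these assumed definitions the conversion between the two sets depends only on the degree l, so a fixed third order scheme would need four conversion gains in total.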
Having to deal with extra sets of channels just makes me want to go away and
concentrate on FMH.
________________
Once again ..
A RECORDING format should define what it sounds like; the soundfield.
Shelf filters, psychoacoustic decoding, NFC etc are decoding issues; the
PLAYBACK problem. None of these should influence the recording format.
Let the decoder designers work it out. They use the recorded definition and do
the best they can. Fons has already made a start.
Do we seriously think we have heard the last word on high order decoders? We
still haven't got enough good 1st order decoders! (Only 3 software ones)
That's what BLaH3 is about.
We've hardly touched on what is necessary for good 1st order periphony let
alone HOA playback.
How about some serious work, including listening tests, on HOA decoders
instead of more pontificating leading only to even more seriously
complicated formats.
[/rant]
Let's keep our format as simple and efficient as possible.
> Having separate channels for 1st & 3rd order obfuscates matters for
> everyone and seriously turns (Ambi naive) Yamaha away.
Quite so. What we really have is a very good recording format that is
integral, in the sense that adding together multiple orders yields a
perfectly good, if mixed-accuracy, representation of the soundfield as
a whole. Sure, the lower orders do spatially average more
than the higher ones, but in the end the physical description of the
soundfield that results from simple addition is sound.
As such I would once again say that since the psychoacoustical
optimization comes from the decoding end, let's just leave it there.
Let's just blithely mix across orders, and wait for the point in time
where decoders are intelligent enough to separate lower order signal
sets from the higher ones, even in sum. Because at least in theory, that
is at least partially achievable. Let's not litter our standardised
transmission formats with shortcuts to the end of proper psychoacoustic
decoding, over mixed orders. Especially since mixed orders in the
general sense would lead to a signal set plagued by combinatorial
explosion of sidechannels, used to facilitate pantophony, periphony,
multiple orders, and all of the combinations.
Let's instead build a new, perhaps active, theory of decoders, that can
handle sum signals originating with different orders, or in plain words,
signals that sometimes happen to have components where the higher order
coefficients are constantly set nil.
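The "simple addition" meant here can be sketched as follows: a lower order set is treated as a higher order set whose missing coefficients are zero, and everything is summed channel by channel. The sketch assumes a channel ordering in which the lower order components occupy the first channels of the higher order set, and that all inputs share the same normalisation; neither is settled in this thread. The name `mix_orders` and the list-of-lists signal layout are illustrative.

```python
def mix_orders(*signals, channels=16):
    """Sum signal sets of different orders into one set of `channels` channels.

    Each signal is a list of channels, each channel a list of samples.
    A set with fewer channels simply contributes nothing (zero) to the
    higher-order channels it lacks.
    """
    if not signals:
        raise ValueError("need at least one signal set")
    frames = len(signals[0][0])
    out = [[0.0] * frames for _ in range(channels)]
    for sig in signals:
        if len(sig) > channels:
            raise ValueError("signal set has more channels than the target")
        for ch_index, ch in enumerate(sig):
            if len(ch) != frames:
                raise ValueError("all channels must have the same length")
            for i, sample in enumerate(ch):
                out[ch_index][i] += sample
    return out
```

Mixing a 4-channel recording with a 16-channel synthesised part then yields 16 channels in which the first four carry the sum and the remaining twelve carry only the synthesised part.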
> Possible Ambi converts will realise this isn't much (if at all) better than
> zillion.1, just as wasteful of resources and MUCH more complicated.
Quite so. We want to keep the architecture clear and simple, even at the
cost of increasingly complicated decoders. We would have gone
there anyway, even given first order signals, and the
processing power to do that up to third order is already there and
increasing exponentially.
> The choice of SN3D in all its forms fills me with dread cos the magic
> nos.
I don't like that either. It should be all N3D because that's what we
operate on.
> Having to deal with extra sets of channels just makes me want to go
> away and concentrate on FMH.
To amplify, I just want to run away from the whole thing if I have to
concentrate on extra channels besides the basic, physically
well-founded, spherical harmonic encoded signal set.
> A RECORDING format should define what it sounds like; the soundfield.
> [...]
I'm right there with you. Proper modularization and a certain separation
of concerns is what has made, and should in the future make, ambisonic
as elegant as it is/could be.
> Do we seriously think we have heard the last word on high order
> decoders? We still haven't got enough good 1st order decoders!
Nor do we have a single active decoder for any order, even though the
original BBC paper on Matrix H *did* suggest an active design could make
a significant positive difference in the case of underdetermined signal
sets (in that case single dominant sources, but the idea can be fully
generalized).
> How about some serious work, including listening tests, on HOA decoders
Quite so. And also new decoder designs, which depart from the
traditional dogma.
> Let's keep our format as simple and efficient as possible.
Hail to that. Whatever went into the mix, the basic S3N spherical
harmonic decomposition should in my mind stand as the ultimate format.
Things like near field coding should be added after the fact, via
scaling coefficients or the like.
--
Sampo Syreeni, aka decoy - de...@iki.fi, http://decoy.iki.fi/front
+358-50-5756111, 025E D175 ABE5 027C 9494 EEB0 E090 8BA9 0509 85C2
That's a very sensible decision. They can stop anywhere they want. You don't
have to support 192 kHz sampling just cos some deaf .. I mean Golden Pinnae
believe they are bats.
> 2) decoders.
Ditto. This is an AmbDecLib issue and dependent on the no. of
speakers. A 5.1 AV receiver doesn't have to support 7.1 ... zillion.1
> 3) ... all the different possible ...
Ditto
> 4) how does the end user (my mother in law) choose which mixed-order, let
> alone higher order to use?
It is transparent to the end user. The recording producer decides if they want
5.1, 7.1 ... zillion.1 or HOA. The end user's Ambisonic Surround Decoder (see
my Ambisonia page on ASD) is matched to his speaker rig (automatically) and
does the best it can with what it's presented.
> but mixed orders are designed to cater for specific arrays ... 2 user groups ...
Mixed orders are OPTIMISED for specific arrays but a mixed order signal is
still decoded AS GOOD AS POSSIBLE by the ASD.
> We need to set a ceiling for higher orders ...
That's good. UA has this, as has FMH.
________________
This is what we NEED. (from discussions circa 2007)
1) Good compressed Ambi
2) AmbDecLib
3) A good way to extend to HOA
NOTHING ELSE is REQUIRED (or now IMHO, even good to have.)
1) & 2) can be based on FMH. Only the researchers really like 3)
We should have had 1) a long time ago but the people who did good work on this,
Aaron & Sebastien Olter, were ignored as usual.
(http://www.ambisonia.com/tests/streamingAAC if you want to see what can be
done easily with little fuss)
AmbiDecLib should be the most important contribution by the Ambi community.
Apart from some small stuff by Fons, nothing has been done.
I include myself among those who should be de-pinnaed but I plead computer
problems.
AmbiDecLib which is also the ASD answers all your questions. If you look at
the block diagram, you will see it is also the guts of a HOA ENCODER as well as
a DECODER.
We need to do Listening Tests on HOA decoders to at least the level of the BLaH
tests.
Because AmbiDecLib is the "Answer to Life, the Universe and Everything", I look
at all these proposals and ask ...
Will they make AmbDecLib development easier or more difficult / obfuscating?
Yamaha AND the end user are similarly affected.
I don't think it's worth discussing this further. I'll opt out and go back to
compiling VideoLAN with AmbDecLib for FMH on my Sinclair Spectrum.
Expect results come the next millennium.