I believe examples for your 'data transformation parameters' are
- The integer k in 'k-means clustering'
- The window size in a 'moving average'
- The values for p, T, w, m in a 's transformation'
I believe the following is true: In every instance of a data
transformation process, the data transformation parameter is replaced by
a value (typically a number). That number is referred to as a data
transformation parameter, as a different number could have been used,
and it would have still been an instance of the same data
transformation. This means being a parameter is a 'role like' entity
attached to a number used in a process. As we are not allowed to use
roles for data (we need to follow up with Bary on that, dropped the ball
...), we should follow the 'specification' route for now.
Something like:
data transformation parameter specification
is_a 'information entity about a realizable'
is_concretized_as (is_realized by only data transformation)
is_about some (information_content entity participates_in some data
transformation)
(As always, I would have preferred a 'specifies' relation instead of the
is_about, but every time I raise that relation my emails get ignored :) )
- Bjoern
James Malone wrote:
> Hi Bjoern,
>
> Yes there are at least two uses of the word. The computer function
> parameter is quite close to the mathematical use. I can find several
> definitions on the net some better than others, the definitions
> further down this wiki page http://en.wikipedia.org/wiki/Parameter are
> ok for defining where they lie within a function but little more. The
> statistical one is more complicated as you've spelled out but we need
> both. So to push on, considering the mathematical/computer function
> use, one thing that seems to strike me is that for a data
> transformation, that a paremeter is a part of the process. If we say
> that a parameter mu is part of some equation involved which defines a
> data transformation, there are two things we need to say 1. mu is part
> of the equation (i.e. the DT) and 2. that mu is a paremeter. Does
> this help Alan?
>
> James
>
--
Bjoern Peters
Assistant Member
La Jolla Institute for Allergy and Immunology
9420 Athena Circle
La Jolla, CA 92037, USA
Tel: 858/752-6914
Fax: 858/752-6987
http://www.liai.org/pages/faculty-peters
- Bjoern
Bjoern's examples are good and represent some of the things we would
represent with this. So in order to push on with this, do you want me
to construct a definition and propose it? Send on some more examples?
Submit it to the tracker? I'd like to push on with this and not let
it die...
Thanks,
James
Thanks,
Chris
That's somewhat in the direction I was thinking as well, though more
on the objective side. Specifically I wonder if we wouldn't make
progress by having sub-objectives of data transformation related
objectives that provision of the parameter help satisfy.
For k in k-means it does indeed seem like a specification.
For a window for averaging, it can help achieve the objective of reducing noise.
Consider the case of RMA normalization. We've used a standard set of
arrays as a background to add a new array and have it normalized. Are
the standard set of arrays a parameter? (because they aren't about the
measurement at hand?)
-Alan
The problem is the one with dealing with all information content
entities. How to prevent a slide into disconnection of the information
from the thing it represents. This isn't a matter of overmodeling,
it's a matter of keeping to a principle of clean hygiene in
representing information.
>Â I agree with Chris, I'm not sure where it sits presently
> but I would be inclined to leave this as either a role or a
> specification. Â The only issue is that a parameter is only a parameter
> in the context of the process it participates in, but perhaps that's
> still ok for it to live under ICE.
Explain this last bit, if you would?
Thanks,
Alan
I am extending here what I wrote in my original email to James. One thing to keep in mind is that there is no such thing as 'specification' in the OBI, but there is 'information entity about a realizable' which is the parent of 'plan specification' etc., so we have informally dubbed it 'specification'.
I agree with Alan that it can be hard to keep 'parameters' and specified data input and output apart. All are specified in the plan that is executed in the data transformation process. For all of them, we can identify some 'information content entity' that participates in the actual data transformation process. So I believe we can define:
data transformation participant specification
is_a 'information entity about a realizable'
part_of some (plan specification and has_part data transformation objective)
is_about some (information_content entity and participates_in some data transformation)
data transformation participant specification has at least three children:
1)data transformation input specification
2)data transformation output specification
3)data transformation parameter specification
For 1 and 2, we have been using the 'has_specified_information_input / output' relations, which point to 'data' items. We are not actually using 1 + 2. We could do the same for 3, and use a third relation 'has_specified_information_parameter', which would point to 'data transformation parameter' items. This would move the fact that these are specifications into the relation.
E.g:
information content entity
data transformation parameter
window averaging length
number of clusters k
data transformation parameter would be a defined class:
information content entity and is_specified_information_parameter_in some data transformation
I have to stop here...
- Bjoern
James
Cheers,
James
information content entity
data transformation parameter value
window averaging length
number of clusters k
It would be used like:
'moving average' has_specified_information_parameter 'window averaging
length'
and we should have a similar approach to identify the value of the
'window averaging length' as we are using for 'measurement datum', i.e.
a 'has_value' relation, possibly also a 'has unit' relation.
Draft definition:
A data transformation parameter value is a information content entity
which is a specified participant of a data transformation. It is
distinct from the data input and the data output of the data
transformation, in that the the latter refer to the data items which are
being transformed, the latter retain an aboutness relation to a measured
entity.
Otherwise would we need to have extra relations for our variable
(dependent, independent, controlled) specifications too?
Melanie
---
Mélanie Courtot
TFL- BCCRC
675 West 10th Avenue
Vancouver, BC
V5Z 1L3, Canada