definition: a data transformation parameter specification is an information entity about a realizeable that is used in a data transformation to refer to specific kinds of values.
examples: The integer k in 'k-means clustering', The window size in a 'moving average'; The values for p, T, w, m in a 's transformation'
is_a 'information entity about a realizable'
is_concretized_as (is_realized by only data transformation)
is_about some (information_content entity participates_in some data
transformation)
editor note: There are other meanings of parameter such as population characteristic that may still need to be addressed.
term: genome sequence version
definition: genome sequence version is a label that is used to specify the representation of the assembled genome sequence contained in a file or used in an analysis.
definition source: CS, DENRIE
examples: mm8, The March 2006 human reference sequence (NCBI Build 36.1)
restrictions:
is_about some (genome sequence <output of some data transformation of sequence data into a genome sequence>)
editor note: need to create 'genome sequence' and/or 'data transformation of sequence data into a genome sequence' or something like it.
original request from Nicole Washington:
for a data analysis protocol where an entire genomic sequence might be a
specified parameter, it would be useful to be able to specify the genomic
version.
for example, i have an algorithm that takes a genomic sequence as an input,
say like a gene-model prediction algorithm, and outputs some transformation
of the data. the results of the algorithm would be different depending on
which genomic sequence was in the input parameter.
term: tree model
definition: a tree model is a data representational model in which there are one or more layers of leaf nodes attached in a hierarchical manner and there may be a top or root node.
definition source: CS, DENRIE
examples: tree models are use in phylogenetic trees, gene clusters based on microarray data
restrictions: ??
editor note: not sure how to logically define hierarchical structure which is what distinguishes this from other models.
original request from James:
term: time series collection
definition: a time series collection is a data collection that is a sequence of data points, measured typically at successive times, spaced at (often uniform) time intervals.
definition source: Wikipedia
examples: gene expression measurements of cells taken from a culture over a series of days.
restrictions:
is_output_of some measurement
is_input_to some data transformation
original request from James:
DT requires the concept of 'time series' which would serve as input to some
of the DTs that deal with this. As a starting point for time series, here
is the wikipedia def: "A time series is a sequence of data points, measured
typically at successive times, spaced at (often uniform) time intervals".
term: heatmap
definition: a heatmap is a report element which is a graphical representation of data where the values taken by a variable in a two-dimensional map are represented as colors.
definition source: Wikipedia
examples: representation of microarray data for expression values of many genes across multiple samples or conditions.
original request from James:
term: survival curve
definition: a survival curve is a report element which plot percent survival as a function of time.
definition source: Graphpad.com
original request from James:
term: venn diagram
definition: a venn diagram is a report element which is constructed with a collection of simple closed curves drawn in the plane.
definition source: Wikipedia
original request from James:
term: graph diagram
definition: a graph diagram is a report element which is a collection of points and lines connecting some (possibly empty) subset of them.
original request from James: