On Wed, Apr 28, 2010 at 23:09, Jeanne Kramer-Smyth
<jkrame...@gmail.com> wrote:
> Along the same vein, I would like to be able to collapse rows together when
> I use clustering to update values. My specific case is one in which I want
> to be able to add values together when I discover they belong in the same
> row.
>
> For example, the original data might look like this:
>
> ABCDEFG 35 100
> ABcdeFG 20 50
>
> After clustering and finding the match, I would love to end up with 1 row
> that looks like this:
>
> ABCDEFG 55 150
>
> Plausible? Anyone else think this would be useful? I am not sure what
> happens to non-numeric values - perhaps just putting all the values in a
> single cell with a given value separator.

Definitely -- this kind of thing is something I'd probably drop out to
a programming language to do most of the time. What operation to
perform on the numbers in the collapsed targets would vary by the
semantics of each column, though. For columns that are "counts of
ABCDEFG", summing is of course the interesting operation, whereas for
columns that are "average price of ABCDEFGs", it would be a
count-weighted average, i.e. sum(count * value) / sum(count) over each
reconciled variant of ABCDEFG, whether or not that is expressible in
something like GEL. Other column semantics would call for yet other
operations, of course.
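
To make the difference concrete, here is a rough sketch in Python (not
GEL, and not anything Gridworks does today). It assumes rows of
(name, count, value), uses str.upper as a stand-in for whatever the
clustering step decides belongs together, and the collapse() helper and
its field names are made up for illustration.

```python
from collections import defaultdict


def collapse(rows, key_of):
    """Merge rows whose names map to the same cluster key.

    rows:   iterable of (name, count, value) tuples (hypothetical layout)
    key_of: function mapping a name to its cluster key (an assumption;
            real clustering would supply this mapping)
    """
    groups = defaultdict(list)
    for row in rows:
        groups[key_of(row[0])].append(row)

    merged = []
    for members in groups.values():
        count = sum(c for _, c, _ in members)
        merged.append({
            "name": members[0][0],  # keep the first spelling seen
            "count": count,         # "counts of X": plain sum
            # If the value column is also a count, summing is right:
            "value_sum": sum(v for _, _, v in members),
            # If it is an average price, weight each value by its count:
            "value_weighted_avg": (
                sum(c * v for _, c, v in members) / count if count else None
            ),
        })
    return merged


rows = [("ABCDEFG", 35, 100), ("ABcdeFG", 20, 50)]
result = collapse(rows, str.upper)
```

For the two example rows, plain summing gives 55 and 150 as in the
original message, while treating the last column as an average price
gives 4500 / 55, roughly 81.8.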

I would love this kind of collapsing functionality, especially if
columns could be marked up to carry along some of their semantics,
guiding per-column default choices of operation like this and letting
Gridworks get smarter (without overruling power users who have some
other agenda).
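
As a sketch of what such markup could drive (purely hypothetical: the
tag names "label", "count" and "text", the DEFAULT_OPS table and the
collapse_columns() helper are all invented here, not Gridworks
features), each column's tag picks a default merge operation, and an
explicit override always wins:

```python
# Hypothetical semantic tags mapped to default merge operations.
DEFAULT_OPS = {
    "label": lambda values: values[0],  # keep the first spelling
    "count": sum,                       # counts add up
    # Non-numeric values: join the distinct ones with a separator.
    "text": lambda values: "; ".join(dict.fromkeys(values)),
}


def collapse_columns(columns, semantics, overrides=None):
    """Merge the values of each column for one cluster of rows.

    columns:   {column name: list of values from the rows being merged}
    semantics: {column name: tag found in DEFAULT_OPS}
    overrides: {column name: callable} chosen by the user; takes
               precedence over the tag's default
    """
    overrides = overrides or {}
    merged = {}
    for name, values in columns.items():
        op = overrides.get(name) or DEFAULT_OPS[semantics[name]]
        merged[name] = op(values)
    return merged


columns = {
    "name": ["ABCDEFG", "ABcdeFG"],
    "units": [35, 20],
    "notes": ["red", "blue"],
}
semantics = {"name": "label", "units": "count", "notes": "text"}

defaults = collapse_columns(columns, semantics)
custom = collapse_columns(columns, semantics, overrides={"units": max})
```

With the defaults, "units" collapses to 55 and "notes" to "red; blue";
passing max as an override for "units" yields 35 instead, which is the
kind of escape hatch a power user would want.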
--
/ Johan Sundström,
http://ecmanaut.blogspot.com/