--
You received this message because you are subscribed to the Google Groups "bids-discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bids-discussion+unsubscribe@googlegroups.com.
To post to this group, send email to bids-discussion@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bids-discussion/CA%2BA4wOneVTU5YMmuVSCfoReMVaAcGqYjPT6KhE_-5G7MPP6qMA%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.
Hi,
On Mon, Apr 10, 2017 at 08:43:15AM -0700, Chris Gorgolewski wrote:
> Hi,
>
> A couple of questions:
> - How would this play with BIDS Derivatives
> <https://docs.google.com/document/d/1Wwc4A6Mow4ZPPszDIWfCUCRNstn7d_zzaWPcfcHmgI4/edit>?
Quick question: Has there been any change to the concept of
incorporating derivatives inside a dataset (in a derivatives/ folder)?
I have previously expressed my concern that this brings a range of
practical problems, e.g.:
- a shared namespace is established: who gets to decide whose derivative
of a particular kind can take up a slot?
- dataset hosting and update bottle neck: any new derivative requires an
update of the original dataset. who get's to decide when it is worth
it? What is the take of data portal maintainers on this? Requests like
"I made A from B, please update dsXXX with it" are a likely
consequence.
- distribution efficiency: a growing blob of original data with
incorporated derivatives becomes increasingly more difficult to
handle, and the likelihood of anyone needing all pieces goes down at
the same time
- rights: who decides what licence derivatives are under when they go
into an original dataset? For example, "my" original data is CC0.
Nothing prevents people from producing "all rights reserved" works of
wonder from them. However, I will be up in arms if anyone achieves to
incorporate this kind of thing into a dataset, and at a location,
where I published the orginal CC0 dataset -- which would then maybe
get its licence amended and the author list extended by a new friend.
On the other side of the spectrum would be some kind of GPL-like viral
behavior, where derivatives have to share the license of the original
-- which is equally bad, although in a totally different way.
I recently had a discussion about this design with Franco Pestilli
(CC'ed), and it appears that I am not the only one with these kinds of
thoughts.
My suggestion was, and is, to have one dataset per derivative, and only
use a more or less convenient reference to the original dataset (at
least a README, but preferably a subdataset relationship like
datalad provides).
https://github.com/psychoinformatics-de/studyforrest-data-aligned/tree/master/src
The above link shows how one derivative dataset (spatially normalized
data for the studyforrest dataset) references its input/raw/original
datasets. In the studyforrest project we organize data in this way (one
study/acquisition == one dataset, one derivative/analysis == one
dataset). With the growing list of such datasets we are able to flexibly
combine intermediate datasets as source for new derivatives, without
feeling the pain of never ending updates to our foundation datasets --
something that we started with but had to abandon for manpower reasons
at a very early stage of the project.
Best,
Michael
--
You received this message because you are subscribed to the Google Groups "bids-discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bids-discussion+unsubscribe@googlegroups.com.
To post to this group, send email to bids-discussion@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bids-discussion/20170410182515.vm3xitrtsotixypk%40meiner.
- How would this play with BIDS Derivatives? Would there be an intrinsic default layout people would extend?
- Do you want to control the key names in this layout?
On Mon, Apr 10, 2017 at 7:50 AM, Satrajit Ghosh <sa...@mit.edu> wrote:--hi folks,just like bids has a structure that could be represented with a schema or a layout file (see BIDSLayout here: https://github.com/INCF/pybids/blob/master/bids/grabbids/config/bids.json ). we can scale this architecture for derivatives, while maintaining a dictionary for commonly used terms (e.g., sub, ses, etc.,.).i would propose that the author of any app or process creating a bids derivatives optionally provide a layout.json file at the root of the derivative structure (//derivatives/{derivative_name}/layout.json)the layout file should be relative to derivatives/{derivative_name}cheers,
satra
You received this message because you are subscribed to the Google Groups "bids-discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bids-discussion+unsubscribe@googlegroups.com.
To post to this group, send email to bids-discussion@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bids-discussion/CA%2BA4wOneVTU5YMmuVSCfoReMVaAcGqYjPT6KhE_-5G7MPP6qMA%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.
--
You received this message because you are subscribed to the Google Groups "bids-discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bids-discussion+unsubscribe@googlegroups.com.
To post to this group, send email to bids-discussion@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bids-discussion/CAAQzouN2%3DHrcpLMSbQTAj6CbzDL%3DyzaY77RbBmymRsU73KFdag%40mail.gmail.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bids-discussion/CA%2BA4wOnOtScDCWJ8mwsmO_bRV9u9qDoszLreNs2gTWeVx753Ew%40mail.gmail.com.
Hi,
thanks for the explanations, please see my response below.
On Mon, Apr 10, 2017 at 11:46:48AM -0700, Chris Gorgolewski wrote:
> On Mon, Apr 10, 2017 at 11:25 AM, Michael Hanke <michae...@gmail.com>
> > - a shared namespace is established: who gets to decide whose derivative
> > of a particular kind can take up a slot?
> >
> There is not shared namespace anymore - each pipeline has its own subfolder.
Indeed the namespace conflict is limited to the pipeline names. Do you
envision it to become common practice to encode a person/lab's name in
the pipeline name?
> - dataset hosting and update bottle neck: any new derivative requires an
> > update of the original dataset. who get's to decide when it is worth
> > it? What is the take of data portal maintainers on this? Requests like
> > "I made A from B, please update dsXXX with it" are a likely
> > consequence.
> >
> You can add derivative packages to existing datasets since each new
> derivative set will have its own subfoldr.
True. However, I am still struggling to see how to apply this design in
the general case. Another example from our current practice are
derivatives that:
- need other derivatives as input in addition to raw data. I assume the
pipeline folder names are persistent with respect to updates, and pipeline
code will reference data inputs via a ../<derivpath> relative path.
- need more than one raw dataset as input.
This one I have no idea how to frame in the spirit of the current
spec. The new derivative has no exclusive parent-relation to either of
the two raw datasets. Where would it go?
You pointed out the derivatives are associated "by proximity" (i.e.
have to be extracted into the right dataset directory). So I guess I
could extract them into both input raw datasets. However, that still
wouldn't make the code that produced the derivative run in either
location without having to go beyond the root directory of a raw
dataset.
> You can create separate compressed archives that include only derivatives
> from one pipeline. Those can be downloaded independently from the raw data.
> <snip>
> We have not included this before, but I just added license info to the
> <dataset>/derivatives/<pipeline_name>/pipeline_description.json
Yes, that makes sense -- I see now that derivatives are considered stand alone.
Thanks,
Michael
--
You received this message because you are subscribed to the Google Groups "bids-discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bids-discussion+unsubscribe@googlegroups.com.
To post to this group, send email to bids-discussion@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bids-discussion/20170411050734.o5awgxffeondwspb%40meiner.
--
You received this message because you are subscribed to the Google Groups "bids-discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bids-discussion+unsubscribe@googlegroups.com.
To post to this group, send email to bids-discussion@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bids-discussion/CA%2BA4wOneVTU5YMmuVSCfoReMVaAcGqYjPT6KhE_-5G7MPP6qMA%40mail.gmail.com.
Michael
--
You received this message because you are subscribed to the Google Groups "bids-discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bids-discussion+unsubscribe@googlegroups.com.
To post to this group, send email to bids-discussion@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bids-discussion/20170411061322.w4kzvnmedxlt6yfq%40meiner.
Hi Satra,Before focusing on BIDS Derivatives, if we go down this route, could the BIDS layout file become an official part of the specs (structure of the JSON file, etc)? Unless I miss something, the file you point to defines a number of entities but is far from exhaustively describing the entire BIDS layout.
Guillaume.Thanks,--On 10 April 2017 at 15:50, Satrajit Ghosh <sa...@mit.edu> wrote:hi folks,--just like bids has a structure that could be represented with a schema or a layout file (see BIDSLayout here: https://github.com/INCF/pybids/blob/master/bids/grabbids/config/bids.json ). we can scale this architecture for derivatives, while maintaining a dictionary for commonly used terms (e.g., sub, ses, etc.,.).i would propose that the author of any app or process creating a bids derivatives optionally provide a layout.json file at the root of the derivative structure (//derivatives/{derivative_name}/layout.json)the layout file should be relative to derivatives/{derivative_name}cheers,
satra
You received this message because you are subscribed to the Google Groups "bids-discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bids-discussion+unsubscribe@googlegroups.com.
To post to this group, send email to bids-discussion@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bids-discussion/CA%2BA4wOneVTU5YMmuVSCfoReMVaAcGqYjPT6KhE_-5G7MPP6qMA%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.
You received this message because you are subscribed to the Google Groups "bids-discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bids-discussion+unsubscribe@googlegroups.com.
To post to this group, send email to bids-discussion@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bids-discussion/CADWiGBKZzTe4dmzEHiUDFL-_Hi_0-f6oUzKc9uK%2BDaODozWUuQ%40mail.gmail.com.
On Wed, Apr 12, 2017 at 3:39 AM, Guillaume Flandin <guillaum...@gmail.com> wrote:Hi Satra,Before focusing on BIDS Derivatives, if we go down this route, could the BIDS layout file become an official part of the specs (structure of the JSON file, etc)? Unless I miss something, the file you point to defines a number of entities but is far from exhaustively describing the entire BIDS layout.Technically this file has been a formal representation of the BIDS Spec used internally in pybids. In current BIDS (not BIDS Derivatives) it would be redundant to include this file with each dataset since it would always be the same. Am I missing something?
To view this discussion on the web visit https://groups.google.com/d/msgid/bids-discussion/CAAQzouNYhnza9fPHqhS00ynknVUiC%2BfKxMgYb94b%3DFaK_Y8kDg%40mail.gmail.com.
Hi Chris,
On 12 April 2017 at 16:27, Chris Gorgolewski <krzysztof.gorgolewski@gmail.com> wrote:On Wed, Apr 12, 2017 at 3:39 AM, Guillaume Flandin <guillaum...@gmail.com> wrote:Hi Satra,Before focusing on BIDS Derivatives, if we go down this route, could the BIDS layout file become an official part of the specs (structure of the JSON file, etc)? Unless I miss something, the file you point to defines a number of entities but is far from exhaustively describing the entire BIDS layout.Technically this file has been a formal representation of the BIDS Spec used internally in pybids. In current BIDS (not BIDS Derivatives) it would be redundant to include this file with each dataset since it would always be the same. Am I missing something?Sorry if I was unclear: I meant to have it part of the BIDS specification document, not included in each dataset. And if this format ends up being recommended to be used in BIDS Derivatives, you will anyway have to describe it formally
- unless you consider that pybids is part of the specs.
To view this discussion on the web visit https://groups.google.com/d/msgid/bids-discussion/CADWiGBLxcxDReaS2i4pwF9JZ9susufDeXUBZ-001G04fNERt9Q%40mail.gmail.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bids-discussion/CAAQzouPCuJSRbf2YJvzKpC_VfzzqbvyixgENpDb9dhL963xq_Q%40mail.gmail.com.
To unsubscribe from this group and stop receiving emails from it, send an email to bids-discussi...@googlegroups.com.
To post to this group, send email to bids-di...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/bids-discussion/20170411061322.w4kzvnmedxlt6yfq%40meiner.
For more options, visit https://groups.google.com/d/optout.
--
You received this message because you are subscribed to the Google Groups "bids-discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bids-discussi...@googlegroups.com.
To post to this group, send email to bids-di...@googlegroups.com.