Importing directory tree structure

323 views
Skip to first unread message

James Hondo

unread,
Oct 22, 2012, 9:55:57 PM10/22/12
to mayan...@googlegroups.com
Hello, thanks a lot for releasing your software, it is great!  I have been looking for something like it for a long time, it does everything I needed and then more.  One thing I couldn't found on the documentation; can it automatically mirror the structure of the document directories when doing an initial import?  Also I noticed the Wikipedia article is missing a great deal of stuff, I've worked on a few articles myself and would gladly help polish Mayan's article if you like.

James

Roberto Rosario

unread,
Oct 23, 2012, 3:35:11 AM10/23/12
to mayan...@googlegroups.com
Hi James,

Thanks I apreciate your comments :)

Check this thread to see if this is more or less what your are interested in: https://groups.google.com/forum/?fromgroups=&pli=1#!topic/mayan-edms/M_S5ZSVV5U4%5B1-25%5D

As far as I know there are no Mayan EDMS articles on Wikipedia, there was one try once and the article got deleted by the most ridiculous of excuses, it became clear that the editors evaluating the article were seriously biased against Mayan for what I can only think were monetary reasons.  Wikipedia as an idea is great, but the project has fallen from grace, there are very serious moderation and vandalism issues that are as old as the project and that they have not been able to address.  I don't have any interest for an article about Mayan on Wikipedia.  Sorry if that sounds bit harsh since you are just offering to help, I just want to save you the time and effort of building and defending a great article only to have corrupt editors delete it once you comply with the self serving objections they will produce.  I wholeheartedly thank you for your interest, but it is not worth your time.

--Roberto

James Hondo

unread,
Oct 23, 2012, 1:44:13 PM10/23/12
to mayan...@googlegroups.com
We are a small accounting firm and have a Windows server working as fileserver with all of our clients' documents sorted by year, month, activities and such.  My question is; Can I import not only the documents, but also their existing directory structure?

Wow I don't always agree with the veteran editor's decision but calling them corrupt is a very heavy handed and strong worded accusation.  Still I think Mayan is a great piece of software with a great community and worthy of an article in Wikipedia, just something to consider. 

Roberto Rosario

unread,
Oct 23, 2012, 6:31:13 PM10/23/12
to mayan...@googlegroups.com
No, the directory structure would not be 'cloned'.  I added this to the TO DO list for future versions, but I'm a little hesitant to add it, because it would just be duplicating the inefficient paradigm of filesystem directory trees only on a web interface.  This is the reason I created the automatic indexing where Mayan creates a hierarchical structure based on user defined rules to help users avoid being slaves of a manually updated structure as other DMS software do.  Is a little work at the beginning while you create the rules, but then you don't have to ever worry again about documents being placed in the correct hierarchical unit.

It is not an acusasion, is it a document fact: http://news.cnet.com/8301-1023_3-57514677-93/corruption-in-wikiland-paid-pr-scandal-erupts-at-wikipedia/ It has long been suspected that this was happening, the episode in the link is the most documented and alarming as it was done by a very senior Wikipedia editor.  The editor received payment to edit and favor the page of the government of Gibraltar so that it would be featured in Wikipedia's front page.  An article is lucky to land on the front page, the page of the country of Gibraltar landed 17 times, boosting their SEO results sky high.  What is most disgusting is how the editor involved and others argue that getting paid for favorably editing (or dumbing down a competitor's article) is not a conflict of interest!

With this information in mind, now read Mayan's previous wikipedia article discussion here: http://en.wikipedia.org/wiki/Wikipedia:Articles_for_deletion/Mayan_edms  It is extensive but you can clearly see how: 1) The article was tagged for deletion from the start even when articles are usually moved to an 'inactive' (userfied) mode where defenders can keep improving it at resubmit it for evaluation again, 2) The criteria for deletion was produced out of thin air, 3) The existing articles for commercial DMS software would fail that same criteria, 4) The editors were not following Wikipedia's code of conduct (accussing new users of being SPA's) , 5) Confused the issue on purpose, mixing defense of the article with WP:OTHERSTUFFEXISTS to invalidate defense, 6) Editors had no idea what a DMS software is and confused it with CMS software; and tell me that it is hard to deny how the editors appeared to be personally motivated beyond their duties as editors to erradicate Mayan article from Wikipedia.

I'm not trying to be controversial, just answering your argument and explaining my desires for not wanting/caring about a Mayan EDMS article on Wikipedia.

--Roberto

James Hondo

unread,
Oct 25, 2012, 7:31:28 PM10/25/12
to mayan...@googlegroups.com
Thanks for replying and explaining the reasoning behind your decision.  I hope you reconsider or implement something similar, it would be a shame to lose our existing structure and start from scratch.

I was completely unaware of this, all I can say is that I am speechless... The article I wanted to update has been deleted anyway.  Someone else is trying to create a new one; http://en.wikipedia.org/wiki/Wikipedia_talk:Articles_for_creation/Mayan_EDMS But it has been declined twice.  I apologize, I didn't knew about any of this.  

----
James

Nate Aune

unread,
Oct 26, 2012, 11:47:35 AM10/26/12
to mayan...@googlegroups.com

James Hondo

unread,
Oct 26, 2012, 9:44:06 PM10/26/12
to mayan...@googlegroups.com
No; it was this one: http://en.wikipedia.org/wiki/Mayan_dms But WOW!  The page you mentioned was created by none other than Samuel Jabob Klein!  This is a great validation of the Mayan software as Samuel is an elected member of the Wikipedia board of trustees and a Wikipedia steward (a super admin), so him creating the article for Mayan is a BIG THING!  Wikipedia is a collective of a big number of individuals; some a bit misguided as the one mentioned here, but with many others like Samuel.  So read his Wikipedia bio (here), I hope his actions has restored even a little bit of faith in the community of people working behind Wikipedia.

James 

maarten

unread,
Nov 2, 2012, 3:36:01 AM11/2/12
to mayan...@googlegroups.com
Hi Roberto and James,
 
I agree on Roberto's statement on the filesystem. That is exactly the reason why filesystem directory trees always end in a mess; as soon as more than one directory (='metadata') can be applicable users will choose randomly. In case of a DMS you can then just use a second, third metadata tag.
However, I do understand James' problem, very common in a transition, as the best 'metadata' he currently has is the interpretation of the directory tree. E.g. a structure "project number XX\sales\quotations" are in fact 3 metadata tags to be filled.
 
You could try a script where the arguments for -- metadata at bulk_upload are filled by some logic from a ls command. I've never tried (and am really not enough up to speed on pyhton if it wouldn't work to achieve this) but this command with multiple --metadata arguments could then do the trick if ran per subfolder:
 
$ ./manage.py bulk_upload --noinput --metadata '{"project": "bulk"}' --document_type "Accounting documents" compressed.zip
 
After this the magick of indexes could rebuild the original directory structure but then with many more usefull cross-sections of your document metadata.
 
Maarten

Roberto Rosario

unread,
Nov 2, 2012, 10:51:39 AM11/2/12
to mayan...@googlegroups.com
Hi James,

No problem, it was just a design philosophy but I'm always very open about adding features requests to the TO DO list.  Feel free to open an issue on github.

No need to apologize, at all! :)  Like I said, I think the concept behind Wikipedia is great and very important.  Knowledge needs to be preserved and can only be accurately preserved in a crowd sourced manner for balance and for scalability.  In my opinion, Wikipedia is becoming too important and some of it policies need to mature because they are empowering the wrong people to decide what is and is not worthy of being recorded for future reference.

--Roberto

Roberto Rosario

unread,
Nov 2, 2012, 11:26:34 AM11/2/12
to mayan...@googlegroups.com
Now this is what I would call unexpected...  and puzzling!

Roberto Rosario

unread,
Nov 2, 2012, 12:02:03 PM11/2/12
to mayan...@googlegroups.com
Thanks Maarten that's actually the way it could be accomplished with the current code.  The TO DO list is quite extensive so I can't promise a date for this, but based on Maarten's script idea and adding 3 new options to the bulk_upload command this could be done automatically.

--dir_regex_metadata: to interpret a document's path as metadata values using regular expression 
--from_path: to import documents from a directory instead of a zipped file
--recursive: to traverse all subdirectories

so for something like:

/project number 01/customer/Customer Name A/
/project number 01/customer/Customer Name B/
/project number 02/customer/Customer Name C/

My regex Fu is poor but the command line would be more or less like this:

$ ./manage.py bulk_upload --noinput --dir_regex_metadata "/project number (?P<project_number>\d+)/customer/(?P<customer_name>[a-zA-Z0-9 ]+)"  --document_type "Accounting documents" --from_path /var/accounting/docs/ --recursive

with the regular expression parameter name (the name inside ?P<  >) being the internal metadata type name.

Thoughts?

--Roberto

James Hondo

unread,
Nov 7, 2012, 11:21:07 AM11/7/12
to mayan...@googlegroups.com
Maarten, thanks for the suggestion I will give it a try.  Roberto, any estimate when this might get added?  Would be really helpful not only to me but I'm sure that for a lot of people as well.

On an unrelated note Alfresco 4.2 came out, take a look at the release notes, see if you notice something :-)

James

Roberto Rosario

unread,
Nov 9, 2012, 4:39:38 PM11/9/12
to mayan...@googlegroups.com
Hi James,

Hard to say, the next version was meant as a maintenance release with only bug fixes and minor updates.  As it stands already there are some major features that were added.  An option could be to delay the next version even further to add all these requests and release a major version.  Opinions?

I don't know exactly what you are referring to.  I found this page: http://wiki.alfresco.com/wiki/Alfresco_Community_4.2 if that is what you meant, then yes there are a few things that are straight from Mayan, the development version and from stuff discussed here, that have not been part of that software in the years it has existed.  Mayan's document handling paradigms are very different from what is available.  The slogan on the website "Re-Think Document Management" is actually very true, I started from scratch based on my personal experiences working with a huge amount of document of different kinds on a government agency.  I don't mind other software duplicating functionality, in fact "Imitation the highest form of flattery" :)  So it's nice to know that Mayan is not just swimming around the DMS market, but actually setting the pace in feature set :D
 
--Roberto

maarten

unread,
Dec 1, 2012, 5:20:40 AM12/1/12
to mayan...@googlegroups.com
Hello James,

Any update on the import script idea? Did you try some code and would you like to share to code and regexp?

I am very curious to hear.

Best regards,

Maarten

Roberto Rosario

unread,
Dec 12, 2012, 9:32:37 PM12/12/12
to mayan...@googlegroups.com
No update on this yet, I'm trying to get out the already delayed twice version with the bootstrap app in it, but I will give this a try on the next development cycle after the stable version release.

--Roberto

joshu...@gmail.com

unread,
May 30, 2013, 7:40:19 PM5/30/13
to mayan...@googlegroups.com
Our organization is evaluating a EDMS solution and we came across the Mayan edms software. I had a question for you....How does your software compare to Alfresco?

Reply all
Reply to author
Forward
0 new messages