You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to Paperwork
Hello, on Linux here, with flatpack.
Paperwork uses too much memory, 1.1GB at the moment.
Is there a possibility not to hold too much information in the memory, that is not actually needed?
Thanks, Steffen
Jerome Flesch
unread,
Nov 27, 2019, 6:36:54 AM11/27/19
Reply to author
Sign in to reply to author
Forward
Sign in to forward
Delete
You do not have permission to delete messages in this group
Copy link
Report message
Show original message
Either email addresses are anonymous for this group or you need the view member email addresses permission to view the original message
to paperw...@googlegroups.com, Steffen Michalek
Hello,
Yes and no.
I think the root of the problem is that Paperwork stores a copy of the
content of the documents in the Whoosh index. It needs this copy so it
can be used to untrain the label guessers when a document has been
deleted without using Paperwork. Problem is, if I'm not mistaken, Whoosh
keeps everything in memory. I think it also makes the index rewrites on
disk much longer.
In Paperwork 2.0, things will be done things differently. Whoosh will
only be used for indexing and searching. Document content copies are
stored in a separate Sqlite database (which won't be in memory).
Hopefully this will reduce memory usage.
Also in Paperwork 2.0, things will be a lot more modular, which will
make it much easier to figure out what is actually consuming resources.