That is indeed a bit of a pickle. Perhaps you could write a small script that uses the API to do the deleting for you, either slowly (one DELETE request per document) or in small batches (bulk_edit).
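A minimal sketch of the batched variant, in case it helps. Treat everything here as an assumption to check against your own instance: token authentication with an `Authorization: Token ...` header, a paginated `GET /api/documents/` that returns `results` and `next`, an `original_file_name` field on each document, and a `bulk_edit` endpoint that accepts `"method": "delete"`. The file name pattern is taken from the question; the helper names and the batch size of 50 are made up for illustration. Take a backup first and try it on a handful of documents before running it on all of them.

```python
import json
import re
import urllib.request

# Assumption: the unwanted files follow the naming pattern quoted in the question.
PNG_PATTERN = re.compile(r"^Planung Solarmodule-onshape-neu_\d+\.png$")


def select_ids(documents, pattern=PNG_PATTERN):
    """Return the ids of documents whose original file name matches the pattern."""
    return [
        doc["id"]
        for doc in documents
        if pattern.match(doc.get("original_file_name") or "")
    ]


def batches(ids, size):
    """Yield consecutive slices of at most `size` ids."""
    for start in range(0, len(ids), size):
        yield ids[start:start + size]


def fetch_documents(base_url, token):
    """Follow the paginated document list and yield every document dict."""
    url = f"{base_url.rstrip('/')}/api/documents/?page_size=100"
    while url:
        req = urllib.request.Request(url, headers={"Authorization": f"Token {token}"})
        with urllib.request.urlopen(req, timeout=300) as resp:
            page = json.load(resp)
        yield from page["results"]
        url = page.get("next")


def post_json(url, token, payload):
    """POST a JSON payload and return the HTTP status code."""
    req = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Token {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    with urllib.request.urlopen(req, timeout=300) as resp:
        return resp.status


def delete_in_batches(base_url, token, ids, size=50, send=post_json):
    """Send one bulk_edit delete request per batch; return how many ids were sent."""
    url = f"{base_url.rstrip('/')}/api/documents/bulk_edit/"
    sent = 0
    for chunk in batches(ids, size):
        # Assumed payload shape for the bulk_edit endpoint.
        send(url, token, {"documents": chunk, "method": "delete", "parameters": {}})
        sent += len(chunk)
    return sent
```

Usage would be along the lines of `ids = select_ids(fetch_documents(base, token))` followed by `delete_in_batches(base, token, ids)`, where `base` and `token` are your own server URL and API token. Small batches keep each request short, which should avoid the timeouts you see in the GUI.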
-
Hi,
I am using the latest Docker-based version of paperless-ngx, running on an old i7 notebook with 8 GB RAM.
I just learned that 90% of my archive is garbage in the form of duplicate files. These are PNG files with names such as
Planung Solarmodule-onshape-neu_XXX.png
where XXX counts up from 1 to currently 987.
In total I have 17K PNG files in my originals folder, out of a total of 21K files.
These PNG files were not intended to be archived and are of no value for me.
I assume that they were created because my consume folder was read-only. paperless seems to be able to detect duplicates for PDF files, but for PNG files this does not seem to work.
I will correct the issue with the consume folder, but how do I get rid of all these PNG files?
It does not seem to be possible via the GUI: I can filter them, but even selecting them fails, and at the latest the delete operation fails with a timeout or a generic error message (error cause 0).
I was tempted to simply delete all PNG files in the originals folder, but I fear that I might create severe inconsistencies that would make the archive unusable. I also considered exporting everything and starting with a fresh installation, but the export also fails after several days.
Any suggestions on how to remove these files, or how to save my other 4K tagged PDF files?
Michael