So, discovered the other day that one of my yacy was complaining about diskspace being full. I cleaned whatever i could, using the admin panels. But to no avail.
Though, i shortly after discovere ~9GB(!!) of quite old data lingering in DATA/SURROGATES/out/
The files had apparently neither been touched or modiefied in quite some time.
(This might be because of changes in the requesting peers. Or rare/never-returning peers; ‘Leaving behind’ their requested data )
I decided they’re probably lingering files of no use. So I decideded to apply tmpreaper
In case someone might find it useful; The following is a crontab entry i currently use to prevent lingering 'out’s . It deletes them 12 hours after their modified/creation time (mtime), if not accessed (atime) in 12 hours. Checking is being done at reboot, and then subsequently every 15th minute.
(note that user in this case is ‘yacy’. evnt. adjust as needed)
#clean yacy out
@reboot tmpreaper 12h /home/yacy/yacy/DATA/SURROGATES/out/ >/dev/null 2>&1
*/15 * * * * tmpreaper 12h /home/yacy/yacy/DATA/SURROGATES/out/ >/dev/null 2>&1
Keep in mind that mtime (-m) argument will likely need to be added if noatime arg have been set on the disk/partition used in /etc/fstab (often set for SSDs etc.) . Since ‘last time file was accessed’ records are then not kept by the filesystem.
https://linux.die.net/man/8/tmpreaper_selinux
https://tracker.debian.org/pkg/tmpreaper
https://aur.archlinux.org/packages/tmpreaper/
https://tldp.org/LDP/solrhe/Securing-Optimizing-Linux-RH-Edition-v1.3/chap6sec73.html
I expect there is no problem doing it as such…