We generate a new data set from the ls listings at the end of a data collection pass if the current data set is more than 4 days old, or we have more than 700 parsed ls listings that are newer than the current data set. The data set generation interval varies between 2 and 3 days.
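The rule above can be written as a small shell test. This is only a sketch: the helper name needs_new_data_set and its two arguments (age of the current data set in hours, number of newer parsed listings) are hypothetical and are not taken from the FTP search source.

```shell
#!/bin/sh
# needs_new_data_set AGE_HOURS NEW_LISTINGS   (hypothetical helper)
# Returns 0 (true) when a new data set should be generated.
needs_new_data_set() {
    age_hours=$1
    new_listings=$2
    # More than 4 days (96 hours) old ...
    [ "$age_hours" -gt 96 ] && return 0
    # ... or more than 700 parsed listings newer than the data set.
    [ "$new_listings" -gt 700 ] && return 0
    return 1
}
```

For example, `needs_new_data_set 50 701` succeeds because of the listing count, while `needs_new_data_set 96 700` fails because neither limit is exceeded.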
http://www.unit.no/
are not indexed.
Then send an email to tegge@idt.ntnu.no. It should contain the word add and the URL of the server to be indexed, without any directory specified, e.g.
add ftp://ftp.unit.no/
To have only a part of the ftp server indexed, you must maintain an ls-lR.gz file in the root directory of the anonymous ftp area containing what you want indexed.
A firewall should not be configured to drop (deny) incoming tcp connections to port 113. Instead, it should be configured to send back a RST packet (reject), causing the ident request to be aborted with a Connection refused error, instead of a timeout.
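As an illustration of reject versus drop, the rule might look as follows on a Linux packet filter. This assumes iptables, which the text above does not mention; other firewalls have their own equivalent setting. The function name reject_ident is hypothetical.

```shell
#!/bin/sh
# Assumption: Linux with iptables; must be called as root.
# Answers incoming ident (tcp port 113) connections with a TCP RST,
# so the client sees "Connection refused" instead of waiting for a timeout.
reject_ident() {
    iptables -A INPUT -p tcp --dport 113 -j REJECT --reject-with tcp-reset
}
```

Calling reject_ident from a firewall setup script appends the rule to the INPUT chain.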
#!/bin/sh
cd /ftparea && ls -laR pub > ls-lR.new 2>/dev/null && gzip -9 ls-lR.new && mv ls-lR.new.gz ls-lR.gz
This script could be run from cron on a daily basis. When the file ls-lR.gz is present, FTP search will fetch that file instead of performing a recursive ls listing.
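A slightly more defensive variant of the script above is sketched here. It takes the ftp root and the directory to list as arguments and cleans up its temporary files if a step fails. The function name make_listing and its arguments are hypothetical; the commands are the same ones used above.

```shell
#!/bin/sh
# make_listing FTPROOT DIR   (hypothetical helper)
# Writes FTPROOT/ls-lR.gz containing a recursive listing of DIR.
# The body runs in a subshell, so the caller's working directory is unchanged.
make_listing() (
    ftproot=$1
    dir=$2
    cd "$ftproot" || exit 1
    # Build the listing under a temporary name so that a half-written
    # file is never visible as ls-lR.gz.
    ls -laR "$dir" > ls-lR.new 2>/dev/null || { rm -f ls-lR.new; exit 1; }
    # -f overwrites a stale ls-lR.new.gz left behind by an earlier failed run.
    gzip -9 -f ls-lR.new || { rm -f ls-lR.new ls-lR.new.gz; exit 1; }
    # Renaming within one directory replaces the old file in a single step.
    mv ls-lR.new.gz ls-lR.gz
)
```

With the paths from the original script, the call would be `make_listing /ftparea pub`.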
If your system does not have an ls command that produces the expected output, but your ftp server gives the right listing, you might want to have the script that generates the ls-lR.gz file connect to your own ftp server with a command-line ftp client.
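One way to do this is sketched below. It assumes a traditional BSD-style ftp client that accepts -n (no auto-login) and -i (no interactive prompting) and whose ls command takes a local output file as its last argument, and it assumes the server accepts ls options such as -lR. The function name fetch_listing_via_ftp, the anonymous password, and the output path are hypothetical.

```shell
#!/bin/sh
# fetch_listing_via_ftp HOST OUTFILE   (hypothetical helper)
# Asks the ftp server on HOST for a recursive listing of its root
# directory and stores it gzip-compressed in OUTFILE.
fetch_listing_via_ftp() (
    host=$1
    out=$2
    tmp=$out.new
    # Assumption: anonymous login is allowed; the password is a placeholder.
    ftp -n -i "$host" <<EOF
user anonymous listing@$host
ls -lR $tmp
quit
EOF
    # Give up if the server returned nothing.
    [ -s "$tmp" ] || { rm -f "$tmp"; exit 1; }
    gzip -9 -f "$tmp" && mv "$tmp.gz" "$out"
)
```

A call such as `fetch_listing_via_ftp localhost /ftparea/ls-lR.gz` would then take the place of the ls command in the cron script.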
A more radical alternative is to remove the server from FTP search.