bew is a Web mirroring tool that will recursively get a whole set of Web pages using the HEAD mechanism so that it only downloads pages that have changed. It mostly works, but there are still a lot of features to be added. Support for checking external links is also implemented.

hhgg2xml will fetch articles from H2G2.com and output them in a variety of formats, including XML, text, and TomeRaider. H2G2.com is badged as the "Earth Edition of the Hitchhiker's Guide to the Galaxy".

HTTrack is an easy-to-use offline browser utility. It allows you to download a Web site from the Internet to a local directory, building recursively all directories, getting HTML, images, and other files from the server to your computer. HTTrack arranges the original site's relative link-structure. Simply open a page of the mirrored Web site in your browser, and you can browse the site from link to link, as if you were viewing it online. HTTrack can also update an existing mirrored site, and resume interrupted downloads. WebHTTrack is a Web-based GUI for HTTrack.

JCheckLinks a Java app that does hyperlink validation. It is multi-threaded, has an include/exclude list to determine which URLs will be harvested for more links and which will be only checked with a HEAD request, and adheres to the Robots Exclusion Protocol (/robots.txt).

lftp is a sophisticated command line based file transfer program. Supported protocols include FTP, HTTP, SFTP, and FISH. It has a multithreaded design allowing you to issue and execute multiple commands simultaneously or in the background. It also features mirroring capabilities and will reconnect and continue transfers in the event of a disconnection. Also, if you quit the program while transfers are still in progress, it will switch to nohup mode and finish the transfers in the background. Additional protocols supported: FTP over HTTP proxy, HTTPS and FTP over SSL, BitTorrent protocol. IPv6 is fully supported. There are lots of tunable parameters, including rate limitation, number of connections limitation and more.

OMT is a simple script for mirroring Web pages for off-line mirror/reading. It rewrites the content of the pages to make a complete and functional mirror. It has a number of options to specify what files should be mirrored and what renaming should be done.

rsync is a replacement for rcp (and scp) that has many more features. It uses the "rsync algorithm" which provides a very fast method for remote files into sync. It does this by sending just the differences in the files across the link, without requiring that both sets of files are present at one of the ends of the link beforehand.