Getting Started

These are candidate examples, each chosen to get a different point across.

Working without the Config.xml file
Working with the Config.xml file to do different things

These are mainly things that you can do just by editing the config.xml file and running the crawler from the command line. See the documentation about the Config file here.

  • A site mirror. How to run from the command line with a minimal config.xml file.
  • Broken-link checker.
  • Image downloader. Perhaps not restricted to any particular site.
  • Simple search engine (not Lucene). How to limit how far the crawler goes.
  • etc
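
To make the "how far the crawler goes" point concrete, here is a minimal sketch of a depth-limited breadth-first crawl. It is only an illustration of the general idea, not this project's implementation or its config.xml settings: the `crawl` function, the `get_links` callback, the `max_depth` parameter and the in-memory `SITE` map are all hypothetical names invented for the example.

```python
from collections import deque
from typing import Callable, Dict, Iterable


def crawl(start: str,
          get_links: Callable[[str], Iterable[str]],
          max_depth: int) -> Dict[str, int]:
    """Breadth-first crawl from `start`, following at most `max_depth` hops.

    Returns a mapping from each visited URL to the depth at which it was
    first reached. `get_links` is a hypothetical callback that returns the
    links found on a page.
    """
    depths = {start: 0}
    queue = deque([start])
    while queue:
        url = queue.popleft()
        depth = depths[url]
        if depth >= max_depth:
            continue  # pages at the limit are recorded but not expanded
        for link in get_links(url):
            if link not in depths:  # skip pages that were already seen
                depths[link] = depth + 1
                queue.append(link)
    return depths


# Hypothetical in-memory "site" standing in for real HTTP fetches.
SITE = {
    "/": ["/a", "/b"],
    "/a": ["/c"],
    "/b": ["/"],
    "/c": ["/d"],
    "/d": [],
}


def site_links(url: str) -> Iterable[str]:
    return SITE.get(url, [])


if __name__ == "__main__":
    print(crawl("/", site_links, max_depth=2))
    # {'/': 0, '/a': 1, '/b': 1, '/c': 2}
```

With a limit of 2, the page "/d" is never reached because it sits three hops from the start page. Passing the link extractor in as a callback keeps the depth logic separate from fetching and parsing, so the same loop could serve a mirror, a link checker or an indexer.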