To use ACHE with Docker, you can 1) use a pre-built image or 2) build the image yourself as follows:

1. Using the pre-build docker image

We publish pre-built docker images on Docker Hub for each released version.
You can run the latest image using:

dockerrun-p8080:8080vidanyu/ache:latest

Docker will automatically download the image from DockerHub and run it.

2. Build the image on your machine

Alternatively, you can build the image yourself and run it:

gitclonehttps://github.com/ViDA-NYU/ache.gitcdachedockerbuild-tache.

where ache is the name of the image being built.

Running the crawler using Docker

The Dockerfile used to build the image exposes two data volumes so that you can mount a directory with your configuration files (at /config) and preserve the data stored by the crawler (at /data) after the container stops.
In order to run ACHE using docker, you will need a command like:

where $PWD is the path where your config file ache.yml and the seeds.txt are located and $PWD/data is the path where the crawled data will be stored. In this command vidanyu/ache refers to the pre-built image on DockerHub. If you built the image yourself, you should use the same name that you used to build the image.

This configuration will not persist after system restarts. To make it persistent,
you will need configure the system to reload these settings automatically.
Every operating system is configured in a different way.
Following, is an example of how to install ACHE at \opt for Linux
(tested only in Ubuntu 16.04):

Only released tagged versions are published to Anaconda Cloud, so the version available through Conda may not be up-to-date.
If you want to try the most recent version, please clone the repository and build from source or use the Docker version.