Task
A task is a process triggered by a spider. It crawls data from websites, performs specific operations, or serves other purposes. It is the basic unit of spider execution.
In Crawlab, you can run a task with a single click and visually inspect task info such as stats, realtime logs, and crawled data. You can also set the Priority of tasks to determine their execution order.
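To illustrate how a priority setting can determine execution order, here is a minimal sketch of a priority queue. It is a toy model, not Crawlab's scheduler: the `TaskQueue` class, the spider names, and the convention that a lower number runs first are all assumptions made for this example.

```python
import heapq
import itertools


class TaskQueue:
    """Toy task queue. Assumption: a lower priority number runs first."""

    def __init__(self):
        self._heap = []
        # Monotonic counter so tasks with equal priority keep submission order.
        self._counter = itertools.count()

    def submit(self, spider_name, priority=5):
        """Queue a task for the given (hypothetical) spider."""
        heapq.heappush(self._heap, (priority, next(self._counter), spider_name))

    def next_task(self):
        """Pop the task that should run next."""
        _priority, _order, spider_name = heapq.heappop(self._heap)
        return spider_name

    def __len__(self):
        return len(self._heap)


queue = TaskQueue()
queue.submit("news_spider", priority=5)
queue.submit("price_spider", priority=1)
queue.submit("blog_spider", priority=5)

# Drain the queue to see the order in which tasks would be executed.
execution_order = [queue.next_task() for _ in range(len(queue))]
print(execution_order)  # ['price_spider', 'news_spider', 'blog_spider']
```

In this sketch the task with priority 1 runs before the two tasks with priority 5, and those two keep the order in which they were submitted.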
Run Task
You can either run a task from a spider or follow the steps below.
- Navigate to the Tasks page.
- Click the New Tasks button on the top left.
- Select a Spider and choose other settings.
- Click Confirm.
Restart Task
- Navigate to the Tasks page.
- Click the Restart button on the right.
Monitor Task
Crawlab provides task monitoring features that let you closely watch the results and performance of your crawling tasks.
View Logs
You can view realtime logs in Crawlab.
- Navigate to the task detail page.
- Click the Logs tab.
View Data
You can view crawled data in realtime.
- Navigate to the task detail page.
- Click the Data tab.
Cancel Task
While a task is Pending or Running, you can cancel it by either:
- clicking the Cancel button on the right in the Tasks page, or
- clicking the Cancel button on the nav bar in the task detail page.
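The cancellation rule above can be sketched as a small status check. This is an illustrative model only, not Crawlab's code: the `TaskStatus` enum, the `can_cancel` and `cancel` helpers, and every status other than Pending and Running are hypothetical names chosen for the example.

```python
from enum import Enum


class TaskStatus(Enum):
    PENDING = "pending"
    RUNNING = "running"
    # The statuses below are hypothetical; the text names only Pending and Running.
    FINISHED = "finished"
    ERROR = "error"
    CANCELLED = "cancelled"


# Only tasks that are waiting or currently executing can be cancelled.
CANCELLABLE = {TaskStatus.PENDING, TaskStatus.RUNNING}


def can_cancel(status):
    """Return True if a task in this status may be cancelled."""
    return status in CANCELLABLE


def cancel(status):
    """Return the status after cancelling, or raise if cancelling is not allowed."""
    if not can_cancel(status):
        raise ValueError(f"cannot cancel a task that is {status.value}")
    return TaskStatus.CANCELLED


print(can_cancel(TaskStatus.PENDING))   # True
print(can_cancel(TaskStatus.FINISHED))  # False
print(cancel(TaskStatus.RUNNING))       # TaskStatus.CANCELLED
```

Keeping the cancellable statuses in one set makes the rule easy to read and to adjust if the assumed status names differ from the real ones.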