CL Command Reference - STRHTTPCRL

CL Command List > STRHTTPCRL Reference

Description:

The Start HTTP Crawling (STRHTTPCRL) command allows you to create or append to a document list by crawling remote web sites, downloading files found, and saving the path names in the document list specified.

To create a document list, specify *CRTDOCL for the Option (OPTION) parameter.

To update a document list, specify *UPDDOCL for the OPTION parameter.


Examples:

STRHTTPCRL OPTION(*CRTDOCL) DOCLIST(¡¯/mydir/my.doclist¡¯)
URL(¡¯http://www.ibm.com¡¯) MAXDEPTH(2)

This command starts a new crawling session finding referenced links 2 layers from the starting URL at www.ibm.com. The document list will be created in ¡¯/mydir/my.doclist¡¯ and will contain sets of a local directory path, for example, ¡¯/QIBM/USERDATA/HTTPSVR/INDEX/DOC/www.ibm.com/us/index.html¡¯ and the actual URL to the page ¡¯http://www.ibm.com/us/¡¯. Use the Configure HTTP Search (CFGHTTPSCH) command to create an index using this document list.