SiteSpider Overview



Trellian SiteSpider is a powerful, yet simple to use program which is loaded with sophisticated search, web crawling and site mapping functions.

What does SiteSpider do?

SiteSpider can be used simply as a web browser, which works cooperatively with Internet Explorer to allow importing and utilization of your favorites directory.

Trellian SiteSpider can help you to extract valuable data from any website. The data is then cleanly organized into an easy to navigate panel, which can also be conveniently exported into folders and lists, depending upon file types.

What can SiteSpider find?

While using SiteSpider in just the same way as a web browser, you can also search for and list photos, movies, email addresses, applications, documents, music and more.

SiteSpider will also list every HTML document that it has crawled.




SiteSpider Features

Features











SiteSpider Registration

Registration


To register and upgrade your Software, you will first need to obtain a serial number from our web site by using the following URL in your web browser: http://www.trellian.com/order.htm. Alternatively you may purchase this information from an authorized Trellian Software reseller if this is easier for you.

Once you have all of your details ready, choose the Register Software option from the Help menu of SiteSpider.
Enter your Email address and Serial Number into the corresponding fields as shown in the example, then click the OK button. Make sure you type your registration details exactly as we supplied them to you, (this includes upper AND lower case letters).


Step 1. - Select Register Software from the Help menu.



Step 2. - Enter your registration details exactly as they were sent to you.



Step 3. - Click the Unlock Software button to register SiteSpider.

Field:Description:
Email AddressThis is where the email address that you used to register with is entered
Serial NumberThis is where you enter the serial number number that is sent to you upon registration
Button:Description:
Unlock Software:Click this button to unlock the software when you have entered your registration details.
OKClick this button to complete the registration process
Home PageClick this button to visit the Trellian website







SiteSpider - System Requirements

System Requirements


Operating Systems:
Microsoft Windows 95/98/Me
Microsoft Windows NT/2000/XP

Internet Browsers:
Microsoft Internet Explorer 5.x/6.x
Netscape Navigator 5.x/6.x

CPU/Processors:
Intel Pentium I, II, III, IV, AMD or compatible

RAM/Memory:
16Mb or greater (64Mb recommended)




SiteSpider - Upgrading

Upgrading


To keep SiteSpider current, upgrades should be installed from time to time. These upgrades are free to all registered users.

To download the latest upgrade, select the Upgrade option from the File menu. LiveUpgrade will contact the upgrade server to check if there are any upgrades available.



All available components will be displayed in the selection list. All items that can be upgraded will be selected, and marked with a red clock icon. To upgrade these items, tick the checkbox next to the program and/or component name and click the Upgrade button.

Items marked by a grey dot have not been installed.

Items marked with a green tick are current and no update is required.




SiteSpider Support

Support


FREE email support is offered to all registered users of SiteSpider.

If you are experiencing problems, or are requiring answers to a technical question, please write us an email detailing the issue, or issues that you are experiencing and send it along with any other related information to our Support Department.

You may contact this department by sending an email to support@trellian.com.

Support Hours:
Open Mon - Fri, 9:00am - 5:00pm (Melbourne, Australia - Eastern Standard Time)

Please include your registration key, and some basic information about your computer and the Operating System you are running..




SiteSpider - Using SiteSpider

Using SiteSpider



This chapter explains how to use SiteSpider






SiteSpider - Spidering a website

Spidering a website



  1. Enter the URL of a website or webpage to check in the Address bar. The URL should be in the form of http://www.trellian.com



    To spider a website from your hard drive, select Open from the File menu. Browse for the HTML file and click OK. File names can also be entered in the address bar eg. C:\filetocheck.htm

  2. Select a spider type from the spider drop down menu.



    Site Spider
    The Site Spider will scan a complete web site and index all the URL's that the site contains. This spider will not spider behind the root domain. You can select a URL from the spider results and restart the spider from that point.

    Gallery Page Spider
    The gallery page spider is designed to extract the content from compiled gallery pages. Commonly called "Link Lists", these pages usually contain no content of there own, but do include many links to pages that do contain.
    The first page of all the links that exit the root domain are included in the spider results. This kind of spider often yields the biggest return per spider.

    Engine Page Spider
    The engine page spider is very similar to the gallery page spider. The root page is the only page included at the root domain. This spider is designed to extract the content from engine page results.
    You can then select the domain with the content that matches your requirements and continue searching using the default spider.

    Keyword Spider
    The keyword spider is an automated engine page spider.Trellian SiteSpider will automatically extract the search engine results before beginning a standard engine page spider. This spider is useful if you want to do a quick spider, but sometimes the results don't have the quality of a manually triggered spider. Simply enter keywords and select a search engine from the list.



  3. The selected spider will start spidering the website. SiteSpider's status is shown in the bottom left corner of the window.



  4. To stop the spider click the Stop button.



    To resume the spider click the Spider button then click the Yes button when prompted to resume. The unspidered files are shown in the Pending Files tab.








SiteSpider - Results

Results



When finished spidering a website SiteSpider's status bar will show "Done".




The site is broken down into the categories displayed in the sitemap window pane. Right click any of the files to display tools to work with the file type.

CategoryDescription
SiteMapDisplays all the web pages found on the website in a sitemap format
PhotosDisplays the photos ( jpg files) found on the website
ImagesDisplays the images(gif, png files) found on the website
MoviesDisplays all the move files found on the website
AddressesDisplays all the email addresses found on the website
ApplicationsDisplays all the application files found on the website
DocumentsDisplays the documents (pdf, doc, xls files) found on the website
MusicDisplays all the music files found on the website
ArchivesDisplays all the archive files (zip, rar files) found on the website
HTMLDisplays all the HTML pages found on the website
OtherDisplays any other items found on the website that do not fall under any of the other categories





SiteSpider - Using SiteSpider

Using SiteSpider



This chapter explains the SiteSpider interface






SiteSpider - Menu Options

Menu Options





File
OpenOpens a HTML file
Save AsSaves changes a HTML file using a different file name
Page SetupDisplays the page setup for printing
PrintPrints the current webpage
Print PreviewDisplays a preview before printing
Page PropertiesDisplays the file properties of the current page
Internet OptionsDisplays the Internet Explorer Internet Options
UpgradeConnects to the Trellian server then checks for the availability of automated Software Upgrades/Updates
ExitExits the program




Edit
CutCuts the currently selected item to the clipboard
CopyCopies the currently selected item to the clipboard
PastePastes the item from the clipboard
Select AllSelects all items
Find (on This Page)Searches for text on the webpage




Export
Go ToNavigates the website
StopStops the spider
RefreshRefreshes the webpage
Text SizeChanges the webpage text size
SourceDisplays the HTML source code in NotePad
Web BrowserToggles display of the web browser panel
Site ExplorerToggles display of the Site Explorer panel




Help
AboutShows information about the program
Register SoftwareFrom here you may enter your registration details in order to register the program
Trellian SoftwareBrowses the Trellian Software website
Trellian ServicesBrowses the Trellian Services website