User Agent Category: Spider/Bot

Any type of robot that does not fit in other categories from the Crawlers group. This include unknown and undocumented visitors that act like spiders, traversing a large number of pages in a small period of time.

AgentDescription
A-OnlineA-Online.at robot, now Jet2Web Search.

AbcdatosUsed to verify availability of the ABCdatos directory entries, checking HTTP HEAD.

ABCsearch

AberjaLink checker.

AbiLogicBotCheckes links for the AbiLogic web directory

aBotNameprotect copyright search robot.

About

AboutUsBotThe AboutUs:Bot gathers descriptive information about a website from several sources to build a Wiki Page. This pre-built wiki page gives website owners and AboutUs.org contributors a head-start in creating a useful and informative AboutUs.org page.

AckAckerm search robot.

Acme.SpiderA Java utility class for writing your own robots.

AcoiPicture finder robot.

Acoon

Advanced Email ExtractorE-mail collector.

Aesop

AgentNameLinkomatic submission verifier.

Ah-ha

AhoyResearch project at the University of Washington, for finding personal Homepages.

AIBOTReal artificial intelligence search engine China.

AITCSRobotIts purpose is to generate a Resource Discovery database. This Robot traverses the net and creates a searchable database of Web pages. It stores the title string of the HTML document and the absolute url. A search engine provides the boolean AND & OR query models with or without filtering the stop list of words. Feature is kept for the Web page owners to add the url to the searchable database.

Aladin

Alcohol Search

AlexaAlexa crawler.

AlkalineUnix/NT internet/intranet Vestris search engine designed at the University of Geneva.

Allesklar

Almaden/IBM PlanetwideRestricted to IBM owned or related domains. Set of research technologies that collect, store and analyze massive amounts of unstructured and semi-structured text. It is built on an open, extensible platform that enables the discovery of trends, patterns and relationships from data.

Amgen

AMZNKAssocBot

Ananzi

AnnomilleAnnomille historical oriented robot.

AnonymizerFaked user agent.

AnthillGather price information automatically from online stores. Research project at the University of Mannheim.

AntiBot/AntiSearchDiscontinued robot.

AnzwersYahoo robot.

Aport

ArachmoWeb site file extraction tool.

ArachnophiliaCollect approximately 10k html documents for testing automatic abstract generation.

AraleJava multithreaded web spider. Download entire web sites or specific resources from the web. Render dynamic sites to static pages.

AraneoFor crawling and indexing web pages written in the international language Esperanto.

AraybOtAgent software of AraykOO! which crawls web sites listed in dmoz.org/Adult/, in order to build a adult search engine.

ArchitextSpider/ExciteGenerate a Resource Discovery database and statistics. The ArchitextSpider collects information for the Excite and WebCrawler search engines.

ArethaA crude robot built on top of Netscape and Userland Frontier, a scripting system for Macs

ArgusSimpy Bookmarklet crawler.

AriadnePrototype of an environment for testing focused crawling strategies.

Arianna

ArksThe Arks robot is used to build the database for the dpsindia/lawvistas.com search service. The robot runs weekly, and visits sites in a random order.

Asahina AntennaASAHINA Antenna information detecting agent.

Ask.24x.InfoAsk 24x Info robot.

AskAboutOilPetroleum related search, using Nutch.

ASpiderASpider is a CGI script that searches the web for keywords given by the user through a form.

AspTearURL fetching program component, Download32.com spider.

ASSORTAssociative sort robot.

Astra/AstraFind/AstraSpiderAdult search robot.

AtlocalLocal business search robot.

ATN WorldwideThe ATN robot is used to build the database for the AllThatNet search service operated by All That Net. The robot runs weekly, and visits sites in a random order.

AtomzRobot used for web site search service. Developed for Atomz.com, launched in 1999.

atSpiderCeased email harvester/spambot.

AugurFindAugurnet search robot.

AURESYSThe AURESYS is used to build a personnal database for somebody who search information. The database is structured to be analysed.

AutoEmailSpiderAuto Email Pro Email harvester.

AV Fetch

Avalon

AztrxAdult toys and videos.

BaBoomBaBoom Web Portal (ODP) robot.

BackRub

BACS

Badongo

BanBotsPerl script-based robot.

BarraHomeBarrahome crawler.

Batsch

bBotMainly intended for site level search, sometimes set loose.

BCentralBCentral crawler.

bdcIndexerBusiness.com robot.

BDFetchBrandimensions Brand Protection robot.

BeautyBotRobot for Cosmoty, beauty and wellness search engine.

BebopBotA Passion for Jazz music related search robot.

BillBotCarnegie Mellon School robot, link checker.

BimBotProvides converged data and voice services.

Bisnisseek

BitacleBlog search archive robot.

BjaalandCrawls sites listed in the ODP.

BlackWidowStarted as a research project, used to find links for a random link generator. Also used to research the growth of specific sites.

Blaiz-BeeBlaiz Enterprises RawGrunt search.

Blazer

BlogBotblogdex robot from MIT.edu.

BlogpulseBlog search.

BlogSearch

BlogsNowBotBlogsNow realtime link tracker robot.

BlogWatcherRobot from Okumura Group Tokyo.

BlogzIce

BloodhoundBloodhound will download a whole web site depending on the number of links to follow specified by the user.

BoardReaderBoardReader search image and favicon fetcher.

Boito

Borg-BotDevelopmental crawler to feed a search engine.

Bot

BoxSeaBotNutch-based crawler. This robot is used to find pages for building the BoxSea search engine indices. The robot code uses Nutch. Earlier experimental crawls were done under various user agent names such as NutchCVS.

Brismee

BruinBotWebarchive Project Bruinbot crawler.

BSDSeekInktomi Hotbot-Lycos NBCi robot.

BSpiderBSpider is crawling inside of Japanese domain for indexing.

BuildCMSMarket monitoring project.