Web hosting with free domain names in europeCpanel X unlimited pop3 accounts with linux web servers
dedicated server uptimeGreek Lang | gr domain name registration 
SERVICES
Web Hosting
Cpanel Free Scripts
Dedicated Servers
Servers Stock
Network
Web Design
Domain Parking
Domain Registration
Data Center Tour
FREE WEB TOOLS
Flash Toolbar Generators
Graphic Toolbar Generators
DHTML / CSS Menu Generators
Java Script Menu Generators
MAILING LIST
Sign up to our mailing list
E-mail:
I want to:
SSL
Google Ads

Meta Crawlers Index

  • robot-id: kapsi
    robot-name: image.kapsi.net
    robot-cover-url: http://image.kapsi.net/
    robot-details-url: http://image.kapsi.net/index.php?page=robot
    robot-owner-name: Jaakko Heusala
    robot-owner-url: http://huoh.kapsi.net/
    robot-owner-email: Jaakko.Heusala@kapsi.net
    robot-status: development
    robot-purpose: indexing
    robot-type: standalone
    robot-platform: unix
    robot-availability: data
    robot-exclusion: yes
    robot-exclusion-useragent: image.kapsi.net
    robot-noindex: no
    robot-host: addr-212-50-142-138.suomi.net
    robot-from: yes
    robot-useragent: image.kapsi.net/1.0
    robot-language: perl
    robot-description: The image.kapsi.net robot is used to build the database for the image.kapsi.net search service. The robot runs currently in a random times.
    robot-history: The Robot was build for image.kapsi.net's database in year 2001.
    robot-environment: hobby, research
    modified-date: Thu, 13 Dec 2001 23:28:23 EET
    modified-by:

  • robot-id: katipo
    robot-name: Katipo
    robot-cover-url: http://www.vuw.ac.nz/~newbery/Katipo.html
    robot-details-url: http://www.vuw.ac.nz/~newbery/Katipo/Katipo-doc.html
    robot-owner-name: Michael Newbery
    robot-owner-url: http://www.vuw.ac.nz/~newbery
    robot-owner-email: Michael.Newbery@vuw.ac.nz
    robot-status: active
    robot-purpose: maintenance
    robot-type: standalone
    robot-platform: Macintosh
    robot-availability: binary
    robot-exclusion: no
    robot-exclusion-useragent:
    robot-noindex: no
    robot-host: *
    robot-from: yes
    robot-useragent: Katipo/1.0
    robot-language: c
    robot-description: Watches all the pages you have previously visited
    and tells you when they have changed.
    robot-history:
    robot-environment: commercial (free)
    modified-date: Tue, 25 Jun 96 11:40:07 +1200
    modified-by: Michael Newbery

  • robot-id: kdd
    robot-name: KDD-Explorer
    robot-cover-url: http://mlc.kddvw.kcom.or.jp/CLINKS/html/clinks.html
    robot-details-url: not available
    robot-owner-name: Kazunori Matsumoto
    robot-owner-url: not available
    robot-owner-email: matsu@lab.kdd.co.jp
    robot-status: development (to be avtive in June 1997)
    robot-purpose: indexing
    robot-type: standalone
    robot-platform: unix
    robot-availability: none
    robot-exclusion: yes
    robot-exclusion-useragent:KDD-Explorer
    robot-noindex: no
    robot-host: mlc.kddvw.kcom.or.jp
    robot-from: yes
    robot-useragent: KDD-Explorer/0.1
    robot-language: c
    robot-description: KDD-Explorer is used for indexing valuable documents
    which will be retrieved via an experimental cross-language
    search engine, CLINKS.
    robot-history: This robot was designed in Knowledge-bases Information
    processing Laboratory, KDD R&D Laboratories, 1996-1997
    robot-environment: research
    modified-date: Mon, 2 June 1997 18:00:00 JST
    modified-by: Kazunori Matsumoto

  • robot-id:kilroy
    robot-name:Kilroy
    robot-cover-url:http://purl.org/kilroy
    robot-details-url:http://purl.org/kilroy
    robot-owner-name:OCLC
    robot-owner-url:http://www.oclc.org
    robot-owner-email:kilroy@oclc.org
    robot-status:active
    robot-purpose:indexing,statistics
    robot-type:standalone
    robot-platform:unix,windowsNT
    robot-availability:none
    robot-exclusion:yes
    robot-exclusion-useragent:*
    robot-noindex:no
    robot-host:*.oclc.org
    robot-from:no
    robot-useragent:yes
    robot-language:java
    robot-description:Used to collect data for several projects.
    Runs constantly and visits site no faster than once every 90 seconds.
    robot-history:none
    robot-environment:research,service
    modified-date:Thursday, 24 Apr 1997 20:00:00 GMT
    modified-by:tkac

  • robot-id: ko_yappo_robot
    robot-name: KO_Yappo_Robot
    robot-cover-url: http://yappo.com/info/robot.html
    robot-details-url: http://yappo.com/
    robot-owner-name: Kazuhiro Osawa
    robot-owner-url: http://yappo.com/
    robot-owner-email: office_KO@yappo.com
    robot-status: active
    robot-purpose: indexing
    robot-type: standalone
    robot-platform: unix
    robot-availability: none
    robot-exclusion: yes
    robot-exclusion-useragent: ko_yappo_robot
    robot-noindex: yes
    robot-host: yappo.com,209.25.40.1
    robot-from: yes
    robot-useragent: KO_Yappo_Robot/1.0.4(http://yappo.com/info/robot.html)
    robot-language: perl
    robot-description: The KO_Yappo_Robot robot is used to build the database
    for the Yappo search service by k,osawa
    (part of AOL).
    The robot runs random day, and visits sites in a random order.
    robot-history: The robot is hobby of k,osawa
    at the Tokyo in 1997
    robot-environment: hobby
    modified-date: Fri, 18 Jul 1996 12:34:21 GMT
    modified-by: KO

  • robot-id: labelgrabber.txt
    robot-name: LabelGrabber
    robot-cover-url: http://www.w3.org/PICS/refcode/LabelGrabber/index.htm
    robot-details-url: http://www.w3.org/PICS/refcode/LabelGrabber/index.htm
    robot-owner-name: Kyle Jamieson
    robot-owner-url: http://www.w3.org/PICS/refcode/LabelGrabber/index.htm
    robot-owner-email: jamieson@mit.edu
    robot-status: active
    robot-purpose: Grabs PICS labels from web pages, submits them to a label bueau
    robot-type: standalone
    robot-platform: windows, windows95, windowsNT, unix
    robot-availability: source
    robot-exclusion: yes
    robot-exclusion-useragent: label-grabber
    robot-noindex: no
    robot-host: head.w3.org
    robot-from: no
    robot-useragent: LabelGrab/1.1
    robot-language: java
    robot-description: The label grabber searches for PICS labels and submits
    them to a label bureau
    robot-history: N/A
    robot-environment: research
    modified-date: Wed, 28 Jan 1998 17:32:52 GMT
    modified-by: jamieson@mit.edu

  • robot-id: larbin
    robot-name: larbin
    robot-cover-url: http://para.inria.fr/~ailleret/larbin/index-eng.html
    robot-owner-name: Sebastien Ailleret
    robot-owner-url: http://para.inria.fr/~ailleret/
    robot-owner-email: sebastien.ailleret@inria.fr
    robot-status: active
    robot-purpose: Your imagination is the only limit
    robot-type: standalone
    robot-platform: Linux
    robot-availability: source (GPL), mail me for customization
    robot-exclusion: yes
    robot-exclusion-useragent: larbin
    robot-noindex: no
    robot-host: *
    robot-from: no
    robot-useragent: larbin (+mail)
    robot-language: c++
    robot-description: Parcourir le web, telle est ma passion
    robot-history: french research group (INRIA Verso)
    robot-environment: hobby
    modified-date: 2000-3-28
    modified-by: Sebastien Ailleret

  • robot-id: legs
    robot-name: legs
    robot-cover-url: http://www.MagPortal.com/
    robot-details-url:
    robot-owner-name: Bill Dimm
    robot-owner-url: http://www.HotNeuron.com/
    robot-owner-email: admin@magportal.com
    robot-status: active
    robot-purpose: indexing
    robot-type: standalone
    robot-platform: linux
    robot-availability: none
    robot-exclusion: yes
    robot-exclusion-useragent: legs
    robot-noindex: no
    robot-host:
    robot-from: yes
    robot-useragent: legs
    robot-language: perl5
    robot-description: The legs robot is used to build the magazine article
    database for MagPortal.com.
    robot-history:
    robot-environment: service
    modified-date: Wed, 22 Mar 2000 14:10:49 GMT
    modified-by: Bill Dimm

  • robot-id: linkidator
    robot-name: Link Validator
    robot-cover-url:
    robot-details-url:
    robot-owner-name: Thomas Gimon
    robot-owner-url:
    robot-owner-email: tgimon@mitre.org
    robot-status: development
    robot-purpose: maintenance
    robot-type: standalone
    robot-platform: unix, windows
    robot-availability: none
    robot-exclusion: yes
    robot-exclusion-useragent: Linkidator
    robot-noindex: yes
    robot-nofollow: yes
    robot-host: *.mitre.org
    robot-from: yes
    robot-useragent: Linkidator/0.93
    robot-language: perl5
    robot-description: Recursively checks all links on a site, looking for
    broken or redirected links. Checks all off-site links using HEAD
    requests and does not progress further. Designed to behave well and to
    be very configurable.
    robot-history: Built using WWW-Robot-0.022 perl module. Currently in
    beta test. Seeking approval for public release.
    robot-environment: internal
    modified-date: Fri, 20 Jan 2001 02:22:00 EST
    modified-by: Thomas Gimon

  • robot-id:linkscan
    robot-name:LinkScan
    robot-cover-url:http://www.elsop.com/
    robot-details-url:http://www.elsop.com/linkscan/overview.html
    robot-owner-name:Electronic Software Publishing Corp. (Elsop)
    robot-owner-url:http://www.elsop.com/
    robot-owner-email:sales@elsop.com
    robot-status:Robot actively in use
    robot-purpose:Link checker, SiteMapper, and HTML Validator
    robot-type:Standalone
    robot-platform:Unix, Linux, Windows 98/NT
    robot-availability:Program is shareware
    robot-exclusion:No
    robot-exclusion-useragent:
    robot-noindex:Yes
    robot-host:*
    robot-from:
    robot-useragent:LinkScan Server/5.5 | LinkScan Workstation/5.5
    robot-language:perl5
    robot-description:LinkScan checks links, validates HTML and creates site maps
    robot-history: First developed by Elsop in January,1997
    robot-environment:Commercial
    modified-date:Fri, 3 September 1999 17:00:00 PDT
    modified-by: Kenneth R. Churilla

  • robot-id: linkwalker
    robot-name: LinkWalker
    robot-cover-url: http://www.seventwentyfour.com
    robot-details-url: http://www.seventwentyfour.com/tech.html
    robot-owner-name: Roy Bryant
    robot-owner-url:
    robot-owner-email: rbryant@seventwentyfour.com
    robot-status: active
    robot-purpose: maintenance, statistics
    robot-type: standalone
    robot-platform: windowsNT
    robot-availability: none
    robot-exclusion: yes
    robot-exclusion-useragent: linkwalker
    robot-noindex: yes
    robot-host: *.seventwentyfour.com
    robot-from: yes
    robot-useragent: LinkWalker
    robot-language: c++
    robot-description: LinkWalker generates a database of links.
    We send reports of bad ones to webmasters.
    robot-history: Constructed late 1997 through April 1998.
    In full service April 1998.
    robot-environment: service
    modified-date: Wed, 22 Apr 1998
    modified-by: Roy Bryant

  • robot-id:lockon
    robot-name:Lockon
    robot-cover-url:
    robot-details-url:
    robot-owner-name:Seiji Sasazuka & Takahiro Ohmori
    robot-owner-url:
    robot-owner-email:search@rsch.tuis.ac.jp
    robot-status:active
    robot-purpose:indexing
    robot-type:standalone
    robot-platform:UNIX
    robot-availability:none
    robot-exclusion:yes
    robot-exclusion-useragent:Lockon
    robot-noindex:yes
    robot-host:*.hitech.tuis.ac.jp
    robot-from:yes
    robot-useragent:Lockon/xxxxx
    robot-language:perl5
    robot-description:This robot gathers only HTML document.
    robot-history:This robot was developed in the Tokyo university of information sciences in 1998.
    robot-environment:research
    modified-date:Tue. 10 Nov 1998 20:00:00 GMT
    modified-by:Seiji Sasazuka & Takahiro Ohmori

  • robot-id:logo_gif
    robot-name: logo.gif Crawler
    robot-cover-url: http://www.inm.de/projects/logogif.html
    robot-details-url:
    robot-owner-name: Sevo Stille
    robot-owner-url: http://www.inm.de/people/sevo
    robot-owner-email: sevo@inm.de
    robot-status: under development
    robot-purpose: indexing
    robot-type: standalone
    robot-platform: unix
    robot-availability: none
    robot-exclusion: yes
    robot-exclusion-useragent: logo_gif_crawler
    robot-noindex: no
    robot-host: *.inm.de
    robot-from: yes
    robot-useragent: logo.gif crawler
    robot-language: perl
    robot-description: meta-indexing engine for corporate logo graphics
    The robot runs at irregular intervals and will only pull a start page and
    its associated /.*logo\.gif/i (if any). It will be terminated once a
    statistically
    significant number of samples has been collected.
    robot-history: logo.gif is part of the design diploma of Markus Weisbeck,
    and tries to analyze the abundance of the logo metaphor in WWW
    corporate design.
    The crawler and image database were written by Sevo Stille and Peter
    Frank of the Institut für Neue Medien, respectively.
    robot-environment: research, statistics
    modified-date: 25.5.97
    modified-by: Sevo Stille

  • robot-id: lycos
    robot-name: Lycos
    robot-cover-url: http://lycos.cs.cmu.edu/
    robot-details-url:
    robot-owner-name: Dr. Michael L. Mauldin
    robot-owner-url: http://fuzine.mt.cs.cmu.edu/mlm/home.html
    robot-owner-email: fuzzy@cmu.edu
    robot-status:
    robot-purpose: indexing
    robot-type:
    robot-platform:
    robot-availability:
    robot-exclusion: yes
    robot-exclusion-useragent:
    robot-noindex: no
    robot-host: fuzine.mt.cs.cmu.edu, lycos.com
    robot-from:
    robot-useragent: Lycos/x.x
    robot-language:
    robot-description: This is a research program in providing information
    retrieval and discovery in the WWW, using a finite memory
    model of the web to guide intelligent, directed searches for
    specific information needs
    robot-history:
    robot-environment:
    modified-date:
    modified-by:

  • robot-id: macworm
    robot-name: Mac WWWWorm
    robot-cover-url:
    robot-details-url:
    robot-owner-name: Sebastien Lemieux
    robot-owner-url:
    robot-owner-email: lemieuse@ERE.UMontreal.CA
    robot-status:
    robot-purpose: indexing
    robot-type:
    robot-platform: Macintosh
    robot-availability: none
    robot-exclusion:
    robot-exclusion-useragent:
    robot-noindex: no
    robot-host:
    robot-from:
    robot-useragent:
    robot-language: hypercard
    robot-description: a French Keyword-searching robot for the Mac The author has
    decided not to release this robot to the
    public
    robot-history:
    robot-environment:
    modified-date:
    modified-by:

  • robot-id: magpie
    robot-name: Magpie
    robot-cover-url:
    robot-details-url:
    robot-owner-name: Keith Jones
    robot-owner-url:
    robot-owner-email: Keith.Jones@blueberry.co.uk
    robot-status: development
    robot-purpose: indexing, statistics
    robot-type: standalone
    robot-platform: unix
    robot-availability:
    robot-exclusion: no
    robot-exclusion-useragent:
    robot-noindex: no
    robot-host: *.blueberry.co.uk, 194.70.52.*, 193.131.167.144
    robot-from: no
    robot-useragent: Magpie/1.0
    robot-language: perl5
    robot-description: Used to obtain information from a specified list of web pages for local indexing. Runs every two hours, and visits only a small number of sites.
    robot-history: Part of a research project. Alpha testing from 10 July 1996, Beta testing from 10 September.
    robot-environment: research
    modified-date: Wed, 10 Oct 1996 13:15:00 GMT
    modified-by: Keith Jones

  • robot-id: marvin
    robot-name: marvin/infoseek
    robot-details-url:
    robot-cover-url: http://www.infoseek.de/
    robot-owner-name: WSI Webseek Infoservice GmbH & Co KG.
    robot-owner-url: http://www.infoseek.de/
    robot-owner-email: marvin-team@webseek.de
    robot-status: development
    robot-purpose: indexing
    robot-type: standalone
    robot-platform: unix
    robot-availability: none
    robot-exclusion: yes
    robot-exclusion-useragent: marvin
    robot-noindex: yes
    robot-nofollow: yes
    robot-host: arthur*.sda.t-online.de
    robot-from: yes
    robot-useragent: marvin/infoseek (marvin-team@webseek.de)
    robot-language: java
    robot-description:
    robot-history: day of birth: 4.2. 2001 - replaces Infoseek Sidewinder
    robot-environment: comercial
    modified-date: Fri, 11 May 2001 17:28:52 GMT

  • robot-id: mattie
    robot-name: Mattie
    robot-cover-url: http://www.mcw.aarkayn.org
    robot-details-url: http://www.mcw.aarkayn.org/web/mattie.asp
    robot-owner-name: Matt
    robot-owner-url: http://www.mcw.aarkayn.org
    robot-owner-email: matt@mcw.aarkayn.org
    robot-status: Active
    robot-purpose: Procurement Spider
    robot-type: Standalone
    robot-platform: UNIX
    robot-availability: None
    robot-exclusion: Yes
    robot-exclusion-useragent: mattie
    robot-noindex: N/A
    robot-nofollow: Yes
    robot-host: mattie.mcw.aarkayn.org
    robot-from: Yes
    robot-useragent: M/3.8
    robot-language: C++
    robot-description: Mattie is an all-source procurement spider.
    robot-history: Created 2000 Mar. 03 Fri. 18:48:16 -0500 GMT (R) as an MP3
    spider, Mattie was reborn 2002 Jul. 07 Sun. 03:47:29 -0500 GMT (R) as an
    all-source procurement spider.
    robot-environment: Hobby
    modified-date: Fri, 13 Sep 2002 00:36:13 GMT
    modified-by: Matt

  • robot-id: mediafox
    robot-name: MediaFox
    robot-cover-url: none
    robot-details-url: none
    robot-owner-name: Lars Eilebrecht
    robot-owner-url: http://www.home.unix-ag.org/sfx/
    robot-owner-email: sfx@uni-media.de
    robot-status: development
    robot-purpose: indexing and maintenance
    robot-type: standalone
    robot-platform: (Java)
    robot-availability: none
    robot-exclusion: yes
    robot-exclusion-useragent: mediafox
    robot-noindex: yes
    robot-host: 141.99.*.*
    robot-from: yes
    robot-useragent: MediaFox/x.y
    robot-language: Java
    robot-description: The robot is used to index meta information of a
    specified set of documents and update a database
    accordingly.
    robot-history: Project at the University of Siegen
    robot-environment: research
    modified-date: Fri Aug 14 03:37:56 CEST 1998
    modified-by: Lars Eilebrecht

  • robot-id:merzscope
    robot-name:MerzScope
    robot-cover-url:http://www.merzcom.com
    robot-details-url:http://www.merzcom.com
    robot-owner-name:(Client based robot)
    robot-owner-url:(Client based robot)
    robot-owner-email:
    robot-status:actively in use
    robot-purpose:WebMapping
    robot-type:standalone
    robot-platform: (Java Based) unix,windows95,windowsNT,os2,mac etc ..
    robot-availability:binary
    robot-exclusion: yes
    robot-exclusion-useragent: MerzScope
    robot-noindex: no
    robot-host:(Client Based)
    robot-from:
    robot-useragent: MerzScope
    robot-language: java
    robot-description: Robot is part of a Web-Mapping package called MerzScope,
    to be used mainly by consultants, and web masters to create and
    publish maps, on and of the World wide web.
    robot-history:
    robot-environment:
    modified-date: Fri, 13 March 1997 16:31:00
    modified-by: Philip Lenir, MerzScope lead developper

  • robot-id: meshexplorer
    robot-name: NEC-MeshExplorer
    robot-cover-url: http://netplaza.biglobe.or.jp/
    robot-details-url: http://netplaza.biglobe.or.jp/keyword.html
    robot-owner-name: web search service maintenance group
    robot-owner-url: http://netplaza.biglobe.or.jp/keyword.html
    robot-owner-email: web-dir@mxa.meshnet.or.jp
    robot-status: active
    robot-purpose: indexing
    robot-type: standalone
    robot-platform: unix
    robot-availability: none
    robot-exclusion: yes
    robot-exclusion-useragent: NEC-MeshExplorer
    robot-noindex: no
    robot-host: meshsv300.tk.mesh.ad.jp
    robot-from: yes
    robot-useragent: NEC-MeshExplorer
    robot-language: c
    robot-description: The NEC-MeshExplorer robot is used to build database for the NETPLAZA
    search service operated by NEC Corporation. The robot searches URLs
    around sites in japan(JP domain).
    The robot runs every day, and visits sites in a random order.
    robot-history: Prototype version of this robot was developed in C&C Research
    Laboratories, NEC Corporation. Current robot (Version 1.0) is based
    on the prototype and has more functions.
    robot-environment: research
    modified-date: Jan 1, 1997
    modified-by: Nobuya Kubo, Hajime Takano

  • robot-id: MindCrawler
    robot-name: MindCrawler
    robot-cover-url: http://www.mindpass.com/_technology_faq.htm
    robot-details-url:
    robot-owner-name: Mindpass
    robot-owner-url: http://www.mindpass.com/
    robot-owner-email: support@mindpass.com
    robot-status: active
    robot-purpose: indexing
    robot-type: standalone
    robot-platform: linux
    robot-availability: none
    robot-exclusion: yes
    robot-exclusion-useragent: MindCrawler
    robot-noindex: no
    robot-host: *
    robot-from: no
    robot-useragent: MindCrawler
    robot-language: c++
    robot-description:
    robot-history:
    robot-environment:
    modified-date: Tue Mar 28 11:30:09 CEST 2000
    modified-by:

  • robot-id: mnogosearch
    robot-name: mnoGoSearch search engine software
    robot-cover-url: http://www.mnogosearch.org
    robot-details-url: http://www.mnogosearch.org/features.html
    robot-owner-name: Lavtech.com corp.
    robot-owner-url: http://www.mnogosearch.org
    robot-owner-email: support@mnogosearch.org
    robot-status: active
    robot-purpose: indexing
    robot-type: standalone
    robot-platform: unix, windows, mac
    robot-availability: source
    robot-exclusion: yes
    robot-exclusion-useragent: udmsearch
    robot-noindex: yes
    robot-host: *
    robot-from: no
    robot-useragent: UdmSearch
    robot-language: c
    robot-description: mnoGoSearch search engine software (formerly known
    as UDMSearch) is an advanced search solution for large-scale websites
    and Intranet. It is based on SQL database and supports numerous
    features.
    robot-history: Formerly known as UDMSearch was developed as the search
    engine for the Russian republic of Udmurtia.
    robot-environment: commercial
    modified-date: Wed, 12 Sept 2001
    modified-by: Dmitry Tkatchenko

  • robot-id:moget
    robot-name:moget
    robot-cover-url:
    robot-details-url:
    robot-owner-name:NTT-ME Infomation Xing,Inc
    robot-owner-url:http://www.nttx.co.jp
    robot-owner-email:moget@goo.ne.jp
    robot-status:active
    robot-purpose:indexing,statistics
    robot-type:standalone
    robot-platform:unix
    robot-availability:none
    robot-exclusion:yes
    robot-exclusion-useragent:moget
    robot-noindex:yes
    robot-host:*.goo.ne.jp
    robot-from:yes
    robot-useragent:moget/1.0
    robot-language:c
    robot-description: This robot is used to build the database for the search service operated by goo
    robot-history:
    robot-environment:service
    modified-date:Thu, 30 Mar 2000 18:40:37 GMT
    modified-by:moget@goo.ne.jp

  • robot-id: momspider
    robot-name: MOMspider
    robot-cover-url: http://www.ics.uci.edu/WebSoft/MOMspider/
    robot-details-url:
    robot-owner-name: Roy T. Fielding
    robot-owner-url: http://www.ics.uci.edu/dir/grad/Software/fielding
    robot-owner-email: fielding@ics.uci.edu
    robot-status: active
    robot-purpose: maintenance, statistics
    robot-type: standalone
    robot-platform: UNIX
    robot-availability: source
    robot-exclusion: yes
    robot-exclusion-useragent:
    robot-noindex: no
    robot-host: *
    robot-from: yes
    robot-useragent: MOMspider/1.00 libwww-perl/0.40
    robot-language: perl 4
    robot-description: to validate links, and generate statistics. It's usually run
    from anywhere
    robot-history: Originated as a research project at the University of
    California, Irvine, in 1993. Presented at the First
    International WWW Conference in Geneva, 1994.
    robot-environment:
    modified-date: Sat May 6 08:11:58 1995
    modified-by: fielding@ics.uci.edu

  • robot-id: monster
    robot-name: Monster
    robot-cover-url: http://www.neva.ru/monster.list/russian.www.html
    robot-details-url:
    robot-owner-name: Dmitry Dicky
    robot-owner-url: http://wild.stu.neva.ru/
    robot-owner-email: diwil@wild.stu.neva.ru
    robot-status: active
    robot-purpose: maintenance, mirroring
    robot-type: standalone
    robot-platform: UNIX (Linux)
    robot-availability: binary
    robot-exclusion: yes
    robot-exclusion-useragent:
    robot-noindex: no
    robot-host: wild.stu.neva.ru
    robot-from:
    robot-useragent: Monster/vX.X.X -$TYPE ($OSTYPE)
    robot-language: C
    robot-description: The Monster has two parts - Web searcher and Web analyzer.
    Searcher is intended to perform the list of WWW sites of
    desired domain (for example it can perform list of all
    WWW sites of mit.edu, com, org, etc... domain)
    In the User-agent field $TYPE is set to 'Mapper' for Web searcher
    and 'StAlone' for Web analyzer.
    robot-history: Now the full (I suppose) list of ex-USSR sites is produced.
    robot-environment:
    modified-date: Tue Jun 25 10:03:36 1996
    modified-by:

  • robot-id: motor
    robot-name: Motor
    robot-cover-url: http://www.cybercon.de/Motor/index.html
    robot-details-url:
    robot-owner-name: Mr. Oliver Runge, Mr. Michael Goeckel
    robot-owner-url: http://www.cybercon.de/index.html
    robot-owner-email: Motor@cybercon.technopark.gmd.de
    robot-status: developement
    robot-purpose: indexing
    robot-type: standalone
    robot-platform: mac
    robot-availability: data
    robot-exclusion: yes
    robot-exclusion-useragent: Motor
    robot-noindex: no
    robot-host: Michael.cybercon.technopark.gmd.de
    robot-from: yes
    robot-useragent: Motor/0.2
    robot-language: 4th dimension
    robot-description: The Motor robot is used to build the database for the
    www.webindex.de search service operated by CyberCon. The robot ios under
    development - it runs in random intervals and visits site in a priority
    driven order (.de/.ch/.at first, root and robots.txt first)
    robot-history:
    robot-environment: service
    modified-date: Wed, 3 Jul 1996 15:30:00 +0100
    modified-by: Michael Goeckel (Michael@cybercon.technopark.gmd.de)

  • robot-id: msnbot
    robot-name: MSNBot
    robot-cover-url: http://search.msn.com
    robot-details-url: http://search.msn.com/msnbot.htm
    robot-owner-name: Microsoft Corp.
    robot-owner-url: http://www.microsoft.com
    robot-owner-email: msnbot@microsoft.com
    robot-status: active
    robot-purpose: indexing
    robot-type: standalone
    robot-platform: Windows Server 2000, Windows Server 2003
    robot-availability: none
    robot-exclusion: yes
    robot-exclusion-useragent: msnbot
    robot-noindex: yes
    robot-host: <TBD>
    robot-from: yes
    robot-useragent: MSNBOT/0.1 (http://search.msn.com/msnbot.htm)
    robot-language: C++
    robot-description: MSN Search Crawler
    robot-history: Developed by Microsoft Corp.
    robot-environment: commercial
    modified-date: June 23, 2003
    modified-by: msnbot@microsoft.com

  • robot-id: muncher
    robot-name: Muncher
    robot-details-url: http://www.goodlookingcooking.co.uk/info.htm
    robot-cover-url: http://www.goodlookingcooking.co.uk
    robot-owner-name: Chris Ridings
    robot-owner-url: http://www.goodlookingcooking.co.uk
    robot-owner-email: muncher@ridings.org.uk
    robot-status: development
    robot-purpose: indexing
    robot-type: standalone
    robot-platform: unix
    robot-availability: none
    robot-exclusion: yes
    robot-exclusion-useragent: muncher
    robot-noindex: yes
    robot-nofollow: yes
    robot-host: www.goodlookingcooking.co.uk
    robot-from: no
    robot-useragent: yes
    robot-language: perl
    robot-description: Used to build the index for www.goodlookingcooking.co.uk.
    Seeks out cooking and recipe pages.
    robot-history: Private project september 2001
    robot-environment: hobby
    modified-date: Wed, 5 Sep 2001 19:21:00 GMT

  • robot-id: muscatferret
    robot-name: Muscat Ferret
    robot-cover-url: http://www.muscat.co.uk/euroferret/
    robot-details-url:
    robot-owner-name: Olly Betts
    robot-owner-url: http://www.muscat.co.uk/~olly/
    robot-owner-email: olly@muscat.co.uk
    robot-status: active
    robot-purpose: indexing
    robot-type: standalone
    robot-platform: unix
    robot-availability: none
    robot-exclusion: yes
    robot-exclusion-useragent: MuscatFerret
    robot-noindex: yes
    robot-host: 193.114.89.*, 194.168.54.11
    robot-from: yes
    robot-useragent: MuscatFerret/<version>
    robot-language: c, perl5
    robot-description: Used to build the database for the EuroFerret
    <URL:http://www.muscat.co.uk/euroferret/>
    robot-history:
    robot-environment: service
    modified-date: Tue, 21 May 1997 17:11:00 GMT
    modified-by: olly@muscat.co.uk

  • robot-id: mwdsearch
    robot-name: Mwd.Search
    robot-cover-url: (none)
    robot-details-url: (none)
    robot-owner-name: Antti Westerberg
    robot-owner-url: (none)
    robot-owner-email: Antti.Westerberg@mwd.sci.fi
    robot-status: active
    robot-purpose: indexing
    robot-type: standalone
    robot-platform: unix (Linux)
    robot-availability: none
    robot-exclusion: yes
    robot-exclusion-useragent: MwdSearch
    robot-noindex: yes
    robot-host: *.fifi.net
    robot-from: no
    robot-useragent: MwdSearch/0.1
    robot-language: perl5, c
    robot-description: Robot for indexing finnish (toplevel domain .fi)
    webpages for search engine called Fifi.
    Visits sites in random order.
    robot-history: (none)
    robot-environment: service (+ commercial)mwd.sci.fi>
    modified-date: Mon, 26 May 1997 15:55:02 EEST
    modified-by: Antti.Westerberg@mwd.sci.fi

    Next Page

  • WEBMASTERS
    Search Engine Submit Global
    Web Hosting FAQ
    Web Hosting Glossary
    Search engine ranking tips
    Download free scripts
    Keyword Suggestion Tool
    Downloads
    Google Page Ranking
    Search Engine Analysis
    Robots Index
    Web Crawlers
    Affiliates
    WHOIS
    SUPPORT
    24/7 Help Desk
    Cpanel
    Contact
    WE RECOMMEND
       
    Dependable Linux Servers providing cheap web hosting worldwide
    INTRO | HOME | WEB HOSTING | DEDICATED SERVERS | DEDICATED SERVERS STOCK | NETWORK DIAGRAMM |WEB DESIGN | DOMAIN PARKING | FREE FLASH MENU GENERATORS | FREE GRAPHICS NAVBARS | DHTML/CSS CODE GENERATORS | JAVA SCRIPT CSS CODE GENERATORS | FREE SEARCH ENGINE SUBMISSION | WEB HOSTING F.A.Q | WEB HOSTING GLOSSARY | WEEKLY SEARCH ENGINE RANKING TIPS | DOWNLOAD FREE SCRIPTS & PROGRAMMS | SEARCH ENGINE ANALYSIS | SEARCH TERM SUGESSTION TOOL | TECH NEWS FEED | DOWNLOAD FREE HTML TOOLS | GOOGLE PAGE RANK TIPS | ROBOTS INDEX | WEB CRAWLERS | CPANEL DOCUMENTATION | TERMS OF USE | CONTACT | FORUMS
    © 2002 Hostsun™ All wrignts reserved

    Dedicated servers provider in Europe and Greece