Search Engines Robots List
| Home page/search engine | Robot identifier | IP address(es) |
|---|---|---|
| www.abacho.com | AbachoBOT | srv-ze-robot1.tricus.com |
| www.aesop.com | AESOP_com_SpiderMan | 209.189.115.49 |
| www.ah-ha.com | ah-ha.com crawler (crawler@ah-ha.com) | c7pub-216-250-141-186.center7.com |
| www.alexa.com | ia_archiver | green.alexa.com, sarah.alexa.com |
| www.altavista.com | Scooter | test-scooter.pa.alta-vista.net, brillo.pa.alta-vista.net, av-dev4.pa.alta-vista.net, scooter.aveurope.co.uk, bigip1-snat.sv.av.com |
| | Mercator | mercator.pa-x.dec.com, scooter.pa.alta-vista.net, election2000crawl-complaints-to-admin.webresearch.pa-x.dec.com |
| | Scooter2_Mercator_3-1.0 | scooter.sv.av.com |
| | roach.smo.av.com-1.0 | avfwclient.sv.av.com |
| | Tv<nn>_Merc_resh_26_1_D-1.0 | tv<nn>.sv.av.com |
| www.altavista.co.uk | AltaVista-Intranet jan.gelin@av.com | host-119.altavista.se |
| www.alltheweb.com | FAST-WebCrawler crawler@fast.no | 209.67.247.154 |
| www.fast.no/faq/ | | |
| | Wget | ext-gw.trd.fast.no |
| www.acoon.de | Acoon Robot | 194.231.42.178 |
| www.atomz.com | Atomz | router-sc.atomz.com |
| www.batsch.com | spider.batsch.com | batsch.com |
| www.crawler.de | Crawler admin@crawler.de | crawlit.crawler.de |
| www.daum.net | RaBot Agent-admin/ phortse@hanmail.net | 210.183.28.46 |
| | contact/jylee@kies.co.kr | 211.50.57.6 |
| | RaBot Agent-admin/ webmaster@kisco.go.kr | 202.30.94.34 |
| www.domanova.co.uk | Jack www.domanova.co.uk/faq.html | |
| www.excite.com | ArchitextSpider | Host names are taken from musical instruments, such as viola.excite.com, cello.excite.com, piano.excite.com, kazoo.excite.com, ride.excite.com, sabian.excite.com, sax.excite.com, bugle.excite.com, snare.excite.com, ziljian.excite.com, bongos.excite.com, maturana.excite.com, mandolin.excite.com, piccolo.excite.com, kettle.excite.com, ichiban.excite.com (and the rest of the band). More recently, first names are being used, such as philip.excite.com, peter.excite.com, perdita.excite.com, macduff.excite.com, agouti.excite.com |
| (excite) | ArchitectSpider | crimpshrine.atext.com, ichiban.atext.com |
| www.euroseek.net | Arachnoidea arachnoidea@euroseek.net | 212.209.54.134 |
| www.ezresults.com | EZResult | 216.28.23.59 |
| www.fastsearch.net | Fast PartnerSite Crawler | psprdcrw001.sac2.fastsearch.net |
| www.findsame.com | DIIbot | 207.230.106.188 |
| (see also www.powerinter.net below) | robot@digital-integrity.com | |
| www.fireball.de | KIT-Fireball | ???? |
| www.geckobot.com | geckobot | ???.rdc1.az.coxatwork.com |
| www.gendoor.com (Genealogical Search Engine) | GenCrawler | ???? |
| www.google.com | Googlebot googlebot@googlebot.com http://googlebot.com/ | c<nn>.googlebot.com |
| www.goo.ne.jp | moget/2.0 moget@goo.ne.jp | 202.229.31.13 |
| www.girafa.com | Aranha | Aranha.girafa.com |
| (inktomi) | Slurp.so/1.0 | q2004.inktomisearch.com |
| | slurp@inktomi.com | j5006.inktomisearch.com |
| (inktomi) | Slurp/2.0j | 202.212.5.34 |
| | slurp@inktomi.com www.inktomisearch.com | goo313.goo.ne.jp |
| (inktomi) | Slurp/2.0-KiteHourly slurp@inktomi.com; www.inktomi.com/slurp.html | y400.inktomi.com |
| (inktomi) | Slurp/2.0-OwlWeekly spider@aeneid.com www.inktomi.com/slurp.html | 209.185.143.198 |
| (inktomi) | Slurp/3.0-AU slurp@inktomi.com www.inktomisearch.com | j6000.inktomi.com |
| http://hoppa.com/ (need V5 browsers to view) | Toutatis 2.5-2 | tisnix.xs4all.nl |
| www.hubat.com | Hubater | 209.114.176.250 |
| www.almaden.ibm.com (research centre) | http://www.almaden.ibm.com/ | wfp2.almaden.ibm.com |
| www.incywincy.com | IncyWincy | 64.81.243.66 |
| www.infoseek.com | UltraSeek | cde2c923.infoseek.com, cde2c91f.infoseek.com |
| | InfoSeek Sidewinder | cca26215.infoseek.com |
| www.informatch.com | MP3Bot | 212.204.169.52 |
| www.ip3000.com | C-PBWF-ip3000.com-crawler ip3000.com-crawler | www.ip3000.com |
| www.lexis-nexis.com | LNSpiderguy | firewall5.lexis-nexis.com |
| www.looksmart.com | MantraAgent | fjupiter.looksmart.com |
| www.loopimprovements.com | NetResearchServer | leg-64-133-109-250-STK.sprinthome.com |
| (see also www.incywincy.com) | www.loopimprovements.com | |
| www.lycos.com | Lycos_Spider_(T-Rex) | bos-spider<n>.bos.lycos.com, 216.35.194.188 |
| www.mirago.co.uk | HenryTheMiragoRobot | 194.202.39.46 |
| www.northernlight.com | Gulliver | marvin.northernlight.com, taz.northernlight.com |
| www.portaljuice.com | PJspider | timber.nextopia.com |
| www.powerinter.net (but it won’t let us in) | DIIbot | node-d8e93393.powerinter.net |
| http://navi.ocn.ne.jp/ | nttdirectory_robot super-robot@super.navi.ocn.ne.jp | lilis00.navi.ocn.ne.jp |
| | griffon griffon@super.navi.ocn.ne.jp | lilis04.navi.ocn.ne.jp |
| www.maxbot.com | Spider/maxbot.com admin@maxbot.com | search.wport.com |
| ??? | various (fakes agent on each access) | pool0058.cvx2-bradley.dialup.earthlink.net |
| ??? | gazz/1.0 | deleuze.infobee.ne.jp |
| | gazz@nttrd.com | derrida.infobee.ne.jp |
| ??? | ??? | search-8.xift.com |
| www.nationaldirectory.com | NationalDirectory-SuperSpider | spider.nationaldirectory.com, 209.116.58.143 |
| www.naver.com | dloader(NaverRobot)/ dumrobo(NaverRobot)/ | 211.218.151.209 |
| www.openfind.com | Openfind piranha,Shark | ??? |
| (Chinese language) | robot-response@openfind.com.tw | |
| www.picsearch.org | psbot www.picsearch.org/bot.html | 217.75.104.26 |
| www.pinpoint.com | CrawlerBoy Pinpoint.com | nitrogen.pinpoint.com |
| www.petersnews.com | user<n>.ip3000.com | news<n>.petersnews.com |
| www.vestris.com/alkaline | AlkalineBOT | host130.uv-ray.com |
| www.searchhippo.com | Fluffy the spider (info@searchhippo.com) | 208.148.122.27 |
| www.singingfish.com | asterias | grouper.singingfish.com |
| www.speedfind.de | speedfind ramBot xtreme | BWEB.highway.telekom.at |
| www.s.u-tokyo.ac.jp | Kototoi/0.1 | crawler-red3.is.s.u-tokyo.ac.jp |
| www.surfnomore.com | Surfnomore Spider v1.1 | 165.90.194.245 |
| www.supersnooper.com | Robot@SuperSnooper.Com | 207.8.212.162 |
| www.teoma.com | teoma_agent1 teoma_admin@hawkholdings.com | 63.236.92.148 |
| www.travel-finder.com | ESISmartSpider | 202.46.33.15 |
| www.uksearcher.co.uk | UK Searcher Spider | - |
| www.walhello.com | appie | …speed.planet.nl |
| www.websmostlinked.com | Nazilla | - |
| www.webwombat.com.au | www.WebWombat.com.au | 202.139.99.131 |
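
Several entries in the IP address column use `<nn>` or `<n>` as a placeholder for a varying part of the host name. As a minimal sketch, assuming each placeholder means one or more decimal digits (the list does not say so explicitly), such entries could be turned into regular expressions and checked against a host name taken from a server log. The three entries are copied from the table; `HOST_PATTERNS`, `pattern_to_regex`, and `identify_robot` are hypothetical names chosen for illustration.

```python
import re

# Three rows copied from the table above: robot identifier -> host pattern.
# Assumption: "<nn>" and "<n>" each stand for one or more decimal digits.
HOST_PATTERNS = {
    "Googlebot": "c<nn>.googlebot.com",
    "Tv<nn>_Merc_resh_26_1_D-1.0": "tv<nn>.sv.av.com",
    "Lycos_Spider_(T-Rex)": "bos-spider<n>.bos.lycos.com",
}


def pattern_to_regex(pattern):
    """Turn a host pattern with <n>/<nn> placeholders into a compiled regex."""
    # Split on the placeholders, escape the literal pieces, rejoin with \d+.
    literal_parts = re.split(r"<n+>", pattern)
    body = r"\d+".join(re.escape(part) for part in literal_parts)
    return re.compile(body, re.IGNORECASE)


def identify_robot(hostname):
    """Return the robot identifier whose pattern matches the whole host name."""
    for robot, pattern in HOST_PATTERNS.items():
        if pattern_to_regex(pattern).fullmatch(hostname):
            return robot
    return None


if __name__ == "__main__":
    for host in ("c12.googlebot.com", "tv07.sv.av.com", "example.org"):
        print(host, "->", identify_robot(host))
```

The match is applied to the whole host name, so a name that merely starts with a listed pattern, such as `c12.googlebot.com.example.org`, is not reported as a robot.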