Curious About PF Robots: Exploring Member/Guest/Robot Dynamics

  • Thread starter: anorlunda
anorlunda
The right column of the PF home page says "Robots 221, members 62, guests 1266" under "members online now". How interesting.

Greg,
Does a web crawler count as a robot?
How do you detect robots as opposed to guests?
What are those robots doing, and what motivates those who send them?
Does PF send robots to monitor other sites?

The number of guests is also remarkable. PF members need to be aware of that, especially in controversial or dangerous threads: 95% of those viewing PF are silent and not identifiable.
 
anorlunda said:
Does PF send robots to monitor other sites?
What do you think we mentors are? Humans?
 
Not Greg, but here are some answers:

Web crawlers count as robots, provided they identify themselves as robots (otherwise they are counted as guests). They crawl the forums for search engines and similar tools.
PF doesn't operate a search engine (outside the forums) or anything like that, so it has no need to crawl other websites.

Most visitors are guests, yes. Most of them come from search engines.
 
anorlunda said:
Does a web crawler count as a robot?
Yes.
How do you detect robots as opposed to guests?
Polite robots identify themselves in the User-Agent field of their requests to web servers. For example, here are some requests for the home page of my web site, from my server log:
Code:
141.8.184.11 - - [27/Dec/2016:17:44:52 -0500] "GET / HTTP/1.1" 200 4442 "-" "Mozilla/5.0 (compatible; YandexBot/3.0; +http://yandex.com/bots)"
40.77.167.20 - - [27/Dec/2016:18:14:05 -0500] "GET / HTTP/1.1" 200 4442 "-" "Mozilla/5.0 (compatible; bingbot/2.0; +http://www.bing.com/bingbot.htm)"
66.249.79.141 - - [27/Dec/2016:20:27:54 -0500] "GET / HTTP/1.1" 200 4442 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
77.75.77.109 - - [27/Dec/2016:22:47:29 -0500] "GET / HTTP/1.1" 304 - "-" "Mozilla/5.0 (compatible; SeznamBot/3.2; +http://napoveda.seznam.cz/en/seznambot-intro/)"
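To make the idea concrete, here is a rough Python sketch of how one might sort log lines like these into robots and guests. It is only an illustration under my own assumptions: the names `LOG_LINE`, `ROBOT_HINT` and `classify` are made up for this example, the "bot/crawler/spider" keyword test is a simple heuristic, and I don't know how the PF forum software actually does it. The second sample line (a desktop browser from a documentation-range IP address) is invented for contrast.

```python
import re

# Combined Log Format: the last quoted field is the User-Agent.
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) (?P<size>\S+) '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"\s*$'
)

# Assumed heuristic: self-identifying crawlers usually put "bot",
# "crawler" or "spider" somewhere in their User-Agent string.
ROBOT_HINT = re.compile(r'bot|crawler|spider', re.IGNORECASE)


def classify(line):
    """Return ('robot' or 'guest', user_agent), or None if the line does not parse."""
    match = LOG_LINE.match(line)
    if match is None:
        return None
    agent = match.group('agent')
    kind = 'robot' if ROBOT_HINT.search(agent) else 'guest'
    return kind, agent


sample = [
    # Taken from the log excerpt above.
    '141.8.184.11 - - [27/Dec/2016:17:44:52 -0500] "GET / HTTP/1.1" 200 4442 '
    '"-" "Mozilla/5.0 (compatible; YandexBot/3.0; +http://yandex.com/bots)"',
    # Invented browser request, for contrast only.
    '192.0.2.1 - - [27/Dec/2016:18:00:00 -0500] "GET / HTTP/1.1" 200 4442 '
    '"-" "Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:50.0) Gecko/20100101 Firefox/50.0"',
]

for line in sample:
    print(classify(line))
```

A robot that sends a browser-like User-Agent would be counted as a guest by this kind of check, which is why only "polite" robots are caught this way.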
What are those robots doing, and what motivates those who send them?
The ones shown above are for search engines. There are also companies that crawl the web and collect statistics that they sell to website owners, e.g. statistics about who links to your site, and what your site links to. There is at least one site (archive.org) that crawls the web in order to maintain a historical archive of the web, where you can look up what a website looked like in the past.
 
DrClaude said:
What do you think we mentors are? Humans?
I'm one-quarter lawn gnome on my mother's side.
 
