Spider blocking in .htaccess
Hello
If I block spiders in the .htaccess file like this:
SetEnvIfNoCase User-Agent Googlebot bot
Deny from env=bot
WebLog Expert will still count them, right?
With the spiders forbidden, I get this in the summary:
Total Hits 18,531
Visitor Hits 14,565
Spider Hits 3,966
Total Bandwidth 232.10 MB
Visitor Bandwidth 229.34 MB
Spider Bandwidth 2.76 MB
The bandwidth statistics show that the spiders' hits are counted, even though the spiders do not actually access the pages.
If I allow spiders, their traffic is about the same as the visitors' traffic.
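As a rough check of the summary figures above, the average response size per hit can be worked out for each group. This is only a sketch: the variable names are made up for illustration, and it assumes the report's "MB" means 1024 * 1024 bytes. A very small average for spiders would be consistent with the blocked requests being logged as short error responses rather than full pages, though the summary alone does not confirm that.

```python
# Figures copied from the report summary above.
MB = 1024 * 1024  # assumption: the report uses binary megabytes

visitor_hits, visitor_mb = 14_565, 229.34
spider_hits, spider_mb = 3_966, 2.76


def avg_bytes_per_hit(hits, megabytes):
    """Average response size in bytes for one group of hits."""
    return megabytes * MB / hits


visitor_avg = avg_bytes_per_hit(visitor_hits, visitor_mb)
spider_avg = avg_bytes_per_hit(spider_hits, spider_mb)

print(f"visitors: {visitor_avg:,.0f} bytes/hit")
print(f"spiders:  {spider_avg:,.0f} bytes/hit")
```

With these numbers the spider average comes out far below the visitor average, which fits hits that are recorded in the log but never receive the page content.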
RewriteRule ^index.php/(.*)$ http://v-gornom.ru/$1 [R=301,L]
then those hits are not counted by WLE
For example, I have a lot of lines like these in my log file, but no "/kanaly/" pages in the report at all. Why is that?
185.26.122.23 - - [22/Jan/2018:02:10:05 +0300] "GET /index.php/kanaly/15-kholodnyj-belok?format=feed&type=rss&rss_fulltext=1 HTTP/1.0" 301 296 "-" "-" www.altai-info.ga
185.26.122.23 - - [22/Jan/2018:02:10:05 +0300] "GET /kanaly/15-kholodnyj-belok?format=feed&type=rss&rss_fulltext=1 HTTP/1.0" 200 119767 "-" "-" v-gornom.ru
Actually, that's not correct, because the URL (page) is the whole string: /index.php/kanaly/15-kholodnyj-belok
Including "index.php" in the URL is just a Joomla-specific convention:
http://www.domain.com/kanaly/15-kholodnyj-belok/
http://www.domain.com/index.php/kanaly/15-kholodnyj-belok/
Maybe these two pages aren't among top 50 pages shown by default. You can change this number (up to 10000) in the program options ("Report | Contents" category) or profile properties. Just select an appropriate item and click the "Properties" button.
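To see how many hits such pages actually have in the raw log (and so whether they could plausibly fall outside the top 50), the lines can be tallied directly. The sketch below is not part of WebLog Expert; the regular expression and the `count_matching` helper are assumptions written for the log format of the two sample lines quoted earlier (combined format plus a trailing host name), and the query string is dropped so that only the page path is counted.

```python
import re
from collections import Counter

# Assumed layout: combined log format followed by an optional host field.
LOG_LINE = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<url>\S+) (?P<proto>[^"]+)" '
    r'(?P<status>\d{3}) (?P<size>\S+) '
    r'"(?P<referrer>[^"]*)" "(?P<agent>[^"]*)"(?: (?P<host>\S+))?'
)

# The two sample lines from the thread.
SAMPLE = [
    '185.26.122.23 - - [22/Jan/2018:02:10:05 +0300] "GET /index.php/kanaly/15-kholodnyj-belok?format=feed&type=rss&rss_fulltext=1 HTTP/1.0" 301 296 "-" "-" www.altai-info.ga',
    '185.26.122.23 - - [22/Jan/2018:02:10:05 +0300] "GET /kanaly/15-kholodnyj-belok?format=feed&type=rss&rss_fulltext=1 HTTP/1.0" 200 119767 "-" "-" v-gornom.ru',
]


def count_matching(lines, needle):
    """Count hits per (status, path) for paths containing `needle`."""
    counts = Counter()
    for line in lines:
        m = LOG_LINE.match(line)
        if m is None:
            continue  # skip lines that do not fit the assumed format
        path = m.group("url").split("?", 1)[0]  # drop the query string
        if needle in path:
            counts[(m.group("status"), path)] += 1
    return counts


counts = count_matching(SAMPLE, "/kanaly/")
for (status, path), n in sorted(counts.items()):
    print(status, path, n)
```

To run it against a real log, replace `SAMPLE` with the lines of the file, for example `open("access.log", encoding="utf-8", errors="replace")` (the file name is a placeholder).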